Article

TICS: text-image-based semantic CAPTCHA synthesis via multi-condition adversarial learning

VISUAL COMPUTER (2022)

Journal

VISUAL COMPUTER

Volume 38, Issue 3, Pages 963-975

Publisher

SPRINGER

DOI: 10.1007/s00371-021-02061-1

Keywords

Text-image-based CAPTCHA; security mechanism; Generative Adversarial Network; semantic image synthesis

Category

Computer Science, Software Engineering

Funding

Fundamental Research Funds for the Central Universities
Artificial Intelligence Research Foundation of Baidu Inc.
Zhejiang University and Cybervein Joint Research Lab
Zhejiang Natural Science Foundation [LY19F020051, R19F020009, LZ17F020001]
National Natural Science Foundation of China [61572431, U19B2042]
Key R&D Program of Zhejiang Province [2018C01006]
Program of China Knowledge Center for Engineering Sciences and Technology
Program of ZJU and Tongdun Joint Research Lab
Program of ZJU and Horizon Robotics Joint Research Lab
Joint Research Program of ZJU and Hikvision Research Institute
Major Scientific Research Project of Zhejiang Lab [2018EC0ZX01-1]
CAS Earth Science Research Project [XDA19020104]

AI Summary

This paper proposes a text-image-based CAPTCHA method that generates multi-conditional CAPTCHAs by synthesizing sentence, object, and location features. The resulting CAPTCHAs resist CNN classification attacks, and experiments confirm the method's effectiveness.

Abstract

CAPTCHA is used to distinguish humans from automated programs and plays an important role in multimedia security mechanisms. Traditional methods such as image-based and text-based CAPTCHA usually rely on word-level understanding, so they can be easily cracked given the recent success of deep learning techniques. To address this, the paper proposes a text-image-based CAPTCHA grounded in the cognition process and semantic reasoning, together with a novel model to generate it. The method synthesizes three features (sentence, object, and location) to generate a multi-conditional CAPTCHA that can resist CNN-based classification attacks. Extensive experiments show that a ResNet-50 classifier achieves only 3.38% accuracy on the proposed text-image-based CAPTCHA (TIC).

Recommended

Article Computer Science, Artificial Intelligence

Unsupervised text-to-image synthesis

Yanlong Dong, Ying Zhang, Lin Ma, Zhi Wang, Jiebo Luo

Summary: This paper presents an unsupervised training approach for text-to-image synthesis, which generates pseudo image-text pair data based on visual concepts to initialize a GAN model. The proposed method is able to generate high-quality images for given sentences without the need for human-labeled data.

PATTERN RECOGNITION (2021)

Article Computer Science, Information Systems

Multi-scale dual-modal generative adversarial networks for text-to-image synthesis

Bin Jiang, Yun Huang, Wei Huang, Chao Yang, Fangqiang Xu

Summary: This study proposes Multi-scale Dual-modal Generative Adversarial Networks (MD-GAN) for generating images from text descriptions. The method addresses two key issues in image generation, selectively aggregating channel information to adjust image texture and enhancing semantic consistency between text and images, using dual-modal modulation attention (DMA) and a multi-scale consistency discriminator (MCD).

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

DE-GAN: Text-to-image synthesis with dual and efficient fusion model

Bin Jiang, Weiyuan Zeng, Chao Yang, Renjun Wang, Bolin Zhang

Summary: In this paper, a novel Dual and Efficient Fusion Generative Adversarial Network (DE-GAN) is proposed to address the issues of limited diversity and high storage consumption in text-to-image synthesis. DE-GAN utilizes Dual Injection Blocks to balance the diversity and fidelity of generated images by injecting noise and text embeddings multiple times during the generation process. It also introduces an efficient condition channel attention module to capture correlations between text and image modalities with minimal storage overhead, enabling the model to adapt to resource-constrained applications. Comprehensive experiments demonstrate that DE-GAN efficiently generates more diverse and photo-realistic images compared to previous methods.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Review Computer Science, Artificial Intelligence

Adversarial text-to-image synthesis: A review

Stanislav Frolov, Tobias Hinz, Federico Raue, Joern Hees, Andreas Dengel

Summary: Text-to-image synthesis has made significant progress in recent years but still faces challenges that require further research and improvement. Areas of focus include enhancing evaluation metrics and datasets, as well as improving model training and design.

NEURAL NETWORKS (2021)

Article Computer Science, Theory & Methods

Gotta CAPTCHA 'Em All: A Survey of 20 Years of the Human-or-computer Dilemma

Meriem Guerar, Luca Verderame, Mauro Migliardi, Francesco Palmieri, Alessio Merlo

Summary: A recent study has shown that malicious bots generated a significant portion of website traffic in 2019, posing a serious threat to businesses. In order to combat these bots, introducing CAPTCHA tests has become a common defense mechanism. Therefore, understanding the effectiveness of different CAPTCHA schemes is crucial. This paper provides an overview of the current research in the field of CAPTCHA schemes and introduces a new classification. It also summarizes various attack methods and discusses the limitations of different CAPTCHA schemes.

ACM COMPUTING SURVEYS (2022)

Article Computer Science, Artificial Intelligence

Semantic Similarity Distance: Towards better text-image consistency metric in text-to-image generation

Zhaorui Tan, Xi Yang, Zihan Ye, Qiufeng Wang, Yuyao Yan, Anh Nguyen, Kaizhu Huang

Summary: This paper tackles the challenge of generating high-quality images from text in visual-language understanding and introduces a novel text-image consistency metric, Semantic Similarity Distance (SSD). It also proposes Parallel Deep Fusion Generative Adversarial Networks (PDF-GAN) to mitigate inconsistent semantics and bridge the text-image semantic gap. Experimental results demonstrate that, guided by SSD, PDF-GAN significantly enhances the consistency between texts and images while preserving acceptable image quality.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Generative adversarial network based on semantic consistency for text-to-image generation

Yue Ma, Li Liu, Huaxiang Zhang, Chunjing Wang, Zekang Wang

Summary: This paper proposes a novel generative adversarial network based on semantic consistency to generate semantically consistent and realistic images according to text descriptions. The method explores the semantic consistency between text and image for efficient cross-modal generation, and utilizes a generation network with hybrid attention to improve the authenticity of the generated images. Additionally, a semantic comparison module is introduced to compare the texts and generated images in the same semantic space through consistency refinement and information classification.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Information Systems

Stabilized Performance Maximization for GAN-based Real-Time Authentication Image Generation over Internet

Joo Yong Shim, Soyi Jung, Joongheon Kim, Jong-Kook Kim

Summary: This paper proposes an adaptive GAN selection scheme to generate new CAPTCHA images for enhanced security. The scheme focuses on maximizing image quality while ensuring system stability. Through performance evaluation, a trade-off between generation time and image quality is shown.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

Learning efficient text-to-image synthesis via interstage cross-sample similarity distillation

Fengling Mao, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen

Summary: This study proposes an interstage cross-sample similarity distillation model based on a generative adversarial network (GAN) for learning efficient text-to-image synthesis. Experimental results show that the model generates visually pleasing images and achieves quantitatively comparable performance with state-of-the-art methods.

SCIENCE CHINA-INFORMATION SCIENCES (2021)

Article Computer Science, Information Systems

Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis

Hongchen Tan, Xiuping Liu, Baocai Yin, Xin Li

Summary: This paper proposes Cross-modal Semantic Matching Generative Adversarial Networks (CSM-GAN) to improve the semantic consistency between text description and synthesized image in fine-grained text-to-image generation. Two new modules, Text Encoder Module (TEM) and Textual-Visual Semantic Matching Module (TVSMM), are introduced to increase semantic consistency in the global semantic embedding space. Thorough experiments show the superiority of CSM-GAN over other representative state-of-the-art methods.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

Article Computer Science, Artificial Intelligence

Text-to-image synthesis with self-supervised learning

Yong Xuan Tan, Chin Poo Lee, Mai Neo, Kian Ming Lim

Summary: Text-to-image synthesis is a technology that converts text descriptions into corresponding images and is widely used in various applications. However, this technology faces challenges such as visual realism, overconfidence, and training instability. To address these challenges, this paper proposes a self-supervised text-to-image synthesis method with enhancements including self-supervised learning, feature matching, L1 distance loss, and one-sided label smoothing. The proposed method generates images that are more diverse, visually realistic, and semantically consistent with the given text description.

PATTERN RECOGNITION LETTERS (2022)
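
Of the enhancements named in the summary above, one-sided label smoothing is the easiest to illustrate. The following is a minimal sketch of the general technique, not the authors' implementation: the function names and the smoothed target of 0.9 are assumptions chosen for illustration. Only the target for real images is softened, while the target for generated images stays at 0.

```python
import math


def bce(prediction: float, target: float) -> float:
    """Binary cross-entropy for one discriminator output in (0, 1)."""
    eps = 1e-12
    p = min(max(prediction, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


def discriminator_loss(real_preds, fake_preds, real_target=0.9):
    """Average discriminator loss with one-sided label smoothing.

    real_target=0.9 is an assumed value. The fake-image target is
    deliberately left at exactly 0 (that is the "one-sided" part).
    """
    real_loss = sum(bce(p, real_target) for p in real_preds) / len(real_preds)
    fake_loss = sum(bce(p, 0.0) for p in fake_preds) / len(fake_preds)
    return real_loss + fake_loss
```

With a smoothed target, the loss on real images is lowest when the discriminator outputs the target value itself rather than a value close to 1, which discourages overconfident predictions.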

Article Computer Science, Artificial Intelligence

KT-GAN: Knowledge-Transfer Generative Adversarial Network for Text-to-Image Synthesis

Hongchen Tan, Xiuping Liu, Meng Liu, Baocai Yin, Xin Li

Summary: This paper presents a new framework called Knowledge-Transfer Generative Adversarial Network (KT-GAN) for fine-grained text-to-image generation. By introducing Alternate Attention-Transfer Mechanism (AATM) and Semantic Distillation Mechanism (SDM), the framework successfully bridges the cross-domain gap between text and image, achieving better text features and higher-quality images. Extensive experimental validation on two public datasets demonstrates that KT-GAN outperforms the baseline method significantly and achieves competitive results over various evaluation metrics.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Image manipulation with natural language using Two-sided Attentive Conditional Generative Adversarial Network

Dawei Zhu, Aditya Mogadala, Dietrich Klakow

Summary: This paper introduces a method for manipulating images with natural language descriptions. It designs TEA-cGAN to generate semantically manipulated images, with two architectures that attend to the locations needing modification during generation and produce higher-resolution images.

NEURAL NETWORKS (2021)

Article Computer Science, Software Engineering

SWF-GAN: A Text-to-Image model based on sentence-word fusion perception

Chun Liu, Jingsong Hu, Hong Lin

Summary: This paper proposes SWF-GAN to synthesize images from descriptive text, addressing two problems: the limited constraint provided by coarse-grained information and the insufficient representational capacity of ordinary mask predictors. SWF-GAN uses a sentence-word fusion perceptual module to accurately generate the structure of the target object. Experiments show that SWF-GAN generates clearer and more vivid images.

COMPUTERS & GRAPHICS-UK (2023)

Article Computer Science, Artificial Intelligence

Semantic Object Accuracy for Generative Text-to-Image Synthesis

Tobias Hinz, Stefan Heinrich, Stefan Wermter

Summary: This paper introduces a new model and an evaluation metric, Semantic Object Accuracy (SOA), that give a better evaluation of text-to-image models by checking that generated images match their captions. A user study confirms this.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Chemistry, Analytical

A Visual Analytics Approach for Station-Based Air Quality Data

Yi Du, Cuixia Ma, Chao Wu, Xiaowei Xu, Yike Guo, Yuanchun Zhou, Jianhui Li

SENSORS (2017)

Article Engineering, Biomedical

DeepSleepNet: A Model for Automatic Sleep Stage Scoring Based on Raw Single-Channel EEG

Akara Supratak, Hao Dong, Chao Wu, Yike Guo

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING (2017)

Article Computer Science, Theory & Methods

Dropping Activation Outputs With Localized First-Layer Deep Network for Enhancing User Privacy and Data Security

Hao Dong, Chao Wu, Zhen Wei, Yike Guo

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2018)

Article Engineering, Biomedical

Mixed Neural Network Approach for Temporal Sleep Stage Classification

Hao Dong, Akara Supratak, Wei Pan, Chao Wu, Paul M. Matthews, Yike Guo

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING (2018)

Article Neurosciences

Using Support Vector Machine on EEG for Advertisement Impact Assessment

Zhen Wei, Chao Wu, Xiaoyi Wang, Akara Supratak, Pan Wang, Yike Guo

FRONTIERS IN NEUROSCIENCE (2018)

Article Green & Sustainable Science & Technology

Temporal sustainability efficiency analysis of urban areas via Data Envelopment Analysis and the hypervolume indicator: Application to London boroughs

C. Pozo, P. Limleamthong, Y. Guo, T. Green, N. Shah, S. Acha, A. Sawas, C. Wu, M. Siegert, G. Guillen-Gosalbez

JOURNAL OF CLEANER PRODUCTION (2019)

Article Mathematical & Computational Biology

Design of Deep Learning Model for Task-Evoked fMRI Data Classification

Xiaojie Huang, Jun Xiao, Chao Wu

Summary: This study proposes a model that uses deep neural networks to classify task states from fMRI data by simultaneously utilizing spatial and temporal information. By adding an attention mechanism to the recurrent network module, the model effectively highlights brain activation states at reaction moments, showing a high classification accuracy of 94.31% and the ability to distinguish brain states under different task stimuli.

COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2021)

Article Political Science

An early assessment of the County Medical Community reform in China: a case study of Zhejiang province

Chao Wu, Yixin Tu, Zexi Li, Jianxing Yu

Summary: The County Medical Community Reform, initiated in Anhui province and now widespread across China, has achieved noticeable results in governance, financing, creating resources, delivering services, and healthcare. During the COVID-19 outbreak, it has positively impacted the disease diagnosis and treatment in Zhejiang province’s primary health-care system. Lessons learned and recommendations for future development were summarized in an effort to optimize China’s primary health-care system.

JOURNAL OF CHINESE GOVERNANCE (2021)

Article Engineering, Electrical & Electronic

A framework for self-supervised federated domain adaptation

Bin Wang, Gang Li, Chao Wu, WeiShan Zhang, Jiehan Zhou, Ye Wei

Summary: This paper introduces a method called self-supervised federated domain adaptation (SFDA) to address the problem of distributed multi-source domain adaptation. SFDA aggregates models from multiple source domains in each round of communication using a multi-domain model generalization balance and a weighted strategy based on centroid similarity. It also tackles domain shift in the target domain through self-supervised training and improves model accuracy compared with the classical federated adversarial domain adaptation algorithm.

EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING (2022)

Article Computer Science, Hardware & Architecture

Shuhai: A Tool for Benchmarking High Bandwidth Memory on FPGAs

Hongjing Huang, Zeke Wang, Jie Zhang, Zhenhao He, Chao Wu, Jun Xiao, Gustavo Alonso

Summary: This article discusses the characterization of High Bandwidth Memory (HBM) on an FPGA and the development of a benchmarking tool called Shuhai to evaluate its performance and usage. The study found that HBM can provide high memory bandwidth, but its usage significantly affects the achievable throughput. By comparing it with other types of memory, a better understanding of HBM's performance characteristics can be obtained.

IEEE TRANSACTIONS ON COMPUTERS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Evaluate the Contribution of Multiple Participants in Federated Learning

Zhaoyang You, Xinya Wu, Kexuan Chen, Xinyi Liu, Chao Wu

Summary: This study designs a function to recalculate the Shapley value, overcoming the issues caused by data replication and dataset partition. The result is a performance improvement of about 50% compared with the original index.

DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II (2021)
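
For context on the Shapley value referred to in the summary above, here is a minimal sketch of the standard (unmodified) exact computation, not the recalculated index the paper proposes. The participants `A` and `B` and the `toy_accuracy` table are hypothetical values made up for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, utility):
    """Exact Shapley value of each player for a coalition utility function."""
    n = len(players)
    values = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):  # k = size of the coalition that p joins
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = frozenset(coalition)
                # Marginal contribution of p to coalition s.
                total += weight * (utility(s | {p}) - utility(s))
        values[p] = total
    return values


# Hypothetical utility: model accuracy reached by each coalition of participants.
toy_accuracy = {
    frozenset(): 0.0,
    frozenset({"A"}): 0.6,
    frozenset({"B"}): 0.4,
    frozenset({"A", "B"}): 0.9,
}
result = shapley_values(["A", "B"], toy_accuracy.__getitem__)
```

In this toy case the two values sum to the accuracy of the full coalition, which is the efficiency property that makes the Shapley value attractive for dividing credit among participants. The exact computation enumerates every coalition, so it is only practical for a small number of participants.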

Proceedings Paper Computer Science, Information Systems

Generating Computational Taxonomy for Business Models of the Digital Economy

Chao Wu, Yi Cai, Mei Zhao, Songping Huang, Yike Guo

DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2016 (2016)

Proceedings Paper Engineering, Biomedical

Integration of Sparse Bayesian Learning and Random Subspace for fMRI Multivariate Pattern Analysis

Shulin Yan, Xian Yang, Chao Wu, Yike Guo

2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC) (2014)

Proceedings Paper Computer Science, Artificial Intelligence

An Approximation Approach to Measurement Design in the Reconstruction of Functional MRI Sequences

Shulin Yan, Lei Nie, Chao Wu, Yike Guo

BRAIN AND HEALTH INFORMATICS (2013)
