4.5 Article

Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning

期刊

EXPERT SYSTEMS
卷 40, 期 8, 页码 -

出版社

WILEY
DOI: 10.1111/exsy.13329

关键词

Arabic text classification; contextual embeddings; feature extraction; few; zero-shot learning; label semantics

向作者/读者索取更多资源

An increasing number of studies are using pre-trained language models to tackle few/zero-shot text classification problems. However, most of these studies fail to consider the semantic information embedded in the natural language class labels. This work demonstrates how label information can be leveraged to enhance feature representation in input texts, particularly in scenarios with scarce data resources and short texts lacking semantic information like tweets. The study also shows the effectiveness of zero-shot implementation in predicting new classes across different domains, achieving high accuracy in Arabic sarcasm detection.
A growing amount of research use pre-trained language models to address few/zero-shot text classification problems. Most of these studies neglect the semantic information hidden implicitly beneath the natural language names of class labels and develop a meta learner from the input texts solely. In this work, we demonstrate how label information can be utilized to extract enhanced feature representation of the input text from a Transformer-based pre-trained language model such as AraBERT. In addition, how this approach can improve performance when the data resources are scarce like in the Arabic language and the input text is short with little semantic information as is the case using tweets. The work also applies zero-shot text classification to predict new classes with no training examples across different domains including sarcasm detection and sentiment analysis using the information in the last layer of a trained classifier in a transfer learning setting. Experiments show that our approach has a better performance for the few-shot sentiment classification compared to baseline models and models trained without augmenting label information. Moreover, the zero-shot implementation achieved an accuracy up to 0.874 in Arabic sarcasm detection from a model trained on a sentiment analysis task.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

Multimodal sentiment analysis: A systematic review of history, datasets, multimodal fusion methods, applications, challenges and future directions

Ankita Gandhi, Kinjal Adhvaryu, Soujanya Poria, Erik Cambria, Amir Hussain

Summary: This survey paper explores the importance and recent advancements in sentiment analysis and multimodal sentiment analysis in the fields of artificial intelligence and natural language processing. It compares various fusion architectures in terms of MSA categories and presents interdisciplinary applications and future research directions.

INFORMATION FUSION (2023)

Article Computer Science, Artificial Intelligence

A Hotel Ranking Model Through Online Reviews with Aspect-Based Sentiment Analysis

Tian-Hui You, Ling-Ling Tao, Erik Cambria

Summary: This study proposes a hotel ranking model based on online textual reviews, considering the differences in the number of reviews on different aspects. The model utilizes sentiment analysis to assist tourists in making desirable decisions on hotel selection.

INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING (2023)

Article Computer Science, Artificial Intelligence

A Mixed Approach for Aggressive Political Discourse Analysis on Twitter

Javier Torregrosa, Sergio D'Antonio-Maceiras, Guillermo Villar-Rodriguez, Amir Hussain, Erik Cambria, David Camacho

Summary: Political tensions have increased in Europe since the beginning of the new century, leading to social movements and political changes in various countries. This study examines the political discourse and underlying tensions during Madrid's elections in May 2021, using a mixed methodology approach. The findings suggest that the electoral campaign is not as negative as perceived by the citizens, and that ideologically extreme parties tend to use more aggressive language.

COGNITIVE COMPUTATION (2023)

Article Computer Science, Artificial Intelligence

Multitask Balanced and Recalibrated Network for Medical Code Prediction

Wei Sun, Shaoxiong Ji, Erik Cambria, Pekka Marttinen

Summary: Human coders assign standardized medical codes to clinical documents, but it is prone to errors and requires significant effort. Automated medical coding methods using machine learning, such as deep neural networks, have been developed. However, challenges still exist due to code association complexity, noise in lengthy documents, and imbalanced class problem. In this study, we propose a novel neural network model called the Multitask Balanced and Recalibrated Neural Network to address these issues. Experiments on a real-world clinical dataset called MIMIC-III demonstrate that our model outperforms competitive baselines.

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

Time expression recognition and normalization: a survey

Xiaoshi Zhong, Erik Cambria

Summary: Time information is crucial in the fields of data mining, information retrieval, and natural language processing. Time expression recognition and normalization (TERN) serves as a fundamental task for other linguistic tasks. This survey reviews previous research, provides an overview of time expression analysis development, and explores the role of time expressions in different domains. Three methods for TERN development are discussed: rule-based, traditional machine-learning, and deep-learning. Additionally, useful datasets, software, and potential future research directions are outlined.

ARTIFICIAL INTELLIGENCE REVIEW (2023)

Article Computer Science, Artificial Intelligence

A survey on syntactic processing techniques

Xulang Zhang, Rui Mao, Erik Cambria

Summary: Computational syntactic processing is a fundamental technique in natural language processing that transforms natural language into structured texts with syntactic features. This work surveys low-level syntactic processing techniques such as normalization, sentence boundary disambiguation, part-of-speech tagging, text chunking, and lemmatization, categorizes widely used methods, investigates challenges, and proposes future research directions.

ARTIFICIAL INTELLIGENCE REVIEW (2023)

Article Computer Science, Artificial Intelligence

Survey on sentiment analysis: evolution of research methods and topics

Jingfeng Cui, Zhaoxia Wang, Seng-Beng Ho, Erik Cambria

Summary: Sentiment analysis, a research hotspot in natural language processing, has attracted significant attention and resulted in a growing number of research papers. Despite numerous literature reviews on sentiment analysis, there has been no dedicated survey examining the evolution of research methods and topics. This study fills this gap by conducting a comprehensive survey that combines keyword co-occurrence analysis and community detection algorithm. The survey compares and analyzes the connections between research methods and topics over the past two decades and uncovers hotspots and trends over time, providing valuable guidance for researchers. Furthermore, the paper offers practical insights, technical directions, limitations, and future research prospects in sentiment analysis.

ARTIFICIAL INTELLIGENCE REVIEW (2023)

Article Computer Science, Artificial Intelligence

Fusing topology contexts and logical rules in language models for knowledge graph completion

Qika Lin, Rui Mao, Jun Liu, Fangzhi Xu, Erik Cambria

Summary: Knowledge graph completion (KGC) is crucial for many downstream applications. Existing language model-based methods for KGC often overlook the importance of modeling the deeper semantic information, such as topology contexts and logical rules. In this paper, we propose a unified framework FTL-LM that effectively incorporates topology contexts and logical rules in language models, and experimental results demonstrate its superiority over the state-of-the-art methods.

INFORMATION FUSION (2023)

Article Computer Science, Information Systems

A survey on XAI and natural language explanations

Erik Cambria, Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, Navid Nobani

Summary: The field of explainable artificial intelligence (XAI) has gained increasing importance in recent years. However, existing research often overlooks the role of natural language in generating explanations. This survey reviews 70 XAI papers published between 2006 and 2021 and evaluates their readiness in terms of natural language explanations. The results show that only a few recent studies have considered using natural language for communication with end users or implemented methods for generating natural language explanations.

INFORMATION PROCESSING & MANAGEMENT (2023)

Article Computer Science, Information Systems

Incorporating Multiple Knowledge Sources for Targeted Aspect-based Financial Sentiment Analysis

Kelvin Du, Frank Xing, Erik Cambria

Summary: Combining symbolic and subsymbolic methods has emerged as a promising strategy in tackling increasingly complex AI research tasks. This study presents a targeted aspect-based financial sentiment analysis hybrid model that incorporates multiple lexical knowledge sources into the fine-tuning process of pre-trained transformer models. Experimental results demonstrate that knowledge-enabled models systematically improve aspect sentiment analysis performance and even outperform state-of-the-art results.

ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS (2023)

Article Computer Science, Cybernetics

Ensemble Hybrid Learning Methods for Automated Depression Detection

Luna Ansari, Shaoxiong Ji, Qian Chen, Erik Cambria

Summary: Changes in human lifestyle have led to an increase in depression cases. Automated detection methods are effective in identifying depressed individuals. Ensemble models outperform hybrid models for depression detection.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

A brief survey on recent advances in coreference resolution

Ruicheng Liu, Rui Mao, Anh Tuan Luu, Erik Cambria

Summary: The task of resolving repeated objects in natural languages, known as coreference resolution, is an important part of modern natural language processing. It is classified into entity coreference resolution and event coreference resolution based on the resolved objects. Predicting coreference connections and identifying mentions/triggers are the major challenges in coreference resolution due to the difficulty of implicit relationships in natural language understanding. In this survey, we review the current employed evaluation metrics, datasets, and methods, investigating 10 widely used metrics, 18 datasets, and 4 main technical trends. We believe that this work provides a comprehensive roadmap for understanding the past and the future of coreference resolution.

ARTIFICIAL INTELLIGENCE REVIEW (2023)

Article Computer Science, Artificial Intelligence

MiMuSA-mimicking human language understanding for fine-grained multi-class sentiment analysis

Zhaoxia Wang, Zhenda Hu, Seng-Beng Ho, Erik Cambria, Ah-Hwee Tan

Summary: This paper proposes a new explainable fine-grained multi-class sentiment analysis method called MiMuSA, which mimics human language understanding processes. It builds multiple knowledge bases to support sentiment understanding and can identify fine-grained multi-class sentiments. Experimental results show that MiMuSA outperforms other existing multi-class sentiment analysis methods in terms of accuracy and F1-Score.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Learning-Based Stock Trending Prediction by Incorporating Technical Indicators and Social Media Sentiment

Zhaoxia Wang, Zhenda Hu, Fang Li, Seng-Beng Ho, Erik Cambria

Summary: This research proposes a 3-phase hybrid model that utilizes both technical indicators and social media text sentiments as influence factors for stock trending prediction. The result shows that the proposed method has an accuracy of 73.41% and F1-score of 84.19%. The research not only demonstrates the merits of the proposed method, but also indicates that integrating social opinions with technical indicators is a right direction for enhancing the performance of learning-based stock market trending analysis methods.

COGNITIVE COMPUTATION (2023)

Article Computer Science, Artificial Intelligence

Emotion-and-knowledge grounded response generation in an open-domain dialogue setting

Deeksha Varshney, Asif Ekbal, Erik Cambria

Summary: This paper focuses on the neural-based interactive dialogue system that aims to engage and retain humans in long-lasting conversations. It proposes a new neural generative model that combines step-wise co-attention, self-attention-based transformer network, and an emotion classifier to control emotion and knowledge transfer during response generation. The results from quantitative, qualitative, and human evaluation show that the proposed models can generate natural and coherent sentences, capturing essential facts with significant improvement over emotional content.

KNOWLEDGE-BASED SYSTEMS (2024)

暂无数据