4.6 Article

Semi-supervised Learning for Affective Common-Sense Reasoning

Journal

COGNITIVE COMPUTATION
Volume 9, Issue 1, Pages 18-42

Publisher

SPRINGER
DOI: 10.1007/s12559-016-9433-5

Keywords

Analogical reasoning; Sentiment analysis; Semi-supervised learning; Classification; Model selection; Extreme learning machines; Vapnik-Chervonenkis theory; Rademacher complexity; Algorithmic stability

Ask authors/readers for more resources

Big social data analysis is the area of research focusing on collecting, examining, and processing large multi-modal and multi-source datasets in order to discover patterns/correlations and extract information from the Social Web. This is usually accomplished through the use of supervised and unsupervised machine learning algorithms that learn from the available data. However, these are usually highly computationally expensive, either in the training or in the prediction phase, as they are often not able to handle current data volumes. Parallel approaches have been proposed in order to boost processing speeds, but this clearly requires technologies that support distributed computations. Extreme learning machines (ELMs) are an emerging learning paradigm, presenting an efficient unified solution to generalized feed-forward neural networks. ELM offers significant advantages such as fast learning speed, ease of implementation, and minimal human intervention. However, ELM cannot be easily parallelized, due to the presence of a pseudo-inverse calculation. Therefore, this paper aims to find a reliable method to realize a parallel implementation of ELM that can be applied to large datasets typical of Big Data problems with the employment of the most recent technology for parallel in-memory computation, i.e., Spark, designed to efficiently deal with iterative procedures that recursively perform operations over the same data. Moreover, this paper shows how to take advantage of the most recent advances in statistical learning theory (SLT) in order to address the issue of selecting ELM hyperparameters that give the best generalization performance. This involves assessing the performance of such algorithms (i.e., resampling methods and in-sample methods) by exploiting the most recent results in SLT and adapting them to the Big Data framework. The proposed approach has been tested on two affective analogical reasoning datasets. Affective analogical reasoning can be defined as the intrinsically human capacity to interpret the cognitive and affective information associated with natural language. In particular, we employed two benchmarks, each one composed by 21,743 common-sense concepts; each concept is represented according to two models of a semantic network in which common-sense concepts are linked to a hierarchy of affective domain labels. The labeled data have been split into two sets: The first 20,000 samples have been used for building the model with the ELM with the different SLT strategies, while the rest of the labeled samples, numbering 1743, have been kept apart as reference set in order to test the performance of the learned model. The splitting process has been repeated 30 times in order to obtain statistically relevant results. We ran the experiments through the use of the Google Cloud Platform, in particular, the Google Compute Engine. We employed the Google Compute Engine Platform with NM = 4 machines with two cores and 1.8 GB of RAM (machine type n1-highcpu-2) and an HDD of 30 GB equipped with Spark. Results on the affective dataset both show the effectiveness of the proposed parallel approach and underline the most suitable SLT strategies for the specific Big Data problem. In this paper we showed how to build an ELM model with a novel scalable approach and to carefully assess the performance, with the use of the most recent results from SLT, for a sentiment analysis problem. Thanks to recent technologies and methods, the computational requirements of these methods have been improved to allow for the scaling to large datasets, which are typical of Big Data applications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Review Computer Science, Theory & Methods

Human Movement Datasets: An Interdisciplinary Scoping Review

Temitayo Olugbade, Marta Bienkiewicz, Giulia Barbareschi, Vincenzo D'amato, Luca Oneto, Antonio Camurri, Catherine Holloway, Marten Bjorkman, Peter Keller, Martin Clayton, Amanda C. De C. Williams, Nicolas Gold, Cristina Becchio, Benoit Bardy, Nadia Bianchi-Berthouze

Summary: This paper presents a catalog of 704 open datasets that can be valuable to researchers searching for secondary data. The datasets are analyzed and reviewed under themes such as human diversity, ecological validity, and data recorded. The resulting 12-dimensional framework can guide researchers in planning the creation of open movement datasets.

ACM COMPUTING SURVEYS (2023)

Article Computer Science, Artificial Intelligence

Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning

Seham Basabain, Erik Cambria, Khalid Alomar, Amir Hussain

Summary: An increasing number of studies are using pre-trained language models to tackle few/zero-shot text classification problems. However, most of these studies fail to consider the semantic information embedded in the natural language class labels. This work demonstrates how label information can be leveraged to enhance feature representation in input texts, particularly in scenarios with scarce data resources and short texts lacking semantic information like tweets. The study also shows the effectiveness of zero-shot implementation in predicting new classes across different domains, achieving high accuracy in Arabic sarcasm detection.

EXPERT SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Virtual prompt pre-training for prototype-based few-shot relation extraction

Kai He, Yucheng Huang, Rui Mao, Tieliang Gong, Chen Li, Erik Cambria

Summary: This paper proposes a virtual prompt pre-training method that incorporates the virtual prompt into PLM parameters to achieve entity-relation-aware pre-training. The proposed method provides robust initialization for prompt encoding and avoids the labor-intensive and subjective issues in label word mapping and prompt template engineering.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Will Affective Computing Emerge From Foundation Models and General Artificial Intelligence? A First Evaluation of ChatGPT

Mostafa Amin, Erik W. Cambria, Bjorn Schuller

Summary: ChatGPT demonstrates the potential of general artificial intelligence capabilities and performs well across various natural language processing tasks. This study evaluates ChatGPT's text classification abilities for affective computing problems including personality prediction, sentiment analysis, and suicide tendency detection. Results show that task-specific RoBERTa models generally outperform other baselines, while ChatGPT performs decently and is comparable to Word2Vec and BoW baselines. ChatGPT exhibits robustness against noisy data, outperforming Word2Vec in such scenarios. The study concludes that ChatGPT is a good generalist model but not as specialized as task-specific models for optimal performance.

IEEE INTELLIGENT SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Multitask learning for multilingual intent detection and slot filling in dialogue systems

Mauajama Firdaus, Asif Ekbal, Erik Cambria

Summary: This research proposes a multilingual multitask approach that improves intent accuracy and slot filling for three different languages. The experimental results show an improvement in both tasks for all languages.

INFORMATION FUSION (2023)

Article Engineering, Civil

Surrogate models to unlock the optimal design of stiffened panels accounting for ultimate strength reduction due to welding residual stress

Andrea Coraddu, Luca Oneto, Shen Li, Miltiadis Kalikatzarakis, Olena Karpenko

Summary: In this paper, a three-step approach is developed for the optimal design of stiffened panels accounting for the ultimate limit state due to welding residual stress. State-of-the-art analytical approaches coupled with data-driven nonlinear finite element methods surrogates are used to find new optimal designs. The results obtained from optimizing a series of parameters of a commonly used stiffened panel geometry will support the authors' novel approach.

ENGINEERING STRUCTURES (2023)

Article Computer Science, Artificial Intelligence

Do we really need a new theory to understand over-parameterization?

Luca Oneto, Sandro Ridella, Davide Anguita

Summary: This century has witnessed a significant increase in investments in Artificial Intelligence (AI) and particularly in (Deep) Machine Learning (ML) in both public and private sectors. This has led to breakthroughs in solving complex real-world problems, but the increasing complexity of these breakthroughs has made it difficult to understand their fundamental mechanisms. Researchers are now questioning the need for a new theoretical framework to catch up with this complexity. One particular mechanism that is not well understood is over-parametrization, which refers to the ability of certain models to improve their generalization performance even when the number of parameters exceeds the interpolating threshold.

NEUROCOMPUTING (2023)

Review Engineering, Marine

A review on ship motions and quiescent periods prediction models

Giulia Cademartori, Luca Oneto, Federica Valdenazzi, Andrea Coraddu, Andrea Gambino, Davide Anguita

Summary: The prediction of ship motions and quiescent periods is crucial for the maritime industry to improve safety and efficiency of various marine operations. This review categorizes and examines the existing ship motion and quiescent period prediction models into physical, data-driven, and hybrid models. The advantages, disadvantages, open problems, and future perspectives of these models are discussed.

OCEAN ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

AutoML-Emo: Automatic Knowledge Selection Using Congruent Effect for Emotion Identification in Conversations

Dazhi Jiang, Runguo Wei, Jintao Wen, Geng Tu, Erik Cambria

Summary: Emotion recognition in conversations has wide applications in various fields. We propose an AutoML strategy based on emotion congruent effect to select suitable knowledge and models, and effectively capture context information and enhance external knowledge in conversations.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Editorial Material Computer Science, Artificial Intelligence

Guest Editorial Neurosymbolic AI for Sentiment Analysis

Frank Xing, Bjoern Schuller, Iti Chaturvedi, Erik Cambria, Amir Hussain

Summary: Neural network-based methods, such as word2vec and GPT-based models, have achieved significant progress in AI research, especially in handling large datasets. However, these methods lack in-depth understanding of the internal features and representations of the data, leading to various problems and concerns.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

Meta-Based Self-Training and Re-Weighting for Aspect-Based Sentiment Analysis

Kai He, Rui Mao, Tieliang Gong, Chen Li, Erik Cambria

Summary: The study proposes a meta-based self-training method for aspect-based sentiment analysis (ABSA). By generating pseudo-labels and controlling convergence rates, the method improves model performance and accuracy in fine-grained sentiment analysis.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

KnowleNet: Knowledge fusion network for multimodal sarcasm detection

Tan Yue, Rui Mao, Heng Wang, Zonghai Hu, Erik Cambria

Summary: Sarcasm detection is a challenging task in natural language processing, especially in the context of social media where sarcasm is prevalent. This paper proposes a novel model called KnowleNet that incorporates prior knowledge and cross-modal semantic contrast for multimodal sarcasm detection. By leveraging the ConceptNet knowledge base and utilizing contrastive learning, the model achieves state-of-the-art performance on benchmark datasets.

INFORMATION FUSION (2023)

Article Computer Science, Artificial Intelligence

Emotion-and-knowledge grounded response generation in an open-domain dialogue setting

Deeksha Varshney, Asif Ekbal, Erik Cambria

Summary: This paper focuses on the neural-based interactive dialogue system that aims to engage and retain humans in long-lasting conversations. It proposes a new neural generative model that combines step-wise co-attention, self-attention-based transformer network, and an emotion classifier to control emotion and knowledge transfer during response generation. The results from quantitative, qualitative, and human evaluation show that the proposed models can generate natural and coherent sentences, capturing essential facts with significant improvement over emotional content.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MEGACare: Knowledge-guided multi-view hypergraph predictive framework for healthcare

Jialun Wu, Kai He, Rui Mao, Chen Li, Erik Cambria

Summary: Predicting a patient's future health condition is a trending topic in the intelligent medical field. This paper proposes a knowledge-guided predictive framework called MEGACare, which leverages multi-faceted medical knowledge and multi-view learning to enhance clinical prediction accuracy.

INFORMATION FUSION (2023)

Article Computer Science, Artificial Intelligence

Template-Free Prompting for Few-Shot Named Entity Recognition via Semantic-Enhanced Contrastive Learning

Kai He, Rui Mao, Yucheng Huang, Tieliang Gong, Chen Li, Erik Cambria

Summary: In this paper, a prompt-based contrastive learning method is proposed for few-shot NER tasks. The method leverages external knowledge to initialize semantic anchors and optimizes prompts and sentence embeddings with a proposed semantic-enhanced contrastive loss. The method outperforms traditional contrastive learning methods in few-shot scenarios and effectively addresses the issues in conventional methods.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

No Data Available