4.6 Article

A novel context-aware multimodal framework for persian sentiment analysis

Journal

NEUROCOMPUTING
Volume 457, Issue -, Pages 377-388

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2021.02.020

Keywords

Multimodal sentiment analysis; Persian sentiment analysis

Ask authors/readers for more resources

Recent studies show that utilizing multimodal data can effectively gauge public perception, and a Persian multimodal dataset and sentiment analysis framework are provided as research resources. Experimental results demonstrate that integrating multimodal features can enhance sentiment analysis performance.
Most recent works on sentiment analysis have exploited the text modality. However, millions of hours of video recordings posted on social media platforms everyday hold vital unstructured information that can be exploited to more effectively gauge public perception. Multimodal sentiment analysis offers an innovative solution to computationally understand and harvest sentiments from videos by contextually exploiting audio, visual and textual cues. In this paper, we, firstly, present a first of its kind Persian multimodal dataset comprising more than 800 utterances, as a benchmark resource for researchers to evaluate multimodal sentiment analysis approaches in Persian language. Secondly, we present a novel context-aware multimodal sentiment analysis framework, that simultaneously exploits acoustic, visual and textual cues to more accurately determine the expressed sentiment. We employ both decision level (late) and feature-level (early) fusion methods to integrate affective cross-modal information. Experimental results demonstrate that the contextual integration of multimodal features such as textual, acoustic and visual features deliver better performance (91.39%) compared to unimodal features (89.24%). (c) 2021 Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Physics, Multidisciplinary

A Novel Hierarchical Extreme Machine-Learning-Based Approach for Linear Attenuation Coefficient Forecasting

Giuseppe Varone, Cosimo Ieracitano, Aybike Ozyuksel Ciftcioglu, Tassadaq Hussain, Mandar Gogate, Kia Dashtipour, Bassam Naji Al-Tamimi, Hani Almoamari, Iskender Akkurt, Amir Hussain

Summary: The development of reinforced polymer composite materials has significantly influenced the problem of shielding high-energy photons in industrial and healthcare facilities. Machine learning approaches can be used to assess the gamma-ray shielding behavior of composites and provide an alternative to theoretical calculations. In this study, a dataset was developed and machine learning models were used to replicate the gamma-ray shielding characteristics of concrete.

ENTROPY (2023)

Article Engineering, Biomedical

Contactless WiFi Sensing and Monitoring for Future Healthcare-Emerging Trends, Challenges, and Opportunities

Yao Ge, Ahmad Taha, Syed Aziz Shah, Kia Dashtipour, Shuyuan Zhu, Jonathan Cooper, Qammer H. Abbasi, Muhammad Ali Imran

Summary: WiFi sensing has gained significant attention as a potential mechanism for remote monitoring of the aging population without deploying devices on users' bodies. It has the potential to detect critical events such as falls, sleep disturbances, and respiratory disorders. Unlike other sensing methods, WiFi technology is easy to implement and unobtrusive. This paper reviews the current state-of-the-art research on WiFi-based sensing and discusses its healthcare applications and open research challenges.

IEEE REVIEWS IN BIOMEDICAL ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Enhancing Arabic-text feature extraction utilizing label-semantic augmentation in few/zero-shot learning

Seham Basabain, Erik Cambria, Khalid Alomar, Amir Hussain

Summary: An increasing number of studies are using pre-trained language models to tackle few/zero-shot text classification problems. However, most of these studies fail to consider the semantic information embedded in the natural language class labels. This work demonstrates how label information can be leveraged to enhance feature representation in input texts, particularly in scenarios with scarce data resources and short texts lacking semantic information like tweets. The study also shows the effectiveness of zero-shot implementation in predicting new classes across different domains, achieving high accuracy in Arabic sarcasm detection.

EXPERT SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Virtual prompt pre-training for prototype-based few-shot relation extraction

Kai He, Yucheng Huang, Rui Mao, Tieliang Gong, Chen Li, Erik Cambria

Summary: This paper proposes a virtual prompt pre-training method that incorporates the virtual prompt into PLM parameters to achieve entity-relation-aware pre-training. The proposed method provides robust initialization for prompt encoding and avoids the labor-intensive and subjective issues in label word mapping and prompt template engineering.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Will Affective Computing Emerge From Foundation Models and General Artificial Intelligence? A First Evaluation of ChatGPT

Mostafa Amin, Erik W. Cambria, Bjorn Schuller

Summary: ChatGPT demonstrates the potential of general artificial intelligence capabilities and performs well across various natural language processing tasks. This study evaluates ChatGPT's text classification abilities for affective computing problems including personality prediction, sentiment analysis, and suicide tendency detection. Results show that task-specific RoBERTa models generally outperform other baselines, while ChatGPT performs decently and is comparable to Word2Vec and BoW baselines. ChatGPT exhibits robustness against noisy data, outperforming Word2Vec in such scenarios. The study concludes that ChatGPT is a good generalist model but not as specialized as task-specific models for optimal performance.

IEEE INTELLIGENT SYSTEMS (2023)

Article Chemistry, Analytical

A Novel Unsupervised Video Anomaly Detection Framework Based on Optical Flow Reconstruction and Erased Frame Prediction

Heqing Huang, Bing Zhao, Fei Gao, Penghui Chen, Jun Wang, Amir Hussain

Summary: This paper proposes a novel unsupervised learning framework for video anomaly detection in smart city surveillance applications, using a training model based on the Cloze Test strategy. By encoding motion and appearance information at an object level, the proposed method improves the accuracy of anomaly perception. Comparative experiments on benchmark datasets demonstrate high AUROC scores through the proposed method.

SENSORS (2023)

Article Computer Science, Artificial Intelligence

Can ChatGPT's Responses Boost Traditional Natural Language Processing?

Mostafa M. Amin, Erik Cambria, Bjoern W. Schuller

Summary: The employment of foundation models is expanding and ChatGPT has the potential to enhance existing NLP techniques with its novel knowledge.

IEEE INTELLIGENT SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

ECPEC: Emotion-Cause Pair Extraction in Conversations

Wei Li, Yang Li, Vlad Pandelea, Mengshi Ge, Luyao Zhu, Erik Cambria

Summary: The paper introduces a new task called emotion-cause pair extraction in conversations (ECPEC), which aims to extract pairs of emotional utterances and corresponding cause utterances in conversations. The utterance-level ECPEC task is more challenging as the distance between emotion and cause utterances is greater. The experimental results on the proposed ConvECPE dataset demonstrate the feasibility of the ECPEC task and the effectiveness of the framework.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

AutoML-Emo: Automatic Knowledge Selection Using Congruent Effect for Emotion Identification in Conversations

Dazhi Jiang, Runguo Wei, Jintao Wen, Geng Tu, Erik Cambria

Summary: Emotion recognition in conversations has wide applications in various fields. We propose an AutoML strategy based on emotion congruent effect to select suitable knowledge and models, and effectively capture context information and enhance external knowledge in conversations.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Editorial Material Computer Science, Artificial Intelligence

Guest Editorial Neurosymbolic AI for Sentiment Analysis

Frank Xing, Bjoern Schuller, Iti Chaturvedi, Erik Cambria, Amir Hussain

Summary: Neural network-based methods, such as word2vec and GPT-based models, have achieved significant progress in AI research, especially in handling large datasets. However, these methods lack in-depth understanding of the internal features and representations of the data, leading to various problems and concerns.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

Meta-Based Self-Training and Re-Weighting for Aspect-Based Sentiment Analysis

Kai He, Rui Mao, Tieliang Gong, Chen Li, Erik Cambria

Summary: The study proposes a meta-based self-training method for aspect-based sentiment analysis (ABSA). By generating pseudo-labels and controlling convergence rates, the method improves model performance and accuracy in fine-grained sentiment analysis.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

Article Computer Science, Artificial Intelligence

KnowleNet: Knowledge fusion network for multimodal sarcasm detection

Tan Yue, Rui Mao, Heng Wang, Zonghai Hu, Erik Cambria

Summary: Sarcasm detection is a challenging task in natural language processing, especially in the context of social media where sarcasm is prevalent. This paper proposes a novel model called KnowleNet that incorporates prior knowledge and cross-modal semantic contrast for multimodal sarcasm detection. By leveraging the ConceptNet knowledge base and utilizing contrastive learning, the model achieves state-of-the-art performance on benchmark datasets.

INFORMATION FUSION (2023)

Proceedings Paper Computer Science, Information Systems

The P vs. NP Problem and Attempts to Settle It via Perfect Graphs State-of-the-Art Approach

Maher Heal, Kia Dashtipour, Mandar Gogate

Summary: The P vs. NP problem is a major problem in computer science that has been open for almost 50 years. Its solution would have a tremendous impact on various fields including mathematics, cryptography, algorithm research, artificial intelligence, game theory, multimedia processing, philosophy, and economics. This work attempts to solve the maximum independent set problem in a polynomial time by transforming any graph into a perfect graph.

ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2 (2023)

Article Computer Science, Artificial Intelligence

Emotion-and-knowledge grounded response generation in an open-domain dialogue setting

Deeksha Varshney, Asif Ekbal, Erik Cambria

Summary: This paper focuses on the neural-based interactive dialogue system that aims to engage and retain humans in long-lasting conversations. It proposes a new neural generative model that combines step-wise co-attention, self-attention-based transformer network, and an emotion classifier to control emotion and knowledge transfer during response generation. The results from quantitative, qualitative, and human evaluation show that the proposed models can generate natural and coherent sentences, capturing essential facts with significant improvement over emotional content.

KNOWLEDGE-BASED SYSTEMS (2024)

Article Computer Science, Artificial Intelligence

MEGACare: Knowledge-guided multi-view hypergraph predictive framework for healthcare

Jialun Wu, Kai He, Rui Mao, Chen Li, Erik Cambria

Summary: Predicting a patient's future health condition is a trending topic in the intelligent medical field. This paper proposes a knowledge-guided predictive framework called MEGACare, which leverages multi-faceted medical knowledge and multi-view learning to enhance clinical prediction accuracy.

INFORMATION FUSION (2023)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)