☆ 4.2 Article

Sentence-Based Sentiment Analysis for Expressive Text-to-Speech

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2013)

期刊

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

卷 21, 期 2, 页码 223-233

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TASL.2012.2217129

关键词

Expressive text-to-speech (TTS) synthesis; feature engineering; sentiment analysis; text classification

类别

Acoustics Engineering, Electrical & Electronic

向作者/读者索取更多资源

Protocol

Reagent

摘要

Current research to improve state of the art Text-To-Speech (TTS) synthesis studies both the processing of input text and the ability to render natural expressive speech. Focusing on the former as a front-end task in the production of synthetic speech, this article investigates the proper adaptation of a Sentiment Analysis procedure (positive/neutral/negative) that can then be used as an input feature for expressive speech synthesis. To this end, we evaluate different combinations of textual features and classifiers to determine the most appropriate adaptation procedure. The effectiveness of this scheme for Sentiment Analysis is evaluated using the Semeval 2007 dataset and a Twitter corpus, for their affective nature and their granularity at the sentence level, which is appropriate for an expressive TTS scenario. The experiments conducted validate the proposed procedure with respect to the state of the art for Sentiment Analysis.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Acoustics

CAMNet: A controllable acoustic model for efficient, expressive, high-quality text-to-speech

Jesus Monge Alvarez, Holly Francois, Hosang Sung, Seungdo Choi, Jonghoon Jeong, Kihyun Choo, Kyoungbo Min, Sangjun Park

Summary: Spoken language is crucial for human-machine interaction, and text-to-speech (TTS) models are essential for efficient communication. CAMNet, based on deep convolutional TTS (DCTTS), offers controllable style transfer capabilities and allows for consistent control over expression, pitch, and speaking rate, while maintaining high-quality synthesized speech.

APPLIED ACOUSTICS (2022)

添加到收藏夹

Article Automation & Control Systems

Integrating color cues to improve multimodal sentiment analysis in social media

Jieyu An, Wan Mohd Nazmee Wan Zainon

Summary: Multimodal sentiment analysis is an important research area, especially in social media where emotions are expressed through text and images. This paper proposes a novel model called ICCI, which integrates color cues to improve sentiment analysis accuracy. The model extracts semantic and color features, and utilizes a cross-attention mechanism for feature interaction. Experimental results on benchmark datasets demonstrate the effectiveness of ICCI, outperforming existing methods with higher accuracy.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

添加到收藏夹

Article Chemistry, Multidisciplinary

HierTTS: Expressive End-to-End Text-to-Waveform Using a Multi-Scale Hierarchical Variational Auto-Encoder

Zengqiang Shang, Peiyang Shi, Pengyuan Zhang, Li Wang, Guangying Zhao

Summary: We propose a highly expressive end-to-end text-to-waveform generation model, which deeply couples the hierarchical properties of speech with hierarchical variational auto-encoders and models multi-scale latent variables. Our model performs closer to natural speech in prosody expressiveness and has better generative diversity.

APPLIED SCIENCES-BASEL (2023)

添加到收藏夹

Article Chemistry, Multidisciplinary

Re-Engineered Word Embeddings for Improved Document-Level Sentiment Analysis

Su Yang, Farzin Deravi

Summary: This paper proposes a novel re-engineering mechanism for generating word embeddings to enhance document-level sentiment analysis. By re-engineering the feature components of embedding vectors, the mechanism increases the between-class separation and leverages the informative content of the documents.

APPLIED SCIENCES-BASEL (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Evaluating feature combination strategies for hate-speech detection in Spanish using linguistic features and transformers

Jose Antonio Garcia-Diaz, Salud Maria Jimenez-Zafra, Miguel Angel Garcia-Cumbreras, Rafael Valencia-Garcia

Summary: The rise of social networks has allowed individuals with misogynistic, xenophobic, and homophobic views to spread hate-speech, causing harm to individuals or groups based on their gender, ethnicity, or sexual orientation. Automatic identification of hate-speech is challenging, especially in languages other than English. This study focuses on identifying hate-speech in Spanish and examines the most effective features and their combination for developing accurate systems.

COMPLEX & INTELLIGENT SYSTEMS (2023)

添加到收藏夹

Article Computer Science, Information Systems

A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning

Saadin Oyucu

Summary: The study developed a Turkish speech synthesis system using a deep learning approach to address the lack of corpus for Turkish TTS. Real users rated the quality of synthesized speech as 4.49 using Mean Opinion Score (MOS), and an objective evaluation obtained a score of 4.32. These findings represent the first documented deep learning and HiFi-GAN-based TTS system for Turkish TTS.

ELECTRONICS (2023)

添加到收藏夹

Article Chemistry, Multidisciplinary

Mitigating Class Imbalance in Sentiment Analysis through GPT-3-Generated Synthetic Sentences

Cici Suhaeni, Hwan-Seung Yong

Summary: This paper examines the effectiveness of the GPT-3 model in addressing imbalanced sentiment analysis, specifically focusing on the imbalanced Coursera online course review dataset. The study employs synthetic review generation and sentiment classification using nine models on both imbalanced and balanced datasets. The results show that high-quality synthetic reviews significantly enhance sentiment classification performance, with an average accuracy increase of approximately 12.76% on the balanced dataset. The study highlights the potential of the GPT-3 model as a feasible solution for data imbalance in sentiment analysis and provides significant insights for future research.

APPLIED SCIENCES-BASEL (2023)

添加到收藏夹

Article Computer Science, Information Systems

TF-TDA: A Novel Supervised Term Weighting Scheme for Sentiment Analysis

Arwa Alshehri, Abdulmohsen Algarni

Summary: In text classification tasks, feature representation and weighting schemes are important for classification performance. Traditional unsupervised term weighting (UTW) schemes, such as TF-IDF, are not sufficient for sentiment analysis (SA) tasks. This study proposes a novel supervised term weighting (STW) approach called TF-TDA, which categorizes extracted features into groups with different levels of discrimination and weights each group based on its contribution. Experimental results using four SA datasets show that TF-TDA outperforms two baseline term weighting approaches, with improvements in the F1 score ranging from 0.52% to 3.99%. Statistical tests confirm the significant improvement achieved by TF-TDA, with p-values ranging from 0.0000597 to 0.0455.

ELECTRONICS (2023)

添加到收藏夹

Review Computer Science, Artificial Intelligence

Multimodal sentimental analysis for social media applications: A comprehensive review

Ganesh Chandrasekaran, Tu N. Nguyen, D. Jude Hemanth

Summary: Sentiment analysis is crucial for identifying and classifying opinions on products or services, with traditional text-based methods no longer meeting the needs of analyzing multimodal data effectively.

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY (2021)

添加到收藏夹

Review Computer Science, Artificial Intelligence

Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam Language

M. P. Jasir, Kannan Balakrishnan

Summary: This article discusses the research progress of text-to-speech synthesis in English and prominent Indian languages, with a special focus on Malayalam. It emphasizes the importance of improving the naturalness of synthetic speech in multilingual countries.

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Duplicate product record detection engine for e-commerce platforms

Osman Semih Albayrak, Tevfik Aytekin, Tolga Ahmet Kalayci

Summary: Having a clean and standardized product catalog is crucial for e-commerce companies. This study introduces a novel duplicate record detection engine developed for an e-commerce company, Hepsiburada, and demonstrates its high precision in detecting duplicate product records.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

添加到收藏夹

Article Computer Science, Information Systems

Multi-layered perceptron based deep learning model for emotion extraction on monolingual text using intelligence feature engineering and filtering techniques

P. Kumaran, Rajeswari Sridhar, Hiran Nandy

Summary: Text Sentiment Analysis (TSA) for blogs on major microblogging platforms is important but challenging due to the complexity of natural language and the informal structure employed in short text like Twitter. In this proposed work, a MLP-SDLM model is used to concatenate data filtering and feature engineering approaches, and a K-map based technique is introduced to efficiently combine filtered and unfiltered textual and non-textual features. The proposed models outperform traditional machine learning and deep learning classifiers, achieving high accuracy rates of 95.13% for MLP-SDLM, 89.17% for K-map based technique, and 88.7% for MLP.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Multimodal Video Sentiment Analysis Using Deep Learning Approaches, a Survey

Sarah A. Abdu, Ahmed H. Yousef, Ashraf Salem

Summary: This research provides a comprehensive overview of the latest updates in the field of video sentiment analysis, categorizing thirty-five state-of-the-art models based on the architecture used in each model. It concludes that the most powerful architecture in multimodal sentiment analysis task is the Multi-Modal Multi-Utterance based architecture.

INFORMATION FUSION (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Generating effective label description for label-aware sentiment classification

Xiaofei Zhu, Zhanwang Peng, Jiafeng Guo, Stefan Dietze

Summary: Sentiment classification aims to predict the sentiment label for a given text. Recent research efforts have focused on incorporating matching clues between text words and class labels into the learning process. However, these methods heavily rely on label content availability and only capture label-specific signals to measure word contribution. In this paper, a novel framework called LGDSC is proposed, which generates an effective label description and utilizes a Dual-Channel Label-guided Attention Network (DLAN) to learn text representation from two different channels.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

添加到收藏夹

Review Chemistry, Multidisciplinary

Sentiment Analysis of Twitter Data

Yili Wang, Jiaxuan Guo, Chengsheng Yuan, Baozhu Li

Summary: Twitter Sentiment Analysis is an active subfield of text mining, which has attracted considerable interest among researchers. This research provides a comprehensive review of the latest developments in this area, including newly proposed algorithms and applications. The survey classifies each publication based on its significance to specific TSA methods and depicts the current research direction in the field of TSA.

APPLIED SCIENCES-BASEL (2022)

添加到收藏夹

Article Chemistry, Analytical

A WASN-Based Suburban Dataset for Anomalous Noise Event Detection on Dynamic Road-Traffic Noise Mapping

Rosa Ma Alsina-Pages, Ferran Orga, Francesc Alias, Joan Claudi Socoro

SENSORS (2019)

添加到收藏夹

Review Engineering, Electrical & Electronic

Review of Wireless Acoustic Sensor Networks for Environmental Noise Monitoring in Smart Cities

Francesc Alias, Rosa Ma. Alsina-Pages

JOURNAL OF SENSORS (2019)

添加到收藏夹

Article Acoustics

Anomalous events removal for automated traffic noise maps generation

Rosa Ma Alsina-Pages, Francesc Alias, Joan Claudi Socoro, Ferran Orga, Roberto Benocci, Giovanni Zambon

APPLIED ACOUSTICS (2019)

添加到收藏夹

Article Chemistry, Multidisciplinary

Glottal Source Contribution to Higher Order Modes in the Finite Element Synthesis of Vowels

Marc Freixes, Marc Arnela, Joan Claudi Socoro, Francesc Alias, Oriol Guasch

APPLIED SCIENCES-BASEL (2019)

添加到收藏夹

Article Computer Science, Information Systems

Parallel hierarchical architectures for efficient consensus clustering on big multimedia cluster ensembles

Xavier Sevillano, Joan Claudi Socoro, Francesc Alias

INFORMATION SCIENCES (2020)

添加到收藏夹

Article Chemistry, Analytical

Aggregate Impact of Anomalous Noise Events on the WASN-Based Computation of Road Traffic Noise Levels in Urban and Suburban Environments

Francesc Alias, Ferran Orga, Rosa Ma Alsina-Pages, Joan Claudi Socoro

SENSORS (2020)

添加到收藏夹

Article Acoustics

A unit selection text-to-speech-and-singing synthesis framework from neutral speech: proof of concept

Marc Freixes, Francesc Alias, Joan Claudi Socoro

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING (2019)

添加到收藏夹

Editorial Material Chemistry, Multidisciplinary

Editorial for Special Issue IberSPEECH2018: Speech and Language Technologies for Iberian Languages

Francesc Alias, Antonio Bonafonte, Antonio Teixeira

APPLIED SCIENCES-BASEL (2020)

添加到收藏夹

Article Chemistry, Analytical

WASN-Based Day-Night Characterization of Urban Anomalous Noise Events in Narrow and Wide Streets

Francesc Alias, Joan Claudi Socoro, Rosa Ma Alsina-Pages

SENSORS (2020)

添加到收藏夹

Article Chemistry, Multidisciplinary

WASN-Based Spectro-Temporal Analysis and Clustering of Road Traffic Noise in Urban and Suburban Areas

Joan Claudi Socoro, Francesc Alias, Rosa Ma Alsina-Pages

Summary: This paper introduces a clustering method to analyze and categorize the spectro-temporal features of road traffic noise (RTN) collected. The results of the experiments show that the clustering solutions of RTN vary in different environments.

APPLIED SCIENCES-BASEL (2022)

添加到收藏夹

Article Chemistry, Multidisciplinary

Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels

Marc Freixes, Joan Claudi Socoro, Francesc Alias

Summary: This study analyzes the contribution of vocal tract and glottal source spectral cues in speech generation. The results show that vocal tract cues significantly contribute to the expression of happy and aggressive emotions for [a] vowels, while glottal source spectral cues significantly contribute to [u] vowels.

APPLIED SCIENCES-BASEL (2022)

添加到收藏夹

Article Acoustics

Effects of COVID-19 lockdown in Milan urban and Rome suburban acoustic environments: Anomalous noise events and intermittency ratio

Francesc Alias, Rosa Ma. Alsina-Pages

Summary: This study analyzes the impact of COVID-19 lockdown on the acoustic environment of Milan and Rome. In Rome, there is a significant increase in anomalous noise events (ANE) during the lockdown, especially at night and on weekends, despite a decrease in prominent events. In contrast, ANEs decrease during the lockdown in Milan, mostly during the daytime. The intermittency ratio (IR), representing the impact of noise on the population, significantly decreases in most sensing locations during the lockdown, indicating a reduction in the negative impact of noise.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA (2022)

添加到收藏夹

Article Chemistry, Multidisciplinary

Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels

Marc Freixes, Luis Joglar-Ongay, Joan Claudi Socoro, Francesc Alias-Pujol

Summary: Currently, articulatory-based three-dimensional source-filter models have limited expressiveness in producing vowels and diphtongs. Glottal inverse filtering (GIF) techniques can be used to identify specific characteristics of the glottal source signal and vocal tract transfer function, allowing for expressive speech synthesis. In this study, a two-phase analysis methodology is introduced for comparing GIF techniques based on a reference dataset. State-of-the-art GIF techniques based on iterative adaptive inverse filtering (IAIF) and quasi closed phase (QCP) approaches are evaluated on the OPENGLOT database, and the results show that QCP-based techniques outperform IAIF-based methods in most error metrics and scenarios.

APPLIED SCIENCES-BASEL (2023)

添加到收藏夹

Article Acoustics

Noise at the time of COVID 19: The impact in some areas in Rome and Milan, Italy

Rosa Maria Alsina Pages, Francesc Alias, Patrizia Bellucci, Pier Paolo Cartolano, Ilaria Coppa, Laura Peruzzi, Alessandro Bisceglie, Giovanni Zambon

NOISE MAPPING (2020)

添加到收藏夹

Proceedings Paper Computer Science, Theory & Methods

Remote Acoustic Monitoring System for Noise Sensing

Unai Hernandez-Jayo, Rosa Ma Alsina-Pages, Ignacio Angulo, Francesc Alias

ONLINE ENGINEERING & INTERNET OF THINGS (2018)

添加到收藏夹

暂无数据

© Peeref 2019-2024. All rights reserved.