Article
Acoustics
Chenpeng Du, Kai Yu
Summary: This paper proposes a novel approach for generating natural speech by modeling phone-level prosodies using a GMM-based MDN and extending it for multi-speaker TTS. The experiments demonstrate that the proposed method achieves significantly better diversity and naturalness compared to a single Gaussian distribution, and can clone prosodies from a reference speech.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
(2022)
Article
Behavioral Sciences
Michal Icht, Gil Zukerman, Esther Ben-Itzchak, Boaz M. Ben-David
Summary: Individuals with autism spectrum disorder without intellectual disability (ASD-without-ID) do not significantly differ from matched controls in the identification of simple prosodic emotions, but perform worse in the identification of complex prosodic emotions. Intervention programs may focus on improving performance with complex emotions by leveraging intact abilities in processing simple emotions.
Article
Chemistry, Multidisciplinary
Jaeryoung Lee
Summary: The study explores the importance of affective speech in robotic applications, finding that affective voices can help users better understand information and can evoke corresponding negative emotions when conversing with negative voices.
APPLIED SCIENCES-BASEL
(2021)
Article
Multidisciplinary Sciences
Leonor Neves, Marta Martins, Ana Isabel Correia, Sao Luis Castro, Cesar F. Lima
Summary: The study found a significant association between emotion recognition in speech prosody and children's socio-emotional adjustment, particularly in relation to dimensions of prosocial behavior and cognitive and behavioral self-regulation. However, no associations were found for emotion recognition in non-verbal vocalizations or facial emotion recognition in relation to socio-emotional adjustment.
ROYAL SOCIETY OPEN SCIENCE
(2021)
Article
Neurosciences
Mathilde Marie Duville, Luz Maria Alonso-Valerdi, David I. Ibarra-Zarate
Summary: This study aims to improve socio-emotional impairments in autistic children by analyzing their neuronal activity related to emotional prosody discrimination. The research will create a speech database, extract acoustic features, and use Support Vector Machine to validate the speech corpus, ultimately developing intervention measures to help autistic children improve their affective prosodies discrimination.
FRONTIERS IN HUMAN NEUROSCIENCE
(2021)
Article
Multidisciplinary Sciences
Maria A. Di Biase, Ye Ella Tian, Richard A. I. Bethlehem, Jakob Seidlitz, Aaron. F. Alexander-Bloch, B. T. Thomas Yeo, Andrew Zalesky
Summary: Brain scans from large, diverse cohorts have helped establish normative brain aging charts, but it is unclear if these cross-sectional estimates accurately reflect longitudinal age-related brain changes. This study shows that cross-sectional brain charts substantially underestimate actual longitudinal brain changes. Additionally, individual brain aging trajectories vary greatly and are challenging to predict based on cross-sectional data. Overall, longitudinal measurements are crucial for understanding brain development and aging.
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
(2023)
Article
Biology
D. W. Kikuchi, K. Reinhold
Summary: The study models variability in animal behavior using bird migration timing as an example, finding stable sets of return dates through simulations. It suggests that individual variation is inversely related to fitness risks and positively related to territory inequality. This result is applicable across systems where competition can lead to a diversity of individual strategies, depending on the distribution of resources.
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES
(2021)
Article
Automation & Control Systems
K. C. Rajeswari, R. S. Mohana, S. Manikandan, S. Beski Prabaharan
Summary: Text-to-speech synthesis is gaining popularity and finding applications in various fields. This paper proposes a novel feature extraction technique based on special symbols in the text and cepstrum variation of the speech signal, which works well for real-time applications.
INTELLIGENT AUTOMATION AND SOFT COMPUTING
(2022)
Article
Linguistics
Xose A. Padilla
Summary: Understanding emotions in speech plays a crucial role in successful social interaction. Prosody has been identified as a key element in this process. By analyzing the behavior of acoustic magnitudes, it is possible to establish a relationship between the statement that triggers an emotional response and the response itself. The results show evidence of regularity and directionality in the behavior of F0 in syntagmatic relationships.
SPANISH IN CONTEXT
(2022)
Article
Neurosciences
Yehuda I. Dor, Daniel Algom, Vered Shakuf, Boaz M. Ben-David
Summary: Older adults and young adults have different ways of processing emotions in speech. These age-related changes affect all speech channels, and involve both sensory and cognitive factors.
FRONTIERS IN NEUROSCIENCE
(2022)
Article
Chemistry, Analytical
Jesin James, B. T. Balamurali, Catherine Watson, Hansjoerg Mixdorff
Summary: This study presents a low-resource emotional speech synthesis system based on modelling prosody features for empathetic speech synthesis. It focuses on modelling and synthesising secondary emotions, which are difficult to model compared to primary emotions. The research proposes a proof of concept using handcrafted feature extraction and a low-resource-intensive machine learning approach to create synthetic speech with secondary emotions. An emotional text-to-speech synthesis system is developed to synthesise five secondary emotions and a perception test is conducted, which shows a hit rate of over 65% in identifying the correct emotion in a forced response test.
Article
Astronomy & Astrophysics
E. Guise, S. F. Honig, V Gorjian, A. J. Barth, T. Almeyda, L. Pei, S. B. Cenko, R. Edelson, A. Filippenko, M. D. Joner, C. D. Laney, W. Li, M. A. Malkan, M. L. Nguyen, W. Zheng
Summary: Multiwavelength variability studies of active galactic nuclei reveal their inner regions. Dust reverberation mapping measurements show that different wavelengths are dominated by the same hot dust emission in Zw229-015. Monte Carlo simulations suggest that the dust is distributed in an extended flat disk with an inclination angle of approximately 49 degrees.
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
(2022)
Article
Psychology, Multidisciplinary
Eva Gal, Istvan Toth-Kiraly, Gabor Orosz
Summary: Fixed intelligence mindset promotes maladaptive emotional reactions, and self-esteem plays a mediating role in this relationship. Research has found that students endorsing fixed intelligence mindset experience higher levels of negative emotions in adverse academic situations, which is related to a decrease in self-esteem.
FRONTIERS IN PSYCHOLOGY
(2022)
Article
Multidisciplinary Sciences
Tomoya Nakai, Laura Rachman, Pablo Arias Sarah, Kazuo Okanoya, Jean-Julien Aucouturier
Summary: People have an advantage in identifying individuals and emotions in their own culture, known as the other-race and language-familiarity effect. In cross-cultural experiments, participants performed better in their native language when categorizing vocal emotional cues and detecting non-emotional pitch changes. These results suggest that unfamiliarity with the phonology of another language impairs the detection of pitch prosodic cues and the recognition of expressive prosody.
Article
Linguistics
Vass Verkhodanova, Matt Coler, Roel Jonkers, Sanne Timmermans, Natasha Maurits, Bauke de Jong, Wander Lowie
Summary: This study investigates the relationship between listeners' perceptual judgments of speech healthiness and the acoustic changes in the speech of people with Parkinson's disease. The findings show that regardless of listeners' expertise and language background, they are more sensitive to speech rate, phonation deficiency, and vowel centralization when classifying speech as healthy or unhealthy. These findings suggest that aspects of phonation and prosody serve as prominent markers of speech healthiness for listeners, independent of their first language or expertise. This has important implications for clinical practice and the subjective perception of speech in people with Parkinson's disease.
JOURNAL OF NEUROLINGUISTICS
(2022)
Article
Neurosciences
Peter Q. Pfordresher, Pauline Larrouy-Maestri
FRONTIERS IN HUMAN NEUROSCIENCE
(2015)
Article
Audiology & Speech-Language Pathology
Marie Reine Ayoub, Pauline Larrouy-Maestri, Dominique Morsomme
Article
Audiology & Speech-Language Pathology
Pauline Larrouy-Maestri, Dominique Morsomme
Article
Multidisciplinary Sciences
Pauline Larrouy-Maestri, David Magis, Matthias Grabenhorst, Dominique Morsomme
Article
Biology
Tina Roeske, Pauline Larrouy-Maestri, Yasuhiro Sakamoto, David Poeppel
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES
(2020)
Article
Audiology & Speech-Language Pathology
Pauline Larrouy-Maestri, Xinyue Wang, Renan Vairo Nunes, David Poeppel
Summary: This study examines the accuracy of professional singers' self-evaluations and finds that most singers overestimate their own pitch accuracy. It also reveals a relationship between singing proficiency and self-evaluation ability.
Article
Multidisciplinary Sciences
N. Holz, P. Larrouy-Maestri, D. Poeppel
Summary: The study demonstrates that listeners are good at inferring the intensity and arousal of vocalizations, but have difficulty categorizing peak emotions. It suggests that moderate and strong emotions are more easily classified, while peak emotions are ambiguous. This finding challenges existing theories on emotion communication.
SCIENTIFIC REPORTS
(2021)
Article
Psychology, Multidisciplinary
Cecilia Durojaye, Lauren Fink, Tina Roeske, Melanie Wald-Fuhrmann, Pauline Larrouy-Maestri
Summary: This study examines the classification of music and speech sounds performed by the dundun talking drum, revealing the importance of acoustic features and listener familiarity in distinguishing speech and music. The experiment showed that listeners were able to identify the intended category of the samples, with acoustic features such as intensity, pitch, timbre, and timing playing a significant role.
FRONTIERS IN PSYCHOLOGY
(2021)
Article
Psychology, Experimental
Pauline Larrouy-Maestri, Vanessa Kegel, Wolff Schlotz, Pol van Rijn, Winfried Menninghaus, David Poeppel
Summary: Prosodic stresses in speech can significantly impact the meaning of utterances. This study focuses on the mechanisms underlying the meaning effects of ironic prosody, which is commonly used in personal and mass-media communication. Through a series of experiments involving sentence interpretation, acoustic analysis, and participant ratings, the study reveals that ironic meaning is primarily conveyed by a shift in stress position within a sentence. This change in position serves as a cue for listeners to consider alternative meanings of the sentence, highlighting the importance of prosody in human communication.
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL
(2023)
Article
Psychology, Mathematical
Pauline Larrouy-Maestri, Peter M. C. Harrison, Daniel Mullensiefen
BEHAVIOR RESEARCH METHODS
(2019)
Article
Psychology, Multidisciplinary
Julia Merrill, Pauline Larrouy-Maestri
FRONTIERS IN PSYCHOLOGY
(2017)
Article
Music
Pauline Larrouy-Maestri, Dominique Morsomme, David Magis, David Poeppel
Article
Music
Pauline Larrouy-Maestri, David Magis, Dominique Morsomme