4.7 Article

Signaling sarcasm: From hyperbole to hashtag

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 51, Issue 4, Pages 500-509

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2014.07.006

Keywords

Social media; Automatic sentiment analysis; Opinion mining; Sarcasm; Verbal irony

Ask authors/readers for more resources

To avoid a sarcastic message being understood in its unintended literal meaning, in micro-texts such as messages on Twitter.com sarcasm is often explicitly marked with a hashtag such as '#sarcasm'. We collected a training corpus of about 406 thousand Dutch tweets with hashtag synonyms denoting sarcasm. Assuming that the human labeling is correct (annotation of a sample indicates that about 90% of these tweets are indeed sarcastic), we train a machine learning classifier on the harvested examples, and apply it to a sample of a day's stream of 2.25 million Dutch tweets. Of the 353 explicitly marked tweets on this day, we detect 309(87%) with the hashtag removed. We annotate the top of the ranked list of tweets most likely to be sarcastic that do not have the explicit hashtag. 35% of the top-250 ranked tweets are indeed sarcastic. Analysis indicates that the use of hashtags reduces the further use of linguistic markers for signaling sarcasm, such as exclamations and intensifiers. We hypothesize that explicit markers such as hashtags are the digital extralinguistic equivalent of non-verbal expressions that people employ in live interaction when conveying sarcasm. Checking the consistency of our finding in a language from another language family, we observe that in French the hashtag '#sarcasme' has a similar polarity switching function, be it to a lesser extent. (C) 2014 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Lexicon or grammar? Using memory-based learning to investigate the syntactic relationship between Belgian and Netherlandic Dutch

Robbert De Troij, Stefan Grondelaers, Dirk Speelman, Antal van den Bosch

Summary: This article uses computational tools to investigate the syntactic relationship between Belgian Dutch and Netherlandic Dutch, finding that lexical input works well for Netherlandic Dutch but not as effectively for Belgian Dutch.

NATURAL LANGUAGE ENGINEERING (2022)

Article Business

Linguistic elements of conversational human voice in online brand communication: Manipulations and perceptions

Christine Liebrecht, Christina Tsaousi, Charlotte van Hooijdonk

Summary: This study delves into the operationalization of CHV in online brand communication, presenting a taxonomy of linguistic elements related to message personalization, informal speech, and invitational rhetoric. It also discusses how these operationalizations contribute to consumers' perceptions of CHV and their evaluation regarding the message and the brand, while providing directions for future research and managerial implications.

JOURNAL OF BUSINESS RESEARCH (2021)

Article Linguistics

The importance of raising teachers' and students' awareness of pragmatics in German second language writing: a study of the effect of grammatical and lexical errors compared to pragma-linguistic infelicities

Antoinette Luijkx, Marinel Gerritsen, Margot van Mulken

Summary: The study examined the challenges Dutch learners face when using German in a professional context and how to maintain good relationships with native German speakers. The results showed that pragmalinguistic issues are more significant in German writing courses.

LANGUAGE AWARENESS (2022)

Article Linguistics

Generating hypotheses for alternations at low and intermediate levels of schematicity. The use of Memory-based Learning

Dirk Pijpops, Dirk Speelman, Antal van den Bosch

Summary: According to usage-based linguistics, language variation addresses the functional need of language users, which depends on the lexical realization of different constructions. This paper develops a data-driven approach to study language variation and applies it to investigate the Dutch naar-alternation.

LINGUISTICS VANGUARD (2022)

Article Communication

Responding to online complaints in webcare by public organizations: the impact on continuance intention and reputation

Sandra Jacobs, Christine Liebrecht

Summary: This study examines the impact of tone, response strategy, and user involvement in webcare on participants' continuance intention and perceptions of reputation in a public sector context. The results indicate that using a conversational human voice in webcare contributes to reputation management and increases continuance intention, while response strategy and user involvement have minimal impacts.

JOURNAL OF COMMUNICATION MANAGEMENT (2023)

Article Health Care Sciences & Services

Capturing Emerging Experiential Knowledge for Vaccination Guidelines Through Natural Language Processing: Proof-of-Concept Study

Lea Loesch, Teun Zuiderent-Jerak, Florian Kunneman, Elena Syurina, Marloes Bongers, Mart L. Stein, Michelle Chan, Willemine Willems, Aura Timen

Summary: This study explores the potential of artificial intelligence (AI)-based methods to capture experience-based knowledge and value considerations from existing data channels for guideline development. The findings show that natural language processing (NLP) methods can identify and analyze experience-based knowledge and provide valuable insights for guideline development. This knowledge can help identify problems with guideline application and contribute to the revision of guideline text.

JOURNAL OF MEDICAL INTERNET RESEARCH (2023)

Article Computer Science, Artificial Intelligence

Foundation models and the privatization of public knowledge

Fabian Ferrari, Jose van Dijck, Antal van den Bosch

Summary: In order to ensure the integrity of knowledge production, it is necessary to provide regulators and researchers with access to the training procedures of foundational models like GPT-4. Foundation models need to be open and accessible, although they are not synonymous.

NATURE MACHINE INTELLIGENCE (2023)

Article Communication

Conceptual similarity and visual metaphor: effects on viewing times, appreciation, and recall

Luuk Lagerwerf, Margot Van Mulken, Jefta B. Lagerwerf

Summary: The levels of conceptual similarity in equivalent visual structures can determine how meaning is attributed to images. Visual hyponyms are interpreted more quickly and appreciated more than visual metaphors and unrelated objects.

FRONTIERS IN COMMUNICATION (2023)

Article Communication

Observe, inspect, modify: Three conditions for generative AI governance

Fabian Ferrari, Jose van Dijck, Antal van den Bosch

Summary: The absence of benchmarks to examine the effectiveness of oversight mechanisms for generative AI systems is a problem for research and policy. This article introduces the conditions of industrial observability, public inspectability, and technical modifiability as structural elements for governing generative AI systems. These conditions are exemplified using the EU's AI Act, grounding the analysis of oversight mechanisms in the material properties of generative AI systems.

NEW MEDIA & SOCIETY (2023)

Article Linguistics

Supramodal Sentence Processing in the Human Brain: fMRI Evidence for the Influence of Syntactic Complexity in More Than 200 Participants

Julia Udden, Annika Hulten, Jan-Mathijs Schoffelen, Nietzsche Lam, Karin Harbusch, Antal van den Bosch, Gerard Kempen, Karl Magnus Petersson, Peter Hagoort

Summary: This study investigates the independence of sentence processing beyond single words and the network parts sensitive to syntactic complexity. The findings show that a left-hemisphere frontotemporoparietal network is supramodal and the left inferior frontal gyrus and the left posterior middle temporal gyrus are associated with different complexities. These findings have implications for neurobiological models of language processing.

NEUROBIOLOGY OF LANGUAGE (2022)

Article Computer Science, Artificial Intelligence

Unsupervised Text Segmentation Predicts Eye Fixations During Reading

Jinbiao Yang, Antal van den Bosch, Stefan L. Frank

Summary: This study investigates the cognitive units during reading and finds that model-segmented units predict eye fixations better than word units. The results support the theory that the mental lexicon stores not only words but also smaller and larger units.

FRONTIERS IN ARTIFICIAL INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

I Need a CAVAA: How Conversational Agent Voting Advice Applications (CAVAAs) Affect Users' Political Knowledge and Tool Experience

Naomi Kamoen, Christine Liebrecht

Summary: Research shows that VAA with a conversational agent function improves users' political knowledge and accuracy of answering questions compared to traditional VAA. The structured design of CAVAA provides a better user experience in terms of accessing additional information and evaluation.

FRONTIERS IN ARTIFICIAL INTELLIGENCE (2022)

Article Audiology & Speech-Language Pathology

Speech register influences listeners' word expectations

M. Bentum, L. Ten Bosch, A. van den Bosch, M. Ernestus

Summary: This study investigated the influence of speech register on predictive language processing using the N400 effect. The results show that the amplitude of the N400 is best predicted by register-specific word surprisal, indicating that the statistics of the wider context (i.e., register) influence predictive language processing. Furthermore, adaptation to speech register is not solely explained by recency effects; instead, listeners adjust their word anticipations based on the presented speech register.

BRAIN AND LANGUAGE (2022)

Article Communication

Sorry but no sorry: The use and effects of apologies in airline webcare responses to NeWOM messages of flight passengers

Charlotte van Hooijdonk, Christine Liebrecht

Summary: The study revealed that offering an apology is the most frequently used response strategy in webcare conversations, with accommodative strategies being more common than defensive ones. While the presence of an apology alone does not enhance brand reputation, a combination of defensive and accommodative strategies proves to be effective in protecting reputation.

DISCOURSE CONTEXT & MEDIA (2021)

Article Linguistics

Modeling the auxiliary phrase asymmetry in code-switched Spanish-English

Chara Tsoukala, Stefan L. Frank, Antal Van den Bosch, Jorge Valdes Kroff, Mirjam Broersma

Summary: Spanish-English bilinguals show an asymmetry in code-switching between the auxiliary verbs haber and estar, possibly due to the semantic weight of estar as a main verb. This was tested using a connectionist model that demonstrated the disappearance of the asymmetry when haber was used as a main verb, supporting the hypothesis of lack of semantic weight causing the asymmetry.

BILINGUALISM-LANGUAGE AND COGNITION (2021)

Article Computer Science, Information Systems

The social-technological ways to develop digital entrepreneurship: Targeting value creation and value capture

Sang-Bing Tsai, Xusen Cheng, Yanwu Yang, Jason Xiong, Alex Zarifis

Summary: This article structurally concludes the methods proposed and evidenced to develop digital entrepreneurship from a socio-technical perspective. The technology itself and the process of utilization should be carefully considered. From a social perspective, fulfilling the needs of customers in social interaction and nurturing characteristics and social skills for the digital work environment are crucial.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

NSEP: Early fake news detection via news semantic environment perception

Xiaochang Fang, Hongchen Wu, Jing Jing, Yihong Meng, Bing Yu, Hongzhu Yu, Huaxiang Zhang

Summary: This study proposes a novel fake news detection framework, utilizing news semantic environment perception (NSEP) to identify fake news content. The framework consists of steps such as dividing the semantic environment into macro and micro levels, applying graph convolutional networks, and utilizing multihead attention. Empirical experiments show that the NSEP framework achieves high accuracy in detecting Chinese fake news, outperforming other baseline methods and highlighting the importance of both micro and macro semantic environments in early detection of fake news.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

A scalable and flexible basket analysis system for big transaction data in Spark

Xudong Sun, Alladoumbaye Ngueilbaye, Kaijing Luo, Yongda Cai, Dingming Wu, Joshua Zhexue Huang

Summary: This paper proposes a scalable distributed frequent itemset mining (ScaDistFIM) algorithm to address the data scalability and flexibility issues in basket analysis in the big data era. Experiment results demonstrate that the ScaDistFIM algorithm is more efficient compared to the Spark FP-Growth algorithm.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

A T5-based interpretable reading comprehension model with more accurate evidence training

Boxu Guan, Xinhua Zhu, Shangbo Yuan

Summary: This paper aims to improve the interpretability of machine reading comprehension models by utilizing the pre-trained T5 model for evidence inference. They propose an interpretable reading comprehension model based on T5, which is trained on a more accurate evidence corpus and can infer precise interpretations for answers. Experimental results show that their model outperforms the baseline BERT model on the SQuAD1.1 task.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

STMAP: A novel semantic text matching model augmented with embedding perturbations

Yanhao Wang, Baohua Zhang, Weikang Liu, Jiahao Cai, Huaping Zhang

Summary: In this study, we propose a data augmentation-based semantic text matching model called STMAP. By using Gaussian noise and noise mask signal for data augmentation, as well as employing an adaptive optimization network for training target optimization, our model achieves good performance in few-shot learning and semantic deviation problems.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

An efficient loss function and deep learning approach for ranking stock returns in the absence of prior knowledge

Jiahao Yang, Shuo Feng, Wenkai Zhang, Ming Zhang, Jun Zhou, Pengyuan Zhang

Summary: To pursue profit from stock markets, researchers utilize deep learning methods to forecast asset price movements. However, there are two issues in current research, the discrepancy between forecasting results and profits, and heavy reliance on prior knowledge. To address these issues, researchers propose a novel optimization objective and modeling method, and conduct experiments to validate their approach.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Revealing the technology development of natural language processing: A Scientific entity-centric perspective

Heng Zhang, Chengzhi Zhang, Yuzhuo Wang

Summary: This study provides an accurate analysis of technology development in the field of Natural Language Processing (NLP) from an entity-centric perspective. The findings indicate an increase in the average number of entities per paper, with pre-trained language models becoming mainstream and the impact of Wikipedia dataset and BLEU metric continuing to rise. There has been a surge in popularity for new high-impact technologies in recent years, with researchers accepting them at an unprecedented speed.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Citation prediction by leveraging transformers and natural language heuristics

Davide Buscaldi, Danilo Dessi, Enrico Motta, Marco Murgia, Francesco Osborne, Diego Reforgiato Recupero

Summary: In scientific papers, citing other articles is a common practice to support claims and provide evidence. This paper proposes two automatic methods using Transformer models to address citation placement, and achieves significant improvements in experiments.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Data-driven analysis of digital entrepreneurship in medical supply resilience confronting the COVID-19 epidemic

Baozhuang Niu, Lingfeng Wang, Xinhu Yu, Beibei Feng

Summary: This paper examines whether the incumbent brand should adopt digital technology to forecast demand and adjust order decisions in the face of soaring demand for medical supply caused by frequent outbreaks of regional COVID-19 epidemic. The study finds that digital transformation can lead to a triple-win situation among the incumbent brand, social welfare, and consumer surplus, as well as bring benefits to the manufacturer. Furthermore, the research provides insights for firms' digital entrepreneurship decisions through theoretical optimization and data processing/policy simulation.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Multi-level knowledge-driven feature representation and triplet loss optimization network for image-text retrieval

Xueyang Qin, Lishang Li, Fei Hao, Meiling Ge, Guangyao Pang

Summary: Image-text retrieval is important in connecting vision and language. This paper proposes a method that utilizes prior knowledge to enhance feature representations and optimize network training for better retrieval results.

INFORMATION PROCESSING & MANAGEMENT (2024)

Review Computer Science, Information Systems

A co-attention based multi-modal fusion network for review helpfulness prediction

Gang Ren, Lei Diao, Fanjia Guo, Taeho Hong

Summary: This paper proposes a novel approach for predicting the helpfulness of reviews by utilizing both textual and image features. The proposed method considers the correlation between features through self-attention and co-attention mechanisms, and fuses multi-modal features for prediction. Experimental results demonstrate the superior performance of the proposed method compared to benchmark methods.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Retrieval Contrastive Learning for Aspect-Level Sentiment Classification

Zhongquan Jian, Jiajian Li, Qingqiang Wu, Junfeng Yao

Summary: Aspect-Level Sentiment Classification (ALSC) is a crucial challenge in Natural Language Processing (NLP). Most existing methods fail to consider the correlations between different instances, leading to a lack of global viewpoint. To address this issue, we propose a Retrieval Contrastive Learning (RCL) framework that extracts intrinsic knowledge across instances for improved instance representation. Experimental results demonstrate that training ALSC models with RCL leads to substantial performance improvements.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

A hierarchical convolutional model for biomedical relation extraction

Ying Hu, Yanping Chen, Ruizhang Huang, Yongbin Qin, Qinghua Zheng

Summary: Biomedical relation extraction aims to extract the interactive relations between biomedical entities in a sentence. This study proposes a hierarchical convolutional model to address the semantic overlapping and data imbalance problems. The model encodes both local contextual features and global semantic dependencies, enhancing the discriminability of the neural network for biomedical relation extraction.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

Topic Audiolization: A Model for Rumor Detection Inspired by Lie Detection Technology

Zhou Yang, Yucai Pang, Xuehong Li, Qian Li, Shihong Wei, Rong Wang, Yunpeng Xiao

Summary: This study proposes a rumor detection model based on topic audiolization, which transforms the topic space into audio-like signals. Experimental results show that the model achieves significant performance improvements in rumor identification.

INFORMATION PROCESSING & MANAGEMENT (2024)

Article Computer Science, Information Systems

User-oriented metrics for search engine deterministic sort orders

Alistair Moffat

Summary: This paper proposes the buying power metric for assessing the quality of product rankings on e-commerce sites. It discusses the relationship between the buying power metric and user reactions, and introduces an alternative product ranking effectiveness metric.

INFORMATION PROCESSING & MANAGEMENT (2024)