4.2 Article

Exploiting Language Models to Classify Events from Twitter

Journal

COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE
Volume 2015, Issue -, Pages -

Publisher

HINDAWI LTD
DOI: 10.1155/2015/401024

Keywords

-

Funding

  1. Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Education, Science and Technology [2012R1A1A2006906]
  2. National Research Foundation of Korea [2012R1A1A2006906] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Classifying events is challenging in Twitter because tweets texts have a large amount of temporal data with a lot of noise and various kinds of topics. In this paper, we propose a method to classify events from Twitter. We firstly find the distinguishing terms between tweets in events and measure their similarities with learning language models such as ConceptNet and a latent Dirichlet allocation method for selectional preferences (LDA-SP), which have been widely studied based on large text corpora within computational linguistic relations. The relationship of term words in tweets will be discovered by checking them under each model. We then proposed a method to compute the similarity between tweets based on tweets' features including common term words and relationships among their distinguishing term words. It will be explicit and convenient for applying to k-nearest neighbor techniques for classification. We carefully applied experiments on the Edinburgh Twitter Corpus to show that our method achieves competitive results for classifying events.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available