4.7 Article

Spam community detection & influence minimization using NRIM algorithm

Journal

COMPUTERS IN HUMAN BEHAVIOR
Volume 147, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.chb.2023.107832

Keywords

Big data; Spam; Graph database; Influence minimization; Neo4J; Twitter; NodeRank; communi

Ask authors/readers for more resources

This study uses social network analysis and visualization methods to examine interdisciplinary collaborations and emerging practices based on bibliometric data from the Web of Science. It also develops an algorithm for detecting social spam on Twitter and limiting its spread through influential users and communities, reducing the negative impact of spammers on the network.
Big Data is a research area where many different disciplines work together. Social media has grown in popularity as a tool for disseminating and gathering information. However, the success of social media like Twitter, Facebook, etc., has not only attracted genuine users but also spammers who utilize social graphs, famous phrases, and hashtags to spread malware. This study uses several social network analysis and visualization methods based on bibliometric data from the Web of Science to look at the structure and patterns of interdisciplinary collaborations and the latest emerging overall practice. For a better understanding of spamming behaviors on Twitter, the Twitter data set is thoroughly analyzed, and categorized into Spam and Non-Spam classifications. Earlier studies confined their scope to investigating the most negatively influential spammers by blocking the most influential spammers. However, the cumulative impact of other spammers having low individual negative influence values but higher impact values was neglected. In this article, we develop an algorithm for detecting social spam using Node Rank-based Influence Minimization (NRIM), which integrates Node Rank with the impact value of spam. The proposed spam influence minimization model also identifies spam-influential users and aids in limiting the flow of spam tweets within the Twitter network. Additionally, a detection algorithm for influential communities has been proposed to limit the spread of spam content through influential communities on the Twitter network. The primary focus of this paper is to reduce the spam impact on Twitter data by identifying influential spammers using the Node_Rank-based Influence Minimization (NRIM) algorithm. To begin, the tweets are classified into spam and non-spam using a machine learning algorithm. Furthermore, the spam observed in the Graph is analyzed, and the Spammer is passed through the NRIM algorithm to find the influential Spammers. In addition to this, the negative impact of the Spammer is reduced on the Twitter graph, and its impact is analyzed on query processing executed on Graph. The technique used for the minimization of the Spammer's negative effect on the graph reduces the query execution time by 12%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available