4.5 Article

SCSMiner: mining social coding sites for software developer recommendation with relevance propagation

Journal

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS
Volume 21, Issue 6, Pages 1523-1543

Publisher

SPRINGER
DOI: 10.1007/s11280-018-0526-9

Keywords

SCSMiner; Social coding sites; Expert finding; Developer recommendation; Relevance propagation

Funding

  1. Natural Science Foundation of China [61379119, 61672453]
  2. Australia Research Council Linkage Project [LP140100937]

Ask authors/readers for more resources

With the advent of social coding sites, software development has entered a new era of collaborative work. Social coding sites (e.g., GitHub) can integrate social networking and distributed version control in a unified platform to facilitate collaborative developments over the world. One unique characteristic of such sites is that the past development experiences of developers provided on the sites convey the implicit metrics of developer's programming capability and expertise, which can be applied in many areas, such as software developer recruitment for IT corporations. Motivated by this intuition, we aim to develop a framework to effectively locate the developers with right coding skills. To achieve this goal, we devise a generativ e probabilistic expert ranking model upon which a consistency among projects is incorporated as graph regularization to enhance the expert ranking and a perspective of relevance propagation illustration is introduced. For evaluation, StackOverflow is leveraged to complement the ground truth of expert. Finally, a prototype system, SCSMiner, which provides expert search service based on a real-world dataset crawled from GitHub is implemented and demonstrated.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Information Systems

Be Causal: De-Biasing Social Network Confounding in Recommendation

Qian Li, Xiangmeng Wang, Zhichao Wang, Guandong Xu

Summary: In recommendation systems, the MNAR problem causes selection bias and affects recommendation performance. Existing approaches focus on modeling the exposure of missing entries but overlook the causal perspective. To address this, we propose a method called DENC (De-Bias Network Confounding in Recommendation) that provides causal analysis on MNAR using both inherent factors and auxiliary networks. DENC includes an exposure model to control confounding and a deconfounding model for balanced representation learning. Experiments on three datasets demonstrate that DENC outperforms state-of-the-art baselines.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2023)

Article Computer Science, Information Systems

Federated Learning for Privacy Preservation of Healthcare Data From Smartphone-Based Side-Channel Attacks

Abdul Rehman, Imran Razzak, Guandong Xu

Summary: Federated learning is a promising framework for preserving user data privacy in machine and deep learning models. This study proposes a novel framework based on federated learning to detect side-channel attacks in smartphone keyboard input, and experimental results demonstrate its effectiveness.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Article Computer Science, Artificial Intelligence

A Confusion Method for the Protection of User Topic Privacy in Chinese Keyword-based Book Retrieval

Zongda Wu, Jian Xie, Shigen Shen, Chongze Lin, Guandong Xu, Enhong Chen

Summary: In this article, a privacy protection solution is proposed for a Chinese keyword-based book search service. By modifying the user query sequence, the user query topics can be confused to protect the user's topic privacy on an untrusted server, while maintaining the accuracy of the book search service. A client-based framework and privacy model are introduced to guide the generation of cover queries based on a user query sequence. The proposed modification algorithm quickly generates cover queries that meet the privacy model by replacing, deleting, and adding keywords. The effectiveness of the approach is demonstrated through theoretical analysis and experimental evaluation, showing improved security for users' topic privacy without compromising efficiency, accuracy, and usability of the existing Chinese keyword book search service, thus contributing to the construction of a privacy-preserving text retrieval platform in an untrusted network environment.

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING (2023)

Editorial Material Computer Science, Information Systems

Trustworthy Recommendation and Search: Introduction to the Special Issue - Part 1

Hongzhi Yin, Yizhou Sun, Guandong Xu, Evangelos Kanoulas

ACM TRANSACTIONS ON INFORMATION SYSTEMS (2023)

Editorial Material Computer Science, Information Systems

Guest Editorial Open and Interpretable AI in Computational Pathology

Imran Razzak, Muhammad Khuram Khan, Guandong Xu, Fahmi Khalifa

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Editorial Material Computer Science, Information Systems

Trustworthy Recommendation and Search: Introduction to the Special Section - Part 2

Hongzhi Yin, Yizhou Sun, Guandong Xu, Evangelos Kanoulas

ACM TRANSACTIONS ON INFORMATION SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Achieving counterfactual fairness with imperfect structural causal model

Tri Dung Duong, Qian Li, Guandong Xu

Summary: Counterfactual fairness addresses discrimination between model predictions in the actual and counterfactual worlds. This research proposes a minimax game-theoretic model that achieves counterfactual fairness without strict assumptions on structural causal models.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Hyperbolic Neural Collaborative Recommender

Anchen Li, Bo Yang, Huan Huo, Hongxu Chen, Guandong Xu, Zhen Wang

Summary: Recently, deep learning techniques have achieved great success in recommender systems. However, most deep methods lack explicit extraction of mutual semantic relationships between users and items, which are latent in user-item interactions. Additionally, these methods primarily focus on representation learning in euclidean geometry, ignoring the non-euclidean latent anatomy of the bipartite graph structure. This work presents HNCR, a deep hyperbolic representation learning method that leverages mutual semantic relationships for collaborative filtering tasks, demonstrating superior performance compared to euclidean counterparts and state-of-the-art recommendation baselines.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Causal Disentanglement for Semantic-Aware Intent Learning in Recommendation

Xiangmeng Wang, Qian Li, Dianer Yu, Peng Cui, Zhichao Wang, Guandong Xu

Summary: This study proposes an unbiased and semantic-aware disentanglement learning approach from a causal perspective, which generates semantic-aware representations by disentangling users' true intents from specific item context, and designs a causal intervention mechanism to eliminate confounding bias.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Information Systems

Mitigating the performance sacrifice in DP-satisfied federated settings through graph contrastive learning

Haoran Yang, Xiangyu Zhao, Muyang Li, Hongxu Chen, Guandong Xu

Summary: This paper investigates the implementation of differential privacy (DP) on graph edges and observes a decrease in performance. The authors propose leveraging graph contrastive learning to alleviate the performance drop resulting from DP.

INFORMATION SCIENCES (2023)

Article Computer Science, Artificial Intelligence

Causal Optimal Transport for Treatment Effect Estimation

Qian Li, Zhichao Wang, Shaowu Liu, Gang Li, Guandong Xu

Summary: Treatment effect estimation is crucial for answering questions about the impact of specific treatments. However, treatment assignment bias is a fundamental issue in this research. To address this problem, the paper proposes a novel causal optimal transport (CausalOT) model that utilizes global information on observational covariates to mitigate limited overlapping, and designs a new counterfactual loss to improve estimation accuracy.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Review Computer Science, Cybernetics

Graph-Aware Deep Fusion Networks for Online Spam Review Detection

Li He, Guandong Xu, Shoaib Jameel, Xianzhi Wang, Hongxu Chen

Summary: This article introduces a graph-based model for detecting spam information in product reviews on e-commerce platforms. The model utilizes relevant metadata and relational data to capture semantic information in the review text, and synthesizes interactions at different levels through fusion techniques. Experimental results show that the proposed model outperforms other baselines.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Computer Science, Cybernetics

Dynamic Recommendation Based on Graph Diffusion and Ebbinghaus Curve

Zhihong Cui, Xiangguo Sun, Hongxu Chen, Li Pan, Lizhen Cui, Shijun Liu, Guandong Xu

Summary: Nowadays, dynamic recommendations struggle to find user online interest evolving patterns due to their complex interactions. Each interaction is influenced by multiple underlying reasons, so it is necessary to analyze each interaction instance individually. Additionally, traditional sequential models fail to personalize recommendations for users with different long-term and short-term tastes. In this article, a novel recommendation model based on Graph Diffusion and Ebbinghaus Curve is proposed, which explores underlying reasons for interactions and captures users' personalized strategies using a well-designed neural network and the Ebbinghaus Curve. Extensive experiments on real-world datasets confirm the superiority of the proposed model over existing baselines.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2023)

Article Computer Science, Hardware & Architecture

API Recommendation For Mashup Creation: A Comprehensive Survey

Hadeel Alhosaini, Sultan Alharbi, Xianzhi Wang, Guandong Xu

Summary: This paper provides a comprehensive review of API recommendation research for mashup creation. It compares different approaches and techniques, identifies challenges in current research, and suggests promising directions for future studies. It serves as a useful reference for researchers and practitioners in this field.

COMPUTER JOURNAL (2023)

Article Automation & Control Systems

Exploring explicit and implicit graph learning for multivariate time series imputation

Yakun Chen, Ruotong Hu, Zihao Li, Chao Yang, Xianzhi Wang, Guodong Long, Guandong Xu

Summary: This study proposes a novel approach to deal with missing values in multivariate time series. The approach combines graph and recurrent neural networks to capture dependencies between variables and time. By integrating external data sources, such as domain knowledge and implicit relationships between nodes, the ability to impute missing values is enhanced. Experimental results show that the model outperforms current state-of-the-art baselines in various industrial fields.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2024)

No Data Available