4.6 Article

Exploiting Natural Language Structures in Software Informal Documentation

期刊

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING
卷 47, 期 8, 页码 1587-1604

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TSE.2019.2930519

关键词

Neon; Software; Linguistics; Pattern recognition; Documentation; Manuals; Mining unstructured data; natural language processing; empirical study

资金

  1. Swiss National Science foundation [200021-166275]
  2. Swiss National Science Foundation (SNF) [200021_166275] Funding Source: Swiss National Science Foundation (SNF)

向作者/读者索取更多资源

The study introduces NEON, a natural language processing-based software documentation analyzer, which automates the mining of patterns and rules, reducing the manual effort required for tagging. Through analysis and classification of mobile app reviews, developers discussions, and issues, NEON demonstrates excellent performance and significant time savings.
Communication means, such as issue trackers, mailing lists, Q&A forums, and app reviews, are premier means of collaboration among developers, and between developers and end-users. Analyzing such sources of information is crucial to build recommenders for developers, for example suggesting experts, re-documenting source code, or transforming user feedback in maintenance and evolution strategies for developers. To ease this analysis, in previous work we proposed Development Emails Content Analyzer (DECA), a tool based on Natural Language Parsing that classifies with high precision development emails' fragments according to their purpose. However, DECA has to be trained through a manual tagging of relevant patterns, which is often effort-intensive, error-prone and requires specific expertise in natural language parsing. In this paper, we first show, with an empirical study, the extent to which producing rules for identifying such patterns requires effort, depending on the nature and complexity of patterns. Then, we propose an approach, named Nlp-based softwarE dOcumentation aNalyzer (NEON), that automatically mines such rules, minimizing the manual effort. We assess the performances of NEON in the analysis and classification of mobile app reviews, developers discussions, and issues. NEON simplifies the patterns identification and rules definition processes, allowing a savings of more than 70 percent of the time otherwise spent on performing such activities manually. Results also show that NEON-generated rules are close to the manually identified ones, achieving comparable recall.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据