A PDTB-styled end-to-end discourse parser

NATURAL LANGUAGE ENGINEERING (2014)

Journal

NATURAL LANGUAGE ENGINEERING

Volume 20, Issue 2, Pages 151-184

Publisher

CAMBRIDGE UNIV PRESS

DOI: 10.1017/S1351324912000307

Categories

Computer Science, Artificial Intelligence; Linguistics; Language & Linguistics

Abstract

Since the release of the large discourse-level annotation of the Penn Discourse Treebank (PDTB), research has been carried out on certain subtasks of this annotation, such as disambiguating discourse connectives and classifying Explicit or Implicit relations. We see a need to construct a full parser on top of these subtasks and propose a way to evaluate the parser. In this work, we have designed and developed an end-to-end discourse parser to parse free texts in the PDTB style using a fully data-driven approach. The parser consists of multiple components joined in a sequential pipeline architecture, which includes a connective classifier, argument labeler, explicit classifier, non-explicit classifier, and attribution span labeler. Our trained parser first identifies all discourse and non-discourse relations, locates and labels their arguments, and then classifies the sense of the relation between each pair of arguments. For the identified relations, the parser also determines the attribution spans, if any, associated with them. We introduce novel approaches to locate and label arguments, and to identify attribution spans. We also significantly improve on the current state-of-the-art connective classifier. We propose and present a comprehensive evaluation from both component-wise and error-cascading perspectives, in which we illustrate how each component performs in isolation, as well as how the pipeline performs with errors propagated forward. The parser gives an overall system F1 score of 46.80 percent for partial matching using gold standard parses, and 38.18 percent with full automation.
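To make the sequential pipeline architecture described in the abstract concrete, the following is a minimal sketch in Python. It is not the authors' implementation: the `Relation` record, the `Stage` signature, the `TOY_SENSES` lexicon, and the rule-based stage bodies are all hypothetical stand-ins for the trained classifiers, and the last two stages are pass-through placeholders. Only the five component names and their order are taken from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Relation:
    kind: str                          # "Explicit" or "Non-Explicit"
    connective: Optional[str] = None
    arg1: Optional[str] = None
    arg2: Optional[str] = None
    sense: Optional[str] = None
    attribution: Optional[str] = None


# A stage receives the raw text and the relations found so far, and returns
# the updated list. Because stages run in order, a mistake made by an early
# stage is passed on to every later one (error cascading).
Stage = Callable[[str, List[Relation]], List[Relation]]

# Hypothetical two-entry lexicon standing in for a trained model.
TOY_SENSES = {"because": "Contingency", "but": "Comparison"}


def connective_classifier(text: str, relations: List[Relation]) -> List[Relation]:
    # Toy rule: every lexicon word in the text counts as a discourse connective.
    words = text.lower().replace(",", " ").split()
    found = [Relation("Explicit", connective=w) for w in words if w in TOY_SENSES]
    return relations + found


def argument_labeler(text: str, relations: List[Relation]) -> List[Relation]:
    # Toy rule: Arg1 is the text before the connective, Arg2 the text after it.
    for rel in relations:
        idx = text.lower().find(rel.connective)
        rel.arg1 = text[:idx].strip(" ,.")
        rel.arg2 = text[idx + len(rel.connective):].strip(" ,.")
    return relations


def explicit_classifier(text: str, relations: List[Relation]) -> List[Relation]:
    # Assign a sense to each Explicit relation by lexicon lookup.
    for rel in relations:
        if rel.kind == "Explicit":
            rel.sense = TOY_SENSES[rel.connective]
    return relations


def non_explicit_classifier(text: str, relations: List[Relation]) -> List[Relation]:
    # Placeholder: adds no relations in this sketch.
    return relations


def attribution_span_labeler(text: str, relations: List[Relation]) -> List[Relation]:
    # Placeholder: leaves every attribution field unset in this sketch.
    return relations


PIPELINE: List[Stage] = [
    connective_classifier,
    argument_labeler,
    explicit_classifier,
    non_explicit_classifier,
    attribution_span_labeler,
]


def parse(text: str, stages: List[Stage] = PIPELINE) -> List[Relation]:
    relations: List[Relation] = []
    for stage in stages:
        relations = stage(text, relations)
    return relations
```

Calling `parse("The stock fell because earnings were weak.")` in this sketch yields a single Explicit relation whose connective is `because`, with `arg1` set to `The stock fell`, `arg2` set to `earnings were weak`, and sense `Contingency`. Under this design, a component-wise evaluation would feed a stage gold-standard input from the preceding stages, while an error-cascading evaluation would run `parse` end to end.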
