Dependency parsing of Turkish

COMPUTATIONAL LINGUISTICS (2008)

Journal

COMPUTATIONAL LINGUISTICS

Volume 34, Issue 3, Pages 357-389

Publisher

MIT PRESS

DOI: 10.1162/coli.2008.07-017-R1-06-83

Categories

Computer Science, Artificial Intelligence; Computer Science, Interdisciplinary Applications; Linguistics; Language & Linguistics

Funding

TUBITAK (The Scientific and Technical Research Council of Turkey)
Istanbul Technical University

Abstract

The suitability of different parsing methods for different languages is an important topic in syntactic parsing. Especially lesser-studied languages, typologically different from the languages for which methods have originally been developed, pose interesting challenges in this respect. This article presents an investigation of data-driven dependency parsing of Turkish, an agglutinative, free constituent order language that can be seen as the representative of a wider class of languages of similar type. Our investigations show that morphological structure plays an essential role in finding syntactic relations in such a language. In particular, we show that employing sublexical units called inflectional groups, rather than word forms, as the basic parsing units improves parsing accuracy. We test our claim on two different parsing methods, one based on a probabilistic model with beam search and the other based on discriminative classifiers and a deterministic parsing strategy, and show that the usefulness of sublexical units holds regardless of the parsing method. We examine the impact of morphological and lexical information in detail and show that, properly used, this kind of information can improve parsing accuracy substantially. Applying the techniques presented in this article, we achieve the highest reported accuracy for parsing the Turkish Treebank.
