期刊
COMPUTERS & ELECTRICAL ENGINEERING
卷 66, 期 -, 页码 474-486出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compeleceng.2017.11.002
关键词
Arabic; Text; Classification; Linear discriminant analysis; Eigenvectors; Fisher
类别
资金
- Kuwait Foundation of Advancement of Science (KFAS), Research grant [P11418E001]
- Kuwait University Research Administration
Fisher's discriminant analysis; also called linear discriminant analysis (LDA), is a popular dimensionality reduction technique that is widely used for features extraction. LDA aims at finding an optimal linear transformation based on maximizing a class separability. Even though LDA shows useful results in various pattern recognition problems, such as face recognition, less attention has been devoted to employing this technique in Arabic information retrieval tasks. In particular, the sizable feature vectors in textual data enforces to implement dimensionality reduction techniques such as LDA. In this paper, we empirically investigated an LDA based method for Arabic text classification. We used a corpus that contains 2,000 documents belonging to five categories. The experimental results showed that the performance of semantic loss LDA based method was almost the same as the semantic rich singular value decomposition (SVD), and that is indication that LDA is a promising method for text mining applications. (C) 2017 Elsevier Ltd. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据