4.4 Article

Semantically enhanced pseudo relevance feedback for Arabic information retrieval

Journal

JOURNAL OF INFORMATION SCIENCE
Volume 42, Issue 2, Pages 246-260

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/0165551515594722

Keywords

Arabic; information retrieval; pseudo relevance feedback; query expansion; semantic

Ask authors/readers for more resources

The conventional information retrieval (IR) framework consists of four primary phases, namely, pre-processing, indexing, querying and retrieving results. Some phases of the current Arabic IR (AIR) framework have several drawbacks. This research aims to enhance an AIR by improving the processes in a conventional IR framework. We introduce an enhanced stop-word list in the pre-processing level and investigate several Arabic stemmers. In addition, an Arabic WordNet was utilized in the corpus and query expansion levels. We also adopted semantic information for the Pseudo Relevance Feedback. The enhanced Arabic IR framework was built and evaluated using TREC 2001 data. The technique of using the Arabic WordNet to build a semantic relationship between query and corpus in two levels, that is, the corpus and query levels, is a new one. The enhanced AIR framework demonstrated an improvement by 49% in terms of mean average precision, with an increase of 7.3% in recall compared with the baseline framework.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available