4.7 Article

DistanceRank: An intelligent ranking algorithm for web pages

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 44, Issue 2, Pages 877-892

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2007.06.004

Keywords

web ranking; crawling; web graph; reinforcement learning

A fast and efficient page ranking mechanism for web crawling and retrieval remains a challenging issue. Recently, several link-based ranking algorithms such as PageRank, HITS and OPIC have been proposed. In this paper, we propose DistanceRank, a novel recursive method based on reinforcement learning that treats the distance between pages as a punishment and uses it to compute the ranks of web pages. The distance is defined as the average number of clicks between two pages. The objective is to minimize the punishment, or distance, so that a page with a smaller distance has a higher rank. Experimental results indicate that DistanceRank outperforms other ranking algorithms in page ranking and crawl scheduling. Furthermore, the complexity of DistanceRank is low. We used the University of California at Berkeley's web for our experiments. (c) 2007 Elsevier Ltd. All rights reserved.
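
The abstract does not state the update rule, so the following is only an illustrative sketch of a distance-as-punishment ranking, not the paper's implementation. It assumes that the cost of following a link from a page is the logarithm of that page's out-degree, that distances are updated with a learning rate `alpha` and a discount factor `gamma` in the style of a reinforcement-learning update, and that every page starts at a large initial distance. The function name, parameter values, and example graph are all hypothetical.

```python
import math


def distance_rank_sketch(out_links, alpha=0.5, gamma=0.5, iterations=50):
    """Return an estimated distance per page; a smaller distance means a higher rank.

    Assumptions (not taken from the abstract): link cost is log(out-degree),
    and the update is a learning-rate blend of the old distance with the
    cheapest discounted distance over in-linking pages.
    """
    # Collect every page that appears as a source or a target.
    pages = set(out_links)
    for targets in out_links.values():
        pages.update(targets)

    # Invert the graph so each page knows which pages link to it.
    in_links = {page: [] for page in pages}
    for source, targets in out_links.items():
        for target in targets:
            in_links[target].append(source)

    # Assumed initialisation: every page starts with a large distance.
    distance = {page: float(len(pages)) for page in pages}

    for _ in range(iterations):
        updated = {}
        for page in pages:
            if not in_links[page]:
                # Nothing links here, so the distance is never reduced.
                updated[page] = distance[page]
                continue
            # Cheapest way to reach this page through any in-linking page.
            best = min(
                math.log(len(out_links[source])) + gamma * distance[source]
                for source in in_links[page]
            )
            updated[page] = (1 - alpha) * distance[page] + alpha * best
        distance = updated
    return distance


# Hypothetical four-page graph: "orphan" has no incoming links.
graph = {
    "home": ["a", "b"],
    "a": ["home"],
    "b": ["home"],
    "orphan": ["home"],
}
distances = distance_rank_sketch(graph)
ranking = sorted(distances, key=distances.get)
print(ranking)
```

Under these assumptions, pages reachable through few low-branching links end up with small distances and sort to the front of `ranking`, while a page that nothing links to keeps its large initial distance and sorts last.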
