4.7 Article

Investigating queries and search failures in academic search

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 53, Issue 3, Pages 666-683

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2017.01.005

Keywords

-

Funding

  1. Ahold Delhaize
  2. Amsterdam Data Science, Blendle
  3. Bloomberg Research Grant program
  4. Dutch national program COMMIT
  5. Elsevier
  6. European Community's Seventh Framework Programme [312827]
  7. ESF Research Network Program ELIAS
  8. Royal Dutch Academy of Sciences (KNAW) under the Elite Network Shifts project
  9. Microsoft Research Ph.D. program
  10. Netherlands eScience Center [027.012.105]
  11. Netherlands Institute for Sound and Vision
  12. Netherlands Organisation for Scientific Research (NWO) [727.011.005, 612.001.116, HOR-11-10, 640.006.013, 612.066.930, CI-14-25, SH-322-15, 652.002.001, 612.001.551, 652.001.003]
  13. Yahoo Faculty Research and Engagement Program
  14. Yandex

Ask authors/readers for more resources

Academic search concerns the retrieval and profiling of information objects in the domain of academic research. In this paper we reveal important observations of academic search queries, and provide an algorithmic solution to address a type of failure during search sessions: null queries. We start by providing a general characterization of academic search queries, by analyzing a large-scale transaction log of a leading academic search engine. Unlike previous small-scale analyses of academic search queries, we find important differences with query characteristics known from web search. E.g., in academic search there is a substantially bigger proportion of entity queries, and a heavier tail in query length distribution. We then focus on search failures and, in particular, on null queries that lead to an empty search engine result page, on null sessions that contain such null queries, and on users who are prone to issue null queries. In academic search approximately 1 in 10 queries is a null query, and 25% of the sessions contain a null query. They appear in different types of search sessions, and prevent users from achieving their search goal. To address the high rate of null queries in academic search, we consider the task of providing query suggestions. Specifically we focus on a highly frequent query type: non-boolean informational queries. To this end we need to overcome query sparsity and make effective use of session information. We find that using entities helps to surface more relevant query suggestions in the face of query sparsity. We also find that query suggestions should be conditioned on the type of session in which they are offered to be more effective. After casting the session classification problem as a multi-label classification problem, we generate session-conditional query suggestions based on predicted session type. We find that this session-conditional method leads to significant improvements over a generic query suggestion method. Personalization yields very little further improvements over session -conditional query suggestions. (c) 2017 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available