IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms

On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms
View Sample PDF
Author(s): Claudio Gutiérrez-Soto (Université de Toulouse, France & Universidad del Bío, Chile)and Gilles Hubert (Université de Toulouse, France)
Copyright: 2016
Pages: 21
Source title: Business Intelligence: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-4666-9562-7.ch057

Purchase

View On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms on the publisher's website for pricing and purchasing information.

Abstract

When using information retrieval systems, information related to searches is typically stored in files, which are well known as log files. By contrast, past search results of previously submitted queries are ignored most of the time. Nevertheless, past search results can be profitable for new searches. Some approaches in Information Retrieval exploit the previous searches in a customizable way for a single user. On the contrary, approaches that deal with past searches collectively are less common. This paper deals with such an approach, by using past results of similar past queries submitted by other users, to build the answers for new submitted queries. It proposes two Monte Carlo algorithms to build the result for a new query by selecting relevant documents associated to the most similar past query. Experiments were carried out to evaluate the effectiveness of the proposed algorithms using several dataset variants. These algorithms were also compared with the baseline approach based on the cosine measure, from which they reuse past results. Simulated datasets were designed for the experiments, following the Cranfield paradigm, well established in the Information Retrieval domain. The empirical results show the interest of our approach.

Related Content

Dina Darwish. © 2024. 48 pages.
Dina Darwish. © 2024. 51 pages.
Smrity Prasad, Kashvi Prawal. © 2024. 19 pages.
Jignesh Patil, Sharmila Rathod. © 2024. 17 pages.
Ganesh B. Regulwar, Ashish Mahalle, Raju Pawar, Swati K. Shamkuwar, Priti Roshan Kakde, Swati Tiwari. © 2024. 23 pages.
Pranali Dhawas, Abhishek Dhore, Dhananjay Bhagat, Ritu Dorlikar Pawar, Ashwini Kukade, Kamlesh Kalbande. © 2024. 24 pages.
Pranali Dhawas, Minakshi Ashok Ramteke, Aarti Thakur, Poonam Vijay Polshetwar, Ramadevi Vitthal Salunkhe, Dhananjay Bhagat. © 2024. 26 pages.
Body Bottom