IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Dynamic Indexing of Information in the Web: The Case of News Sites

Dynamic Indexing of Information in the Web: The Case of News Sites
View Free PDF
Author(s): Luciano de A. Barbosa (UFPE, Brazil), Mariano Cravo T. Neto (UFPE, Brazil), Ana Carolina Salgado (UFPE, Brazil) and Franklin de S. Ramalho (UFPE, Brazil)
Copyright: 2003
Pages: 4
Source title: Information Technology & Organizations: Trends, Issues, Challenges & Solutions
Source Editor(s): Mehdi Khosrow-Pour, D.B.A. (Information Resources Management Association, USA)
DOI: 10.4018/978-1-59140-066-0.ch072
ISBN13: 9781616921248
EISBN13: 9781466665330

Abstract

This paper presents a solution to keep available up-to-date information in a search engine whose scope is the content available within news web sites. This solution is based on the use of non-uniform policy to update the documents belonging to this scope. In order to use the non-uniform policy, we identify the most and the least recently updated documents, based on the idea in which it is supposed that the closest documents of the root of a site are the most modified ones. This hypothesis was verified through an experiment within news sites. In order to demonstrate the efficiency of our solution regarding a traditional one, we performed a case study whose results showed that: our solution spent less time to make the new information available, it made fewer requests to the web server, it kept a high freshness of the scope and, finally, it kept the search engine index up-to-date for a much longer time than the traditional solution.

Body Bottom