System of Information Retrieval in XML Documents

View Free PDF

Author(s): Saliha Smadhi (Universite de Pau, France)
Copyright: 2002
Pages: 4
Source title: Issues & Trends of Information Technology Management in Contemporary Organizations
Source Editor(s): Mehdi Khosrow-Pour, D.B.A. (Information Resources Management Association, USA)
DOI: 10.4018/978-1-930708-39-6.ch192
ISBN13: 9781930708396
EISBN13: 9781466641358

Keywords: Information Science Reference / IT Research & Theory / IT Research and Theory / Library & Information Science

Abstract

The Extensible Markup Language (XML) is considered as a new standard for data representation and exchange on the web. XML opens opportunities to develop a new generation of information retrieval system (IRS) to improve the interrogation process of document bases on the Web. We propose an approach to retrieve units (or subdocuments) of relevant information from XML documents. Our work focuses instead on end-users who have not expertise in the domain and of that the structure is them unknown (like a majority of the end-users). This approach supports keywords based searching like classical IRS and integrates structured searching with the search attributes notion. It’s based on an indexing method of document tree leafs which authorize so a content-oriented retrieval. The retrieval subdocuments are ranked according to their similarity with user’s query. We use an similarity measure which is a compromise between two measures : exhaustiveness and specificity.

IRMA Offers Over 2,500 Full Text Open Access Research Papers for Free Download Click to Start Searching Free IRM Research!

IRMA Sponsors

Encyclopedia of Information Science and Technology, Fourth Edition