Optimal Query Generation for Hidden Web Extraction Through Response Analysis

Author(s): Sonali Gupta (YMCA University of Science & Technology, India) and Komal Kumar Bhatia (YMCA University of Science & Technology, India)
Copyright: 2018
Pages: 19
Source title: The Dark Web: Breakthroughs in Research and Practice
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-5225-3163-0.ch005

Abstract

A huge number of Hidden Web databases exist on the WWW, forming a massive source of high-quality information. Retrieving this information to enrich a search engine's repository is the prime target of a Hidden Web crawler, which should also perform this task at an affordable cost and level of resource utilization. This paper proposes a Random ranking mechanism for ranking the queries to be raised by the Hidden Web crawler. By ranking the queries according to the proposed mechanism, the Hidden Web crawler can make an optimal choice among the candidate queries and efficiently retrieve the contents of the Hidden Web databases. The proposed Hidden Web crawler also has an extensible and scalable framework to improve the efficiency of crawling. The proposed approach is also compared with other Hidden Web crawling methods in the literature.
