IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Indexing Techniques for Web Access Logs

Indexing Techniques for Web Access Logs
View Sample PDF
Author(s): Yannis Manolopoulos (Aristotle University of Thessaloniki, Greece), Mikolaj Morzy (Poznan University of Technology, Poland), Tadeusz Morzy (Poznan University of Technology, Poland), Alexandros Nanopoulos (Aristotle University of Thessaloniki, Greece), Marek Wojciechowski (Poznan University of Technology, Poland)and Maciej Zakrzewicz (Poznan University of Technology, Poland)
Copyright: 2004
Pages: 30
Source title: Web Information Systems
Source Author(s)/Editor(s): David Taniar (Monash University, Australia)and Johanna Wenny Rahayu (La Trobe University, Australia)
DOI: 10.4018/978-1-59140-208-4.ch009

Purchase

View Indexing Techniques for Web Access Logs on the publisher's website for pricing and purchasing information.

Abstract

Access histories of users visiting a web server are automatically recorded in web access logs. Conceptually, the web-log data can be regarded as a collection of clients’ access-sequences, where each sequence is a list of pages accessed by a single user in a single session. This chapter presents novel indexing techniques that support efficient processing of so-called pattern queries, which consist of finding all access sequences that contain a given subsequence. Pattern queries are a key element of advanced analyses of web-log data, especially those concerning typical navigation schemes. In this chapter, we discuss the particularities of efficiently processing user access-sequences with pattern queries, compared to the case of searching unordered sets. Extensive experimental results are given, which examine a variety of factors and illustrate the superiority of the proposed methods over indexing techniques for unordered data adapted to access sequences.

Related Content

Dina Darwish. © 2024. 28 pages.
Dina Darwish. © 2024. 28 pages.
Muhammad Ahmed, Adnan Ahmad, Furkh Zeshan, Hamid Turab. © 2024. 33 pages.
Pankaj Bhambri. © 2024. 17 pages.
Kaushikkumar Patel. © 2024. 20 pages.
Vijaya Kittu Manda, Arnold Mashud Abukari, Vivek Gupta, Madavarapu Jhansi Bharathi. © 2024. 24 pages.
Pankaj Bhambri. © 2024. 17 pages.
Body Bottom