Retrieving Non-Latin Information in a Latin Web: The Case of Greek

View Sample PDF

Author(s): Fotis Lazarinis (University of Sunderland, UK)
Copyright: 2009
Pages: 16
Source title: Handbook of Research on Text and Web Mining Technologies
Source Author(s)/Editor(s): Min Song (New Jersey Institute of Technology, USA)and Yi-Fang Brook Wu (New Jersey Institute of Technology, USA)
DOI: 10.4018/978-1-59904-990-8.ch031

Keywords: Data Mining / Data Mining and Databases / Information Science Reference / Library & Information Science

Purchase

View Retrieving Non-Latin Information in a Latin Web: The Case of Greek on the publisher's website for pricing and purchasing information.

Abstract

Over 60% of the online population are non-English speakers and it is probable the number of non-English speakers is growing faster than English speakers. Most search engines were originally engineered for English. They do not take full account of inflectional semantics nor, for example, diacritics or the use of capitals. The main conclusion from the literature is that searching using non-English and non-Latin based queries results in lower success and requires additional user effort so as to achieve acceptable recall and precision. In this chapter a Greek query log is morphologically and grammatically analyzed and a number of queries are submitted to search engines and their relevance is evaluated with the aid of real users. A Greek meta-searcher redirecting normalized queries to Google.gr is also presented and evaluated. An increase in relevance is reported when stopwords are eliminated and queries are normalized based on their morphology.

The IRMA Community

Research IRM

Retrieving Non-Latin Information in a Latin Web: The Case of Greek

Purchase

Abstract

Related Content

IRMA Sponsors