IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

A Grid and Cloud Based System for Data Grouping Computation and Online Service

A Grid and Cloud Based System for Data Grouping Computation and Online Service
View Sample PDF
Author(s): Wing-Ning Li (University of Arkansas, USA), Donald Hayes (University of Arkansas, USA), Jonathan Baran (University of Arkansas, USA), Cameron Porter (Acxiom Corporation, USA)and Tom Schweiger (Acxiom Corporation, USA)
Copyright: 2013
Pages: 14
Source title: Applications and Developments in Grid, Cloud, and High Performance Computing
Source Author(s)/Editor(s): Emmanuel Udoh (Sullivan University, USA)
DOI: 10.4018/978-1-4666-2065-0.ch021

Purchase

View A Grid and Cloud Based System for Data Grouping Computation and Online Service on the publisher's website for pricing and purchasing information.

Abstract

Record linkage deals with finding records that identify the same real world entity, such as an individual or a business, from a given file or set of files. Record linkage problem is also referred to as the entity resolution or record recognition problem. To locate those records identifying the same real world entity, in principle, pairwise record analyses have to be performed among all records. Analytical operations between two records vary from comparing corresponding fields to enhancing records through large knowledge bases and querying large databases. Hence, these operations are complex and take time. To reduce the number of pairwise record comparisons, blocking techniques are introduced to partition the records into blocks. After that records in each block are analyzed against one and another. One of the effective blocking methods is the closure approach, where a “related” equivalence relation is used to partition the records into equivalence classes. This paper introduces the closure problem and describes the design and implementation of a parallel and distributed closure prototype system running in an enterprise grid.

Related Content

Radhika Kavuri, Satya kiranmai Tadepalli. © 2024. 19 pages.
Ramu Kuchipudi, Ramesh Babu Palamakula, T. Satyanarayana Murthy. © 2024. 10 pages.
Nidhi Niraj Worah, Megharani Patil. © 2024. 21 pages.
Vishal Goar, Nagendra Singh Yadav. © 2024. 23 pages.
S. Boopathi. © 2024. 24 pages.
Sai Samin Varma Pusapati. © 2024. 25 pages.
Swapna Mudrakola, Krishna Keerthi Chennam, Shitharth Selvarajan. © 2024. 11 pages.
Body Bottom