IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

The Need to Consider Hardware Selection when Designing Big Data Applications Supported by Metadata

The Need to Consider Hardware Selection when Designing Big Data Applications Supported by Metadata
View Sample PDF
Author(s): Nathan Regola (University of Notre Dame, USA), David A. Cieslak (Aunalytics, Inc., USA)and Nitesh V. Chawla (University of Notre Dame, USA)
Copyright: 2014
Pages: 16
Source title: Big Data Management, Technologies, and Applications
Source Author(s)/Editor(s): Wen-Chen Hu (University of North Dakota, USA)and Naima Kaabouch (University of North Dakota, USA)
DOI: 10.4018/978-1-4666-4699-5.ch015

Purchase

View The Need to Consider Hardware Selection when Designing Big Data Applications Supported by Metadata on the publisher's website for pricing and purchasing information.

Abstract

The selection of hardware to support big data systems is complex. Even defining the term “big data” is difficult. “Big data” can mean a large volume of data in a database, a MapReduce cluster that processes data, analytics and reporting applications that must access large datasets to operate, algorithms that can effectively operate on large datasets, or even basic scripts that produce a needed resulted by leveraging data. Big data systems can be composed of many component systems. For these reasons, it appears difficult to create a universal, representative benchmark that approximates a “big data” workload. Along with the trend to utilize large datasets and sophisticated tools to analyze data, the trend of cloud computing has emerged as an effective method of leasing compute time. This chapter explores some of the issues at the intersection of virtualized computing (since cloud computing often uses virtual machines), metadata stores, and big data. Metadata is important because it enables many applications and users to access datasets and effectively use them without relying on extensive knowledge from humans about the data.

Related Content

Renjith V. Ravi, Mangesh M. Ghonge, P. Febina Beevi, Rafael Kunst. © 2022. 24 pages.
Manimaran A., Chandramohan Dhasarathan, Arulkumar N., Naveen Kumar N.. © 2022. 20 pages.
Ram Singh, Rohit Bansal, Sachin Chauhan. © 2022. 19 pages.
Subhodeep Mukherjee, Manish Mohan Baral, Venkataiah Chittipaka. © 2022. 17 pages.
Vladimir Nikolaevich Kustov, Ekaterina Sergeevna Selanteva. © 2022. 23 pages.
Krati Reja, Gaurav Choudhary, Shishir Kumar Shandilya, Durgesh M. Sharma, Ashish K. Sharma. © 2022. 18 pages.
Nwosu Anthony Ugochukwu, S. B. Goyal. © 2022. 23 pages.
Body Bottom