IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce

A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce
View Sample PDF
Author(s): Pankaj Lathar (CBP Government Engineering College, New Delhi, India)and K. G. Srinivasa (CBP Government Engineering College, New Delhi, India)
Copyright: 2019
Volume: 2
Issue: 1
Pages: 13
Source title: International Journal of Fog Computing (IJFC)
Editor(s)-in-Chief: Sam Goundar (Victoria University of Wellington, New Zealand)and Kashif Munir (National College of Business Administration & Economics, Pakistan)
DOI: 10.4018/IJFC.2019010103

Purchase

View A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce on the publisher's website for pricing and purchasing information.

Abstract

With the advancements in science and technology, data is being generated at a staggering rate. The raw data generated is generally of high value and may conceal important information with the potential to solve several real-world problems. In order to extract this information, the raw data available must be processed and analysed efficiently. It has however been observed, that such raw data is generated at a rate faster than it can be processed by traditional methods. This has led to the emergence of the popular parallel processing programming model – MapReduce. In this study, the authors perform a comparative analysis of two popular data processing engines – Apache Flink and Hadoop MapReduce. The analysis is based on the parameters of scalability, reliability and efficiency. The results reveal that Flink unambiguously outperformance Hadoop's MapReduce. Flink's edge over MapReduce can be attributed to following features – Active Memory Management, Dataflow Pipelining and an Inline Optimizer. It can be concluded that as the complexity and magnitude of real time raw data is continuously increasing, it is essential to explore newer platforms that are adequately and efficiently capable of processing such data.

Related Content

William Tichaona Vambe. © 2023. 16 pages.
Yee-Ming Chen, Chung-Hung Hsieh. © 2022. 11 pages.
Nitin Rathore, Anand Rajavat. © 2022. 18 pages.
Yee-Ming Chen, Chung-Hung Hsieh. © 2022. 14 pages.
Hewan Shrestha, Puviyarai T., Sana Sodanapalli, Chandramohan Dhasarathan. © 2021. 17 pages.
Kelly M. Torres, Aubrey Statti. © 2021. 19 pages.
Sana Sodanapalli, Hewan Shrestha, Chandramohan Dhasarathan, Puviyarasi T., Sam Goundar. © 2021. 15 pages.
Body Bottom