A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce

View Sample PDF

Author(s): Pankaj Lathar (CBP Government Engineering College, New Delhi, India)and K. G. Srinivasa (CBP Government Engineering College, New Delhi, India)
Copyright: 2019
Volume: 2
Issue: 1
Pages: 13
Source title: International Journal of Fog Computing (IJFC)
Editor(s)-in-Chief: Sam Goundar (Victoria University of Wellington, New Zealand)and Kashif Munir (National College of Business Administration & Economics, Pakistan)
DOI: 10.4018/IJFC.2019010103

Keywords: Cloud Computing / Computer Science & IT / Information Science Reference / Network Architecture

Purchase

View A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce on the publisher's website for pricing and purchasing information.

Abstract

With the advancements in science and technology, data is being generated at a staggering rate. The raw data generated is generally of high value and may conceal important information with the potential to solve several real-world problems. In order to extract this information, the raw data available must be processed and analysed efficiently. It has however been observed, that such raw data is generated at a rate faster than it can be processed by traditional methods. This has led to the emergence of the popular parallel processing programming model – MapReduce. In this study, the authors perform a comparative analysis of two popular data processing engines – Apache Flink and Hadoop MapReduce. The analysis is based on the parameters of scalability, reliability and efficiency. The results reveal that Flink unambiguously outperformance Hadoop's MapReduce. Flink's edge over MapReduce can be attributed to following features – Active Memory Management, Dataflow Pipelining and an Inline Optimizer. It can be concluded that as the complexity and magnitude of real time raw data is continuously increasing, it is essential to explore newer platforms that are adequately and efficiently capable of processing such data.

The IRMA Community

Research IRM

A Study on the Performance and Scalability of Apache Flink Over Hadoop MapReduce

Purchase

Abstract

Related Content

IRMA Sponsors