A Survey on the Performance Comparison of Map Reduce Technologies and the Architectural Improvement of Spark

Raghavendra, GS; Manasa, Bezwada; Vasavi, M.

doi:10.22937/IJCSNS.2022.22.5.18

International Journal of Computer Science & Network Security

Volume 22, Issue 5 / Pages 121-126 / 2022 / 1738-7906 (pISSN)

국제컴퓨터통신보호논문지학회 (International Journal of Computer Science & Network Security)


Raghavendra, GS (Computer Science and Engineering, RVR & JC College of Engineering);
Manasa, Bezwada (Computer Science and Engineering, RVR & JC College of Engineering);
Vasavi, M. (Computer Science and Engineering, RVR & JC College of Engineering)

Received: 2022.05.05
Published: 2022.05.30


Abstract

Hadoop and Apache Spark are open source projects of the Apache Software Foundation, and both are leading tools for large-scale data analytics. Hadoop has led the big data industry for five years. Spark can process data considerably faster, by up to 100 times. However, the volume of data each can handle differs: Hadoop MapReduce can process data sets far larger than Spark can. This article compares the performance of Spark and MapReduce and discusses the advantages and disadvantages of both technologies.

Acknowledgement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. The authors would like to thank the editor and anonymous reviewers for their comments, which helped improve the quality of this work.

