Open Access   Article Go Back

A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools

Ankita Kadre1 , S.R Yadav2

Section:Survey Paper, Product Type: Journal Paper
Volume-3 , Issue-10 , Page no. 52-57, Oct-2015

Online published on Oct 31, 2015

Copyright © Ankita Kadre , S.R Yadav . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Ankita Kadre , S.R Yadav, “A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools,” International Journal of Computer Sciences and Engineering, Vol.3, Issue.10, pp.52-57, 2015.

MLA Style Citation: Ankita Kadre , S.R Yadav "A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools." International Journal of Computer Sciences and Engineering 3.10 (2015): 52-57.

APA Style Citation: Ankita Kadre , S.R Yadav, (2015). A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools. International Journal of Computer Sciences and Engineering, 3(10), 52-57.

BibTex Style Citation:
@article{Kadre_2015,
author = {Ankita Kadre , S.R Yadav},
title = {A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2015},
volume = {3},
Issue = {10},
month = {10},
year = {2015},
issn = {2347-2693},
pages = {52-57},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=703},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=703
TI - A Survey on Map Reduce Algorithm for Big data analysis using Hadoop, Pig and Hive utility tools
T2 - International Journal of Computer Sciences and Engineering
AU - Ankita Kadre , S.R Yadav
PY - 2015
DA - 2015/10/31
PB - IJCSE, Indore, INDIA
SP - 52-57
IS - 10
VL - 3
SN - 2347-2693
ER -

VIEWS PDF XML
2510 2339 downloads 2297 downloads
  
  
           

Abstract

Data is growing at a rate which cannot be handled by the traditional methods of computing. To store and process such data new data analysis and storage techniques have emerged over the last few years. Hadoop is one such parallel processing open source framework which provides distributed storage and processing of big data. This paper introduces Big Data, a new platform which enables accessing, manipulating, analyzing, and visualizing data residing on a Hadoop cluster. In this paper a survey is done on big data analysis using Hadoop and other utility tools like Pig and Hive. The majority of large-scale data intensive applications executed by data centers are based on Map-Reduce or its open-source implementation, Hadoop. Such applications are executed on large clusters requiring large amounts of energy, making the energy costs a large fraction of the data center’s overall costs. Therefore to minimizing the energy consumption when executing Map-Reduce jobs is a critical concern for data centers. In this survey Flight data has been analyzed in terms of the mentioned parameters such as time complexity and energy consumption information’s are retrieved using Hadoop.

Key-Words / Index Term

Map reduce, big data, Hadoop, HDFS, Pig & Hive, Flight data

References

.
[1] Toshimori Honjo, Kazuki Oikawa "Hardware acceleration of Hadoop Map-Reduce" in 2013 IEEE International Conference on Big Data.
[2] Ming Meng, Jing Gao, Jun-jie Chen "Blast-Parallel: The Parallelizing Implementation Of Sequence Alignment Algorithms Based On Hadoop Platform" in 2013 6th International Conference on Biomedical Engineering and Informatics (BMEI 2013).
[3] Madhury Mohandas, Dhanya P M "An Approach for Log Analysis Based Failure Monitoring in Hadoop Cluster" in 2013 IEEE.
[4] Ilja Kromonov, Pelle Jakovits, Satish Narayana Srirama "NEWT - A Resilient BSP Framework for Iterative Algorithms on Hadoop YARN" in 2014 IEEE.
[5] Lena Mashayekhy, Mahyar Movahed Nejad, Daniel Grosu, Dajun Lu, Weisong Shi "Energy-aware Scheduling of MapReduce Jobs" in 2014 IEEE International Congress on Big Data.
[6] Oscar D. Lara, Weiqiang Zhuang, and Adarsh Pannu "Big R: Large-scale Analytics on Hadoop using R" in 2014 IEEE International Congress on Big Data
[7] Kiran M., Amresh Kumar "Verification and Validation of Parallel Support Vector Machine Algorithm based on Map-Reduce Program Model on Hadoop Cluster" in 2013 International Conference on Advanced Computing and Communication Systems (ICACCS -2013), Dec. 19 – 21, 2013, Coimbatore, INDIA