Inference patterns from Big Data using aggregation, filtering and tagging- A survey

Pathak Anand Prakashbhai, Hari Mohan Pandey

Research output: Chapter in Book/Report/Conference proceedingConference proceeding (ISBN)

3 Citations (Scopus)

Abstract

This paper reviews various approaches to infer the patterns from Big Data using aggregation, filtering and tagging. Earlier research shows that data aggregation concerns about gathered data and how efficiently it can be utilized. It is understandable that at the time of data gathering one does not care much about whether the gathered data will be useful or not. Hence, filtering and tagging of the data are the crucial steps in collecting the relevant data to fulfill the need. Therefore the main goal of this paper is to present a detailed and comprehensive survey on different approaches. To make the concept clearer, we have provided a brief introduction of Big Data, how it works, working of two data aggregation tools (namely, flume and sqoop), data processing tools (hive and mahout) and various algorithms that can be useful to understand the topic. At last we have included comparisons between aggregation tools, processing tools as well as various algorithms through its pre-process, matching time, results and reviews.

Original languageEnglish
Title of host publicationProceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit
Subtitle of host publicationThe Next Generation Information Technology Summit
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages66-71
Number of pages6
ISBN (Electronic)9781479942367
ISBN (Print)9781479942367
DOIs
Publication statusPublished - 6 Nov 2014
Event5th International Conference on Confluence 2014 - The Next Generation Information Technology Summit - Noida, India
Duration: 25 Sep 201426 Sep 2014

Publication series

NameProceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit

Conference

Conference5th International Conference on Confluence 2014 - The Next Generation Information Technology Summit
CountryIndia
CityNoida
Period25/09/1426/09/14

    Fingerprint

Keywords

  • aggregation
  • Big Data
  • filtering
  • Flume
  • Hadoop
  • HDFS
  • Hive
  • Mahout
  • MapReduce
  • Sqoop
  • tagging

Cite this

Prakashbhai, P. A., & Pandey, H. M. (2014). Inference patterns from Big Data using aggregation, filtering and tagging- A survey. In Proceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit: The Next Generation Information Technology Summit (pp. 66-71). [6949238] (Proceedings of the 5th International Conference on Confluence 2014: The Next Generation Information Technology Summit). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CONFLUENCE.2014.6949238