Analysis & design of data farming algorithm for cardiac patient data

Mohd Shahnawaz, Kanak Saxena, Hari Pandey

Research output: Chapter in Book/Report/Conference proceedingConference proceeding (ISBN)

3 Downloads (Pure)

Abstract

Data farming is a process to grow data by applying various statistical, predictions, machine learning and data mining approach on the available data. As data collection cost is high so many times data mining projects use existing data collected for various other purposes, such as daily collected data to process and data required for monitoring & control. Sometimes, the dataset available might be large or wide data set and sufficient for extraction of knowledge but sometimes the data set might be narrow and insufficient to extract meaningful knowledge or the data may not even exist. Mining from wide datasets has received wide attention in the available literature. Many models and algorithms for data reduction & feature selection have been developed for wide datasets. Determining or extracting knowledge from a narrow data set (partial availability of data) or in the absence of an existing data set has not been sufficiently addressed in the literature. In this paper we propose an algorithm for data farming, which farm sufficient data from the available little seed data. Classification accuracy of J48 classification for farmed data is achieved better than classification results for the seed data, which proves that the proposed data farming algorithm is effective.
Original languageEnglish
Title of host publicationNot Known
Pages114-118
DOIs
Publication statusE-pub ahead of print - 23 Aug 2018
Event8th International Conference on Cloud Computing, Data Science & Engineering (Confluence) - , India
Duration: 11 Jan 201812 Jan 2018

Conference

Conference8th International Conference on Cloud Computing, Data Science & Engineering (Confluence)
CountryIndia
Period11/01/1812/01/18

Fingerprint

Data mining
Seed
Farms
Learning systems
Feature extraction
Data reduction
Availability
Monitoring
Costs

Keywords

  • Interactive data exploration and discovery
  • Methodologies and Tools
  • Data Farming
  • J48 Classification
  • Cardiac Patient data
  • Missing value estimation.

Cite this

Shahnawaz, Mohd ; Saxena, Kanak ; Pandey, Hari. / Analysis & design of data farming algorithm for cardiac patient data. Not Known. 2018. pp. 114-118
@inproceedings{3c81104b39654efb8e6a86e53a63b87d,
title = "Analysis & design of data farming algorithm for cardiac patient data",
abstract = "Data farming is a process to grow data by applying various statistical, predictions, machine learning and data mining approach on the available data. As data collection cost is high so many times data mining projects use existing data collected for various other purposes, such as daily collected data to process and data required for monitoring & control. Sometimes, the dataset available might be large or wide data set and sufficient for extraction of knowledge but sometimes the data set might be narrow and insufficient to extract meaningful knowledge or the data may not even exist. Mining from wide datasets has received wide attention in the available literature. Many models and algorithms for data reduction & feature selection have been developed for wide datasets. Determining or extracting knowledge from a narrow data set (partial availability of data) or in the absence of an existing data set has not been sufficiently addressed in the literature. In this paper we propose an algorithm for data farming, which farm sufficient data from the available little seed data. Classification accuracy of J48 classification for farmed data is achieved better than classification results for the seed data, which proves that the proposed data farming algorithm is effective.",
keywords = "Interactive data exploration and discovery, Methodologies and Tools, Data Farming, J48 Classification, Cardiac Patient data, Missing value estimation.",
author = "Mohd Shahnawaz and Kanak Saxena and Hari Pandey",
year = "2018",
month = "8",
day = "23",
doi = "10.1109/CONFLUENCE.2018.8442527",
language = "English",
isbn = "978-1-5386-1719-9",
pages = "114--118",
booktitle = "Not Known",

}

Shahnawaz, M, Saxena, K & Pandey, H 2018, Analysis & design of data farming algorithm for cardiac patient data. in Not Known. pp. 114-118, 8th International Conference on Cloud Computing, Data Science & Engineering (Confluence), India, 11/01/18. https://doi.org/10.1109/CONFLUENCE.2018.8442527

Analysis & design of data farming algorithm for cardiac patient data. / Shahnawaz, Mohd; Saxena, Kanak; Pandey, Hari.

Not Known. 2018. p. 114-118.

Research output: Chapter in Book/Report/Conference proceedingConference proceeding (ISBN)

TY - GEN

T1 - Analysis & design of data farming algorithm for cardiac patient data

AU - Shahnawaz, Mohd

AU - Saxena, Kanak

AU - Pandey, Hari

PY - 2018/8/23

Y1 - 2018/8/23

N2 - Data farming is a process to grow data by applying various statistical, predictions, machine learning and data mining approach on the available data. As data collection cost is high so many times data mining projects use existing data collected for various other purposes, such as daily collected data to process and data required for monitoring & control. Sometimes, the dataset available might be large or wide data set and sufficient for extraction of knowledge but sometimes the data set might be narrow and insufficient to extract meaningful knowledge or the data may not even exist. Mining from wide datasets has received wide attention in the available literature. Many models and algorithms for data reduction & feature selection have been developed for wide datasets. Determining or extracting knowledge from a narrow data set (partial availability of data) or in the absence of an existing data set has not been sufficiently addressed in the literature. In this paper we propose an algorithm for data farming, which farm sufficient data from the available little seed data. Classification accuracy of J48 classification for farmed data is achieved better than classification results for the seed data, which proves that the proposed data farming algorithm is effective.

AB - Data farming is a process to grow data by applying various statistical, predictions, machine learning and data mining approach on the available data. As data collection cost is high so many times data mining projects use existing data collected for various other purposes, such as daily collected data to process and data required for monitoring & control. Sometimes, the dataset available might be large or wide data set and sufficient for extraction of knowledge but sometimes the data set might be narrow and insufficient to extract meaningful knowledge or the data may not even exist. Mining from wide datasets has received wide attention in the available literature. Many models and algorithms for data reduction & feature selection have been developed for wide datasets. Determining or extracting knowledge from a narrow data set (partial availability of data) or in the absence of an existing data set has not been sufficiently addressed in the literature. In this paper we propose an algorithm for data farming, which farm sufficient data from the available little seed data. Classification accuracy of J48 classification for farmed data is achieved better than classification results for the seed data, which proves that the proposed data farming algorithm is effective.

KW - Interactive data exploration and discovery

KW - Methodologies and Tools

KW - Data Farming

KW - J48 Classification

KW - Cardiac Patient data

KW - Missing value estimation.

U2 - 10.1109/CONFLUENCE.2018.8442527

DO - 10.1109/CONFLUENCE.2018.8442527

M3 - Conference proceeding (ISBN)

SN - 978-1-5386-1719-9

SP - 114

EP - 118

BT - Not Known

ER -