An exploratory research is presented to gauge the impact of feature selection on heterogeneous ensembles. The task is to predict diabetes onset with healthcare data obtained from UC Irvine (VCI) database. Evidence suggests that accuracy and diversity are the two vital requirements to achieve good ensembles. Therefore, the research presented in this paper exploits diversity from heterogeneous base classifiers; and the optimisation effect of feature subset selection in order to improve accuracy. Five widely used classifiers are employed for the ensembles and a meta-classifier is used to aggregate their outputs. The results are presented and compared with similar studies that used the same dataset within the literature. It is shown that by using the proposed method, diabetes onset prediction can be done with higher accuracy.
|Title of host publication||Predicting Diabetes Onset: an Ensemble Supervised Learning Approach|
|Publication status||Published - 12 Jul 2018|
|Event||2018 IEEE Congress on Evolutionary Computing (CEC) - Rio de Janeiro, Brazil|
Duration: 8 Jul 2018 → 13 Jul 2018
|Conference||2018 IEEE Congress on Evolutionary Computing (CEC)|
|City||Rio de Janeiro|
|Period||8/07/18 → 13/07/18|