SELECTION OF OPTIMAL FEATURES IN STATISTICAL MODELLING
dc.contributor.author | Gachoki, P. K. | |
dc.contributor.author | Njoroge, G. G. | |
dc.contributor.author | Muraya, M. M. | |
dc.date.accessioned | 2022-04-19T21:21:50Z | |
dc.date.available | 2022-04-19T21:21:50Z | |
dc.date.issued | 2021 | |
dc.description | pkgachoki@gmail.com; moses.muraya@chuka.ac.ke | en_US |
dc.description.abstract | In statistical modelling, selection of optimal features entails making a selection of relevant predictor variables to be used in development of statistical models. Most modelling studies have focused on construction of statistical models skipping out or failing to put on record the process of selection of best features which is an integral part of statistical modeling. This failure might lead to use of duplicated features, features that are less relevant or other that have low variance in addition to random features which could result to poor performing prediction models. This study seeks to discuss how feature selection can be done as a pre-requisite for statistical modeling. Some of the methods used in selection of best features include; forward selection, backward elimination, recursive elimination, entropy selection, variance threshold elimination, chi-square statistics, tree based selection, feature importance and correlation matrix with heat maps. This study is vital to researchers building statistical models since use of optimal features in statistical modeling would lead to high performing statistical models. | en_US |
dc.description.sponsorship | Chuka University | en_US |
dc.identifier.citation | Gachoki, P. K., Njoroge, G. G. and Muraya, M. M. (2021). Selection of optimal features in statistical modelling. In: Isutsa, D. K. (Ed.). Proceedings of the 7th International Research Conference held in Chuka University from 3rd to 4th December 2020, Chuka, Kenya, p. 555-564 | en_US |
dc.identifier.uri | http://repository.chuka.ac.ke/handle/chuka/16215 | |
dc.language.iso | en | en_US |
dc.publisher | Chuka University | en_US |
dc.subject | Feature selection | en_US |
dc.subject | forward selection | en_US |
dc.subject | feature importance | en_US |
dc.subject | correlation matrix with heatmaps | en_US |
dc.title | SELECTION OF OPTIMAL FEATURES IN STATISTICAL MODELLING | en_US |
dc.type | Article | en_US |