Performance Evaluation of Imputation Methods For Incomplete Datasets
In this study, we compare the performance of four different imputation strategies ranging from the commonly used Listwise Deletion to model based approaches such as the Maximum Likehood on enhancing completeness in incomplete software project data sets. We evaluate the impact of each of these methods by implementing them on six different real-time software project data sets which are classified into different categories based on their inherent properties. The reliability of the constructed data sets using these techniques are further tested by building prediction models using stepwise regression. The experiment results are noted and the findings are finally discussed.
International Journal of Software Engineering and Knowledge Engineering
(2007). Performance Evaluation of Imputation Methods For Incomplete Datasets. International Journal of Software Engineering and Knowledge Engineering, 17(1), 127-152.
Available at: http://aquila.usm.edu/fac_pubs/2093