[DBS01] Missing Data in Hierarchical Classification - a study
Conférence Internationale avec comité de lecture :
ASMDA 2001,
January 2001,
motcle:
Résumé:
We analyse the effect of missing data in hierarchical classification of variables according to the following factors: amount of missing data, imputation techniques, similarity coefficient, and aggregation criterion. We have used two methods of imputation, a regression method using an ordinary-least squares method and an EM algorithm. For the similarity matrices we have used the basic affinity coefficient and the Pearson's correlation coefficient. As aggregation criteria we apply average linkage, single linkage and complete linkage methods. To compare the structure of the hierarchical classifications the Spearman's coefficient between the associated ultrametrics has been used. We present here simulation experiments in two multivariate normal cases.
Commentaires:
10 th International Conference on Applied Stochastic Models and Data Analysis