[GNT12] Stacking prediction for a binary outcome

Conférence Internationale avec comité de lecture : Compstat 2012, August 2012, pp.271-282, Limassol, Chypre,

Mots clés: Stacking meta-model, binary outcome, Prediction, Sparse-PLSDA, Boosting, Bayesian, Scoring, SVM

Résumé: A large number of supervised classi cation models have been proposed in the literature. In order to avoid any bias induced by the use of one single statistical approach, they are combined through a speci c "stacking" eta-model. To deal with the case of a binary outcome and of categorical predictors, we introduce several improvements to stacking: combining models is done through PLS-DA instead of OLS due to the strong correlation between predictions, and a speci c methodology is developed for the case of a small number of observations, using repeated sub-sampling for variables selection. Five very di erent models (Boosting, Naive Bayes, SVM, Sparse PLS-DA and Expert Scoring) are combined through this improved stacking, and applied in the context of the development of alternative strategies for safety evaluation where multiple in vitro, in silico and physico-chemical parameters are used to classify substances in two classes : "Sensitizer" and "No Sensitizer". Results show that stacking meta-models have better performances than each of the five models taken separately, and furthermore, stacking provides a better balance between sensitivity and speci city.

Equipe: msdma


@inproceedings {
title="{Stacking prediction for a binary outcome}",
author=" C. Gomes and H. Nocairi and M. Thomas and F. Ibanez and J. Collin and G. Saporta ",
booktitle="{Compstat 2012}",
address="Limassol, Chypre",