[Bar16g] Controlling the cost of prediction in using a cascade of reject classifiers for personalized medicine
Revue Internationale avec comité de lecture :
Journal 7th International Conference on Bioinformatics Models, Methods and Algorithms (BIOINFORMATICS 2016). ,
pp. 42-50,
2016
motcle:
Résumé:
The supervised learning in bioinformatics is a major tool to diagnose a disease, to identify the best therapeutic strategy or to establish a prognostic. The main objective in classifier construction is to maximize the accuracy in order to obtain a reliable prediction system. However, a second objective is to minimize the cost of the use of the classifier on new patients. Despite the control of the classification cost is high important in the medical domain, it has been very little studied. We point out that some patients are easy to predict, only a small subset of medical variables are needed to obtain a reliable prediction. The prediction of these patients can be cheaper than the others patient. Based on this idea, we propose a cascade approach that decreases the classification cost of the basic classifiers without dropping their accuracy. Our cascade system is a sequence of classifiers with rejects option of increasing cost. At each stage, a classifier receives all patients rejected by the last classifier, makes a prediction of the patient and rejects to the next classifier the patients with low confidence prediction. The performances of our methods are evaluated on four real medical problems.
Commentaires:
Blaise Hanczar, Avner Bar-Hen