[DMT17] WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation

Conférence Internationale avec comité de lecture : IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , July 2017, pp.5957-5966, USA, Honolulu, Hawaii , (DOI: 10.1109/CVPR.2017.631)

Auteurs: T. Durand , T. Mordan , N. Thome , M. Cord

Mots clés: Deep Learning, Convolutionnal Neural Networks, Weakly Supervised Learning

Résumé: This paper introduces WILDCAT, a deep learning method which jointly aims at aligning image regions for gaining spatial invariance and learning strongly localized features. Our model is trained using only global image labels and is devoted to three main visual recognition tasks: image classification, weakly supervised pointwise object localization and semantic segmentation. WILDCAT extends state-of-the-art Convolutional Neural Networks at three major levels: the use of Fully Convolutional Networks for maintaining spatial resolution, the explicit design in the network of local features related to different class modalities, and a new way to pool these features to provide a global image prediction required for weakly supervised training. Extensive experiments show that our model significantly outperforms the state-of-the-art methods.

Download

BibTeX

@inproceedings {
	DMT17,
	title	=	"{WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation}",
	author	=	" T. Durand and T. Mordan and N. Thome and M. Cord ",
	booktitle	=	"{IEEE Conference on Computer Vision and Pattern Recognition (CVPR) }",
	year	=	2017,
	month	=	"July",
	pages	=	"5957-5966",
	address	=	" USA, Honolulu, Hawaii ",
	doi	=	"10.1109/CVPR.2017.631",
}