Publications

Google Scholar Profile

International Journals

Deep Time Series Forecasting with Shape and Temporal Criteria. [pdf].
V. Le Guen, N. Thome. IEEE Transactions on Pattern Analysis and Machine Intelligence (IF: 24.3), February 2022.
Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting. [pdf].
Y. Yin, V. Le Guen, J. Dona, I. Ayed, E. de Bezenac, N. Thome, P. Gallinari. Journal of Statistical Mechanics: Theory and Experiment (JSTAT) (IF: 2.23), Vol. 12, December 2021.
Confidence Estimation via Auxiliary Models. [pdf]. C. Corbière, N. Thome, A. Saporta, T-H. Vu, M. Cord, P. Pérez.
IEEE Transactions on Pattern Analysis and Machine Intelligence (IF: 24.3), vol. 44, no. 10, pp. 6043-6055, June 2021.
3D Spatial Priors for Semi-Supervised Organ Segmentation with Deep ConvNets. [pdf].
O. Petit, N. Thome, Luc Soler. International Journal of Computer Assisted Radiology and Surgery (IJCARS) (IF: 2.94), Springer 2021.
Iterative Confidence Relabeling with Deep ConvNets for Organ Segmentation with Partial Labels. [pdf]
O. Petit, N. Thome, L. Soler. Computerized Medical Imaging and Graphics (IF: 3.75), Volume 91, July 2021.
Distributed Optimization for Deep Learning with Gossip Exchange. [pdf]
M. Blot, D. Picard, N. Thome, M. Cord. Neurocomputing (IF: 3.241), Volume 330, Pages 287-296, February 2019.
Classifying low-resolution images by integrating privileged information in deep CNNs. [pdf]
M. Chevalier, N. Thome, G. Henaff, M. Cord. Pattern Recognition Letters (PRL) (IF: 1.952), Volume 116, Pages 29-35, December 2018.
End-to-End Learning of Latent Deformable Part-based Representations for Object Detection. [pdf]
T. Mordan, N. Thome, G. Henaff, M. Cord. International Journal of Computer Vision (IJCV) (IF: 11.541), Pages 1-21, July 2018.
SyMIL: MinMax Latent SVM for Weakly Labeled Data. [pdf]
T. Durand, N. Thome, M. Cord. IEEE Transactions on Neural Networks and Learning Systems (IF: 7.89), Pages 1-14, April 2018.
Exploiting Negative Evidence for Deep Latent Structured Models. [pdf]
T. Durand, N. Thome, M. Cord. IEEE Transactions on Pattern Analysis and Machine Intelligence (IF: 24.3), Pages 1-14, January 2018.
Gaze Latent Support Vector Machine for Image Classification Improved by Weakly Supervised Region Selection. [pdf] X. Wang, N. Thome, M. Cord. Pattern Recognition (PR) (IF: 3.96), Volume 72, Pages 59-71, December 2017.
Learning a Distance Metric from Relative Comparisons between Quadruplets of Images. [pdf]
M. Law, N. Thome, M. Cord. International Journal of Computer Vision (IJCV) (IF: 11.541), Volume 121, Issue 1, pp 65-94, January 2017.
Perceptual principles for video classification with Slow Feature Analysis. [Project Page] [pdf]
C. Thériault, N. Thome, M. Cord, P. Pérez. IEEE Journal of Selected Topics in Signal Processing (IF: 4.36), p. 428-437, vol 8, Ap 2014.
Learning Deep Hierarchical Visual Feature Coding. [pdf]. H. Goh, N. Thome, M. Cord, J.H. Lim.
IEEE Transactions on Neural Networks and Learning Systems (IF: 7.89), p. 2212-2225, vol 12, Dec 2014.
SnooperText: A Text Detection System for Automatic Indexing of Urban Scenes. [Project Page] [pdf]
R. Minetto, N. Thome, M. Cord, N. Leite, J. Stolfi. Computer Vision and Image Understanding (IF: 2.4), p. 92-104, vol 122, May 2014.
JKernelMachines: A simple framework for Kernel Machines. [mloss] [pdf]. D. Picard, N. Thome and M. Cord.
Journal of Machine Learning Research (JMLR), track for Machine Learning Open Source Software, p. 1417-1421, vol 14, May 2013.
Extended Coding and Pooling in the HMAX Model. [Project Page] [pdf]
C. Theriault, N. Thome and M. Cord. IEEE Transactions on Image Processing (IF: 5.071), vol 22, num 2, p. 764-777, February 2013.
Pooling in Image Representation: the Visual Codeword Point of View. [Project Page] [pdf]
S. Avila, N. Thome, M. Cord, E. Valle and A. Araujo. Computer Vision and Image Understanding (IF: 2.4), Special Issue on Visual Concept Detection, , vol 117, num 5, p. 453-465, May 2013.
T-HOG: an Effective Gradient-Based Descriptor for Single Line Text Regions. [Project Page] [pdf]
R. Minetto, N. Thome, M. Cord, N. Leite, J. Stolfi. Pattern Recognition (PR) (IF: 3.96) , vol 46, num 3, p. 1078-1090, March 2013.
A Cognitive and Video-based Approach for Multinational License Plate Recognition.
N. Thome, A. Vacavant, L. Robinault, S. Miguet. Machine Vision and Applications (MVA) (IF: 1.306) ,
vol 22, num 2, p. 389-407, March 2011.
A Real-Time, MultiView Fall Detection System: a LHMM-Based Approach. [pdf]
N. Thome, S. Miguet, Sébastien Ambellouis. IEEE Transactions on Circuits and Systems for Video Technology (IF: 3.558) , Special Issue on Event Analysis in Videos, vol 18, Issue 11, p.1522-1532, November 2008.
Learning Articulated Appearance Models for Tracking Humans: a Spectral Graph Matching Approach. [pdf] N. Thome, D. Mérad, S. Miguet. Signal Processing: Image Communication (IF: 2.073) , vol. 23, issue 10, p.769-787, December 2008.

International Conferences

Full Contextual Attention for Multi-resolution Transformers in Semantic Segmentation. [pdf]
L. Themyr, C. Rambour, N. Thome, T. Collins, A. Hostettler. WACV 2023.
Complementing Brightness Constancy with Deep Networks for Optical Flow Prediction. [pdf]
V. Le Guen, C. Rambour, N. Thome. ECCV 2022. GitHub code
Hierarchical Average Precision Training for Pertinent Image Retrieval. [pdf]
E. Ramzi, N. Audebert, N. Thome, C. Rambour, X. Bitot. ECCV 2022.
Diverse probabilistic trajectory forecasting with admissibility constraints. [pdf]
L. Calem, H. Ben Younes, N. Thome, P. Pérez. ICPR 2022.
Swapping Semantic Contents for Mixing Images. [pdf]
R. Sun, C. Masson, G. Hénaff, N. Thome, M. Cord. ICPR 2022.
Robust and Decomposable Average Precision for Image Retrieval. [pdf]
E. Ramzi, N. Thome, C. Rambour, N. Audebert, X. Bitot. NeurIPS 2021. GitHub code
Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting. [pdf]
Y. Yin, V. Le Guen, J. Dona, I. Ayed, E. de Bezenac, N. Thome, P. Gallinari. ICLR 2021 (oral). GitHub code
Probabilistic Time Series Forecasting with Shape and Temporal Diversity. [pdf]
V. Le Guen, N. Thome. NeurIPS 2020. GitHub code
Disentangling Physical Dynamics from Unknown Factors for Unsupervised Video Prediction. [pdf]
V. Le Guen, N. Thome. CVPR 2020. GitHub code
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models. [pdf]
V. Le Guen, N. Thome. NeurIPS 2019. GitHub code
Addressing Failure Detection by Learning Model Confidence. [pdf]
C. Corbière, N. Thome, A. Bar-Hen, M. Cord, P. Pérez. NeurIPS 2019. GitHub code
DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing. [pdf]
E. Mehr, A. Jourdan, N. Thome, M. Cord, V. Guitteny. ICCV 2019.
MUREL: Multimodal Relational Reasoning for Visual Question Answering. [pdf]
H. Ben-younes, R. Cadene, M. Cord, N. Thome. CVPR 2019. GitHub code
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection. [pdf]. H. Ben-younes, R. Cadene, N. Thome, M. Cord. AAAI 2019. GitHub code
Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection. [pdf]
T. Mordan, N. Thome, G. Henaff, M. Cord. NeurIPS 2018.
HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning. [pdf]
T. Robert, N. Thome, M. Cord. ECCV 2018.
Manifold Learning in Quotient Spaces. [pdf]
E. Mehr, A. Lieutier, F. Sanchez, N. Thome, M. Cord, V. Guitteny. CVPR 2018.
SHADE: Information-Based Regularization for Deep Learning. [pdf]
M. Blot, T. Robert, N. Thome, M. Cord. ICIP 2018. ICIP'18 Best Paper Award
Cross-modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings. [pdf]
M. Carvalho, R. Cadene, D. Picard, L. Soulier, N. Thome, M. Cord. SIGIR 2018.
MUTAN: Multimodal Tucker Fusion for Visual Question Answering. [pdf]
H. Ben-younes, R. Cadene, M. Cord, N. Thome. ICCV 2017.
Deformable Part-based Fully Convolutional Network for Object Detection. [pdf]
T. Mordan, N. Thome, M. Cord, G. Henaff. BMCV 2017 (oral). BMVC'17 Best Science Paper Award
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation. [pdf]. T. Durand, T.Mordan, N. Thome, M. Cord. CVPR 2017.
WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks. [pdf]
T. Durand, N. Thome, M. Cord. CVPR 2016.
Gaze Latent Support Vector Machine for Image Classification. [pdf]
X. Wang, N. Thome, M. Cord. ICIP 2016.
Max-Min convolutional neural networks for image classification. [pdf]
M. Blot, N. Thome, M. Cord. ICIP 2016.
Deep Neural Netwrks Under Stress. [pdf]
M. Carvalho, M. Cord, S. Avila, N. Thome, E. Valle. ICIP 2016.
MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking. [pdf]
T. Durand, N. Thome, M. Cord. ICCV 2015.
LR-CNN For Fine-grained Classification with Varying Resolution. [pdf].
M. Chevalier, N. Thome, M. Cord, J. Fournier, G. Henaff and E. Dusch. ICIP 2015.
Exemplar Based Metric Learning For Robust Visual Localization. [pdf].
C. Le Barz, N. Thome, M. Cord, M. Sanfourche and S. Herbin. ICIP 2015.
Incremental Learning of Latent Structural SVM for Weakly Supervised Image Classification. [pdf]. T. Durand, N. Thome, M. Cord. ICIP 2014, Paris, France, 27-30 Oct 2014.
Semantic Pooling for Image Categorization using Multiple Kernel Learning. [pdf]
T. Durand, D. Picard, N. Thome, M. Cord. ICIP 2014, Paris, France, 27-30 Oct 2014.
Fantope Regularization in Metric Learning. [pdf]- [Project Page]
M. Law, N. Thome, M. Cord. CVPR 2014, Columbus, Ohio, USA, 24-27 June 2014.
Sequentially Generated Instance-Dependent Image Representations for Classification. [pdf]
G. Dulac-Arnold, L. Denoyer, N. Thome, M. Cord, P. Gallinari. ICLR 2014, Banff, Canada, 14-16 April 2014.
Top-Down Regularization of Deep Belief Networks. [pdf]
H. Goh, N. Thome, M. Cord, J.H. Lim. NIPS 2013, p 1878-1886, Lake Tahoe, Nevada, USA, 5-8 December 2013.
Quadruplet-wise Image Similarity Learning. [pdf] [Project Page]
M. Law, N. Thome, M. Cord. ICCV 2013 , Sydney, Australia, 3-6 December 2013.
Image Classification using Object Detectors. [pdf]
T. Durand, N. Thome, M. Cord, S. Avila. ICIP 2013, p. 4340-4344, Melbourne, Australia, 15-18 Sep 2013.
Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis. [pdf] C. Theriault, N. Thome, M. Cord. CVPR 2013, p 2603-2610, Portland, OR, USA, 23-28 June 2013.
Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines. [pdf]
H. Goh, N. Thome, M. Cord, J.H. Lim. ECCV 2012, p 298-311, Firenze, Italy, 7-13 Oct 2012.
Structural and visual comparisons for Web page archiving. [pdf]
M. Law, N. Thome, M. Cord, S. Gancarski. DocEng 2012.
Contextual Detection of drawn Symbols in old Maps. [pdf]
J. Guyomard, N. Thome, M. Cord, T. Artières. ICIP 2012, p 837-840, Orlando, USA, 2012.
BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task. [pdf]
S. Avila, N. Thome, M. Cord, E. Valle, A. Araùjo. CLEF 2012.
Learning geometric combinations of Gaussian kernels with alternating Quasi-Newton algorithm [pdf]
D. Picard, N. Thome, M. Cord, A. Rakotomamonjy. ESANN 2012.
Classification of Urban Scenes from Georeferenced Images in Urban Street-View Context
C. Iovan, D. Picard, N. Thome, M. Cord. ICMLA 2012.
HMAX-S: Deep scale representation for biologically inspired image categorization. [pdf]
C. Theriault, N. Thome, M. Cord. ICIP 2011, p 1261-1264, Brussels, 11-14 Sep 2011.
BOSSA: extended BoW formalism for image classification. [pdf]
S. Avila, N. Thome, M. Cord, E. Valle, A. de Albuquerque. ICIP 2011, p 2909-2912, ISBN: 978-1-4673-0062-9, Brussels, 11-14 Sep 2011.
SnooperTrack: Text Detection and Tracking for Outdoor Videos. [pdf]
R. Minetto, N. Thome, M. Cord, N. Leite, J. Stolfi. ICIP 2011, p 505-508, ISBN: 978-1-4577-1304-0, Brussels, 11-14 Sep 2011.
Learning Invariant Color Features with Sparse Topographic RBM. [pdf]
H. Goh, L. Kusmierz, J.H. Lim, N. Thome, M. Cord. ICIP 2011., p 1241-1244, Brussels, 11-14 Sep 2011.
Efficient Bag-of-Feature kernel representation for image similarity search. [pdf]
F. Precioso, M. Cord, D. Gorisse, N. Thome. ICIP 2011.p 109-112, Brussels, 11-14 Sep 2011.
Pedestrian head detection and tracking using graph skeleton for people counting in crowded environments. [pdf] K.E. Aziz, D. Merad, N. Thome, B. Fertil. MVA 2011.
People counting using skeleton graph and tracking.
K.E. Aziz, D. Merad, N. Thome, B. Fertil. SIPA 2011, Crete, Greece, 2011.
An efficient System for combining complementary kernels in complex visual categorization tasks. [pdf]
D. Picard, N. Thome, M. Cord. ICIP 2010 , p 3877-3880, Hong-Kong, 26-29 Sep 2010.
SnooperText: A Multiresolution System for Text Detection in Complex Visual Scenes. [pdf]
R. Minetto, N. Thome, M. Cord, J. Fabrizio, B. Marcotegui. ICIP 2010, p 3861-3864, Hong-Kong, 26-29 Sep 2010.
Fast People Counting using Head Detection from Skeleton Graph. [pdf]
D. Merad, K.E. Aziz, N. Thome. AVSS 2010, pp.233-240, Boston, 29 august-1 september 2010.
A Bottom/Up, View Point Invariant Human Detector. [pdf]
N. Thome and Sébastien Ambellouis. ICPR 2008, p. 1-4, Tampa, Florida, December 8-11 2008.
A Combined Statistical-Structural Strategy for Alphanumeric Recognition. [pdf]
N. Thome and Antoine Vacavant. ISVC 2007, p. 529-538 Lake Tahoe, Nevada, California, November 26--28 2007.
A HHMM-Based Approach for Robust Fall Detection. [pdf]
N. Thome and Serge Miguet. ICARCV 2006, P. 1-8, Singapore, 4-9 december 2006.
Human Body Part Labeling and Tracking Using Graph Matching Theory. [pdf]
N. Thome, Djamel Mérad and Serge Miguet. AVSS 2006 , p. 38-43, Sydney, 21-24 november 2006.
A Robust Appearance Model for Tracking Human Motions. [pdf]
N. Thome and Serge Miguet. AVSS 2005, p. 528-533, Como, 15-16 september 2005. Best Paper Award.

International Workshops

Residual Model-Based Reinforcement Learning for Physical Dynamics. [pdf]
Z. El Asri, C. Rambour, V. Le Guen, N. Thome. 3rd Offline RL Workshop: Offline RL as a "Launchpad", NeurIPS 2022.
Memory transformers for full context and high-resolution 3D Medical Segmentation. [pdf]
L. Themyr, C. Rambour, N. Thome, T. Collins, A. Hostettler. Machine Learning in Medical Imaging (MLMI) workshop, MICCAI 2022.
Adapting Multi-Input Multi-Output schemes to Vision Transformers. [pdf]
R. Sun,C. Masson, N. Thome, M. Cord. Workshop on Efficient Deep Learning for Computer Vision (ECV), CVPR 2022.
Towards efficient feature sharing in MIMO architectures. [pdf]
R. Sun, A. Ramé, C. Masson, N. Thome, M. Cord. Workshop on Transformers and Attention for vision (T4V), CVPR 2022.
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation. [pdf]
O. Petit, N. Thome, C. Rambour, L. Themyr, T. Collins, L. Soler. Machine Learning in Medical Imaging (MLMI) workshop, MICCAI 2021.
Beyond First-Order Uncertainty Estimation with Evidential Models for Open-World Recognition. [pdf]
C. Corbière, M. Lafon, N. Thome, M. Cord, P. Pérez. workshop on Uncertainty and Robustness in Deep Learning, ICML 2021.
Semantic Augmentation with Self-Supervised Content Mixing for Semi-Supervised Learning. [pdf]
R. Sun, C. Masson, G. Hénaff, N. Thome, M. Cord. Workshop on Self-Supervised Learning, Theory and Practice, NeurIPS 2020.
A Deep Physical Model for Solar Irradiance Forecasting with Fisheye Images. [pdf]
V. Le Guen, N. Thome. CVPR OMNI-CV workshop, 2020.
Prévision de l'irradiance solaire par réseaux de neurones profonds à l'aide de caméras au sol. [pdf]
V. Le Guen, N. Thome. GTETSI, XXVIIème Colloque francophonede traitement du signal et des images, Lille, 26-29 Aout 2019.
Biasing Deep ConvNets for Semantic Segmentation of Medical Images with a Prior-driven Prediction Function. [pdf]. O. Petit, N. Thome, L. Soler. Extended abstract in MIDL, London, 8-10 July 2019.
Multitask Classification and Segmentation for Cancer Diagnosis in Mammograph. [pdf]. T. Le, N. Thome, S. Bernard, V. Bismuth, F. Patoureaux. Extended abstract in MIDL, London, 8-10 July 2019.
Handling Missing Annotations for Semantic Segmentation with Deep ConvNets. [pdf]
O. Petit, N. Thome, A. Charnoz, A. Hostettler, L. Soler. 4th Workshop on Deep Learning in Medical Image Analysis, MICCAI 2018.
Fully Convolutional Neural Network for accurate segmentation in CT Data. [pdf]
B. Radharapu, N. Thome. Women in Machine Learning Workshop, NIPS 2017.
M2CAI Challenge: Convolutional Neural Networks for Video Frames Classification. [pdf] - [poster]
R. Cadene, T. Robert, N. Thome, M. Cord. . M2CAI Workshop, Workflow Challenge, MICCAI 2016.
Low resolution convolutional neural network for automatic target recognition.
M. Chevalier, N.home, M. Cord, G. Henaff, and E. Dusch. In 7th International Symposium on Optronics in Defence and Security, 2016.
Recipe Recognition with Large Multimodal Food Dataset. [pdf]
X. Wang, D. Kumar, N. Thome, M. Cord, F. Precioso. Workshop on Cooking and Eating Activities, ICME 2015.
Absolute geo-localization thanks to Hidden Markov Model and exemplar-based metric learning. [pdf]
C. Le Barz, N. Thome, M. Cord, Martial Sanfourche and S. Herbin. Workshop on Computer Vision in Vehicle Technology, CVPR 2015.
Hybrid Pooling Fusion in the BoW Pipeline. [pdf]
M. Law, N. Thome, and M. Cord. Workshop on Information fusion in computer vision for concept recognition, ECCV 2012.
Structural and Visual Similarity Learning for Web Page Archiving. [pdf]
M. Law, C. Sureda, N. Thome, S. Gancarski and M. Cord. 10th workshop on Content-Based Multimedia Indexing, CBMI 2012.
Text Detection and Recognition in Urban Scenes. [pdf]
R. Minetto, N. Thome, M. Cord, J. Stolfi, F. Precioso, J. Guyomard, N.J Leite. CVRS workshop - ICCV 2011.
Combining complementary kernels in complex visual categorization. [poster][abstract]
N. Thome, M. Cord and D. Picard. KDCV workshop - ICCV 2011, Barcelona, 6-13 Nov 2011.
Biasing Restricted Boltzmann Machines to Manipulate Latent Selectivity and Sparsity. [pdf] [Supplementary] H. Goh, N. Thome, M. Cord. NIPS 2010 Workshop on Deep Learning and Unsupervised Feature Learning, p 1-8, Vancouver, Canada, 2010.

Book Chapters

Beyond full supervision in deep learning [chapter]
N. Thome. Multi-faceted Deep Learning: Models and Data, Springer, 2021.
Cortical Networks of Visual Recognition
C. Thériault, N. Thome, and M. Cord. Biologically-inspired Computer Vision: Fundamentals and Applications, 2015.
Bag of Words Image Representation: Key Ideas and Further Insight
M. Law, N. Thome, and M. Cord. Fusion in Computer Vision - Understanding Complex Visual Content, 2014.

Habilitation to Drive Research (HDR)

Representations & Learning for Semantic Annotation of Visual Data [manuscript] - [slides] , 1st July 2015.

PhD Thesis

Hierarchical Representations for Shape Recognition, People Identification and Motion Analysis in Image Sequences, 11 July 2007.