Augmenting Data Management with AI and Machine Learning

In this lecture, I will focus on the recent trend in data management of leveraging Artificial Intelligence (AI) techniques and Machine Learning (ML) algorithms. AI and ML have already shown tremendous potential to reshape the foundations, algorithms, and models of several data management tasks, such as data cleaning, data integration, data analytics, and query inference. Parts of the data preparation, standardization, and cleaning processes, such as entity matching and deduplication, could be automated by training an ML model to learn and predict matches routinely. Reasoning techniques, as implemented in AI and knowledge-based systems, can be used during entity resolution to capture unforeseen entity matches. Data integration is a classical problem at the intersection of Artificial Intelligence and database systems. It can also benefit from ML, since the data to be integrated can be sampled and used to design the data integration algorithms. The material of the lecture is partly based on an international tutorial previously given by the lecturer.


Angela Bonifati is a Full Professor and the head of the Database Group at Lyon 1 University in France. She received a PhD from Politecnico di Milano in 2002 and was a postdoctoral researcher at Inria in Paris until 2003. 
Her current research interests lie in the interplay of relational and graph-shaped data paradigms, particularly query processing, data integration and curation, metadata management, and learning for these data models.
On these topics, she has co-authored more than 100 publications in top-tier conferences and journals of the data management field.
She is the Program Chair of EDBT 2020, an Associate Editor of PVLDB (2020-2022), the Demo Co-Chair of ICDE 2020, and the Workshops Co-Chair of SIGMOD 2019 and SIGMOD 2020.
She was Vice Chair of ICDE 2018 for the information extraction, data cleaning, and curation track and Vice Chair of ICDE 2011 for the semi-structured data track.
She is an Associate Editor for several journals, including the VLDB Journal, ACM Transactions on Database Systems (TODS) and Distributed and Parallel Databases.
She has held several visiting professor positions, the latest of which was at the University of Waterloo (Canada) in 2019.