Rechercher

[ZBP16] Nettoyage de données guidé par la sémantique inter-colonnes

Atelier, Poster ou Démonstration dans une Conférence Internationale : 16ème conférence Internationale Francophone sur l'Extraction et Gestion des Connaissances (EGC 2016), January 2016, pp.-, Reims, France,

Mots clés: Data quality, Big data, Semantic dependencies, Data cleaning

Résumé: Today, the volume of unstructured and heterogeneous data is exploding, coming from multiple sources with different levels of quality. Therefore, it is very probable to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data anomalies may be due to the poverty of their semantic descriptions, or even the absence of their descriptions. We propose an approach to understand better the semantics and the structure of the data. It helps to correct the intra-column anomalies (homogenization) and then the inter-columns ones caused by the violation of semantic dependencies

Equipe: sys

BibTeX

@inproceedings {
ZBP16,
title="{Nettoyage de données guidé par la sémantique inter-colonnes}",
author=" H. ZAIDI and F. Boufares and Y. Pollet ",
booktitle="{16ème conférence Internationale Francophone sur l'Extraction et Gestion des Connaissances (EGC 2016)}",
year=2016,
month="January",
pages="-",
address="Reims, France",
}