Source Identification and Query Rewriting In Open Xml Data Integration Systems

[BSS07] Source Identification and Query Rewriting In Open Xml Data Integration Systems

Revue Internationale avec comité de lecture : Journal IADIS'07 Int. J. on WWW/Internet, vol. 5(1), pp. 29-44, 2007
motcle:
Résumé: This paper presents OpenXView, a model for open, large scale XML data integration systems, characterized by the autonomy of users that publish XML data on a common topic. Autonomy implies frequent and unpredictable changes to data and a high degree of structure heterogeneity. OpenXView provides an original integration schema, based on an hybrid ontology - XML schema structure model. We propose solutions for several important problems in such systems: easy access to data through a simple query language over the common schema, simple data integration view management when data changes and scalable query rewriting algorithms. This paper focuses on source identification for query rewriting in OpenXView, i.e. the computation of combinations of sources that can answer a user query. It proposes two algorithms for minimal source combinations, scalable with the number of sources. The first one is based on a general branch-and-bound strategy, while the second one, very efficient, is limited to queries whose number of attributes is no more than 8, sufficient in most applications.

Collaboration: ETIS
@article {
BSS07,
title="{ Source Identification and Query Rewriting In Open Xml Data Integration Systems}",
author="F. Boisson and M. Scholl and I. Sebei and D. Vodislav",
journal="IADIS'07 Int. J. on WWW/Internet",
year=2007,
volume=5,
number=1,
pages="29-44",
}

Agenda

rss Suivre le laboratoire
 

Contacts

CNAM-CEDRIC
292 Rue St Martin
FR-75141 Paris Cedex 03
Tel: +33 01 40 27 22 96
Fax: +33 01 40 27 22 96


ENSIIE-CEDRIC
1 square de la résistance
FR-91025 EVRY
Tel: +33 01 69 36 73 05
Fax: +33 01 69 36 73 05