 |












|
|
Using Schema Matching to Simplify Heterogeneous Data Translation | Full Paper (PDF)
|
A broad spectrum of data is available on the Web in distinct heterogeneoussources, and stored under different formats.
As the number of systems that utilize this heterogeneous data grows, the importance of data translation and conversion mechanisms increases greatly.
In this paper we present a new translation system, based on schema-matching, aimed at simplifying the intricate task of data conversion.
We observe that in many cases the schema of the data in the source system is very similar to that of the target system.
In such cases, much of the translation work can be done automatically, based on the schemas similarity.
This saves a lot of effort for the user, limiting the amount of programming needed.
We define common schema and data models, in which schemas and data (resp.)from many common models can be represented.
Using a rule-based method, the source schema is compared with the target one, and each component in the source schema is matched with a corresponding component in the target schema.
Then, based on the matching achieved, data instances of the source schema can be translated to instances of the target schema.
We show that our schema-based translation system allows a convenient specification and customization of data conversions, and can be easily combinedwith the traditional data-based translation languages.
|
References, where available, link to the DBLP on the World Wide Web.
[1]Serge Abiteboul:
Querying Semi-Structured Data.
ICDT 1997: 1-18[2]Serge Abiteboul, Sophie Cluet, Tova Milo:
A Database Interface for File Updates.
SIGMOD Conference 1995: 386-397[3]Serge Abiteboul, Sophie Cluet, Tova Milo:
Correspondence and Translation for Heterogeneous Data.
ICDT 1997: 351-363[4]...
[5]Rafi Ahmed, Philippe De Smedt, Weimin Du, William Kent, Mohammad A. Ketabchi, Witold Litwin, Abbas Rafii, Ming-Chien Shan:
The Pegasus Heterogeneous Multidatabase System.
IEEE Computer 24(12): 19-27(1991)[6]Paolo Atzeni, Riccardo Torlone:
Schema Translation between Heterogeneous Data Models in a Lattice Framework.
DS-6 1995: 345-364[7]Carlo Batini, Maurizio Lenzerini, Shamkant B. Navathe:
A Comparative Analysis of Methodologies for Database Schema Integration.
Computing Surveys 18(4): 323-364(1986)[8]...
[9]...
[10]Peter Buneman, Susan B. Davidson, Gerd G. Hillebrand, Dan Suciu:
A Query Language and Optimization Techniques for Unstructured Data.
SIGMOD Conf. 1996: 505-516[11]Peter Buneman, Susan B. Davidson, Anthony Kosky:
Theoretical Aspects of Schema Merging.
EDBT 1992: 152-167[12]Michael J. Carey, Laura M. Haas, Peter M. Schwarz, Manish Arya, William F. Cody, Ronald Fagin, Myron Flickner, Allen Luniewski, Wayne Niblack, Dragutin Petkovic, Joachim Thomas II, John H. Williams, Edward L. Wimmers:
Towards Heterogeneous Multimedia Information Systems: The Garlic Approach.
RIDE-DOM 1995: 124-131[13]Ti-Pin Chang, Richard Hull:
Using Witness Generators to Support Bi-directional Update Between Object-Based Databases.
PODS 1995: 196-207[14]Sudarshan S. Chawathe, Hector Garcia-Molina, Joachim Hammer, Kelly Ireland, Yannis Papakonstantinou, Jeffrey D. Ullman, Jennifer Widom:
The TSIMMIS Project: Integration of Heterogeneous Information Sources.
IPSJ 1994: 7-18[15]Vassilis Christophides, Serge Abiteboul, Sophie Cluet, Michel Scholl:
From Structured Documents to Novel Query Facilities.
SIGMOD Conference 1994: 313-324[16]Sophie Cluet, Claude Delobel, Jérôme Siméon, Katarzyna Smaga:
Your Mediators Need Data Conversion!
SIGMOD Conference 1998: 177-188[17]Peter Buneman, Susan B. Davidson, Anthony Kosky:
Semantics of Database Transformations.
Semantics in Databases 1995: 55-91[18]Susan B. Davidson, Anthony Kosky:
WOL: A Language for Database Transformations and Constraints.
ICDE 1997: 55-65[19]...
[20]Alon Y. Levy, Anand Rajaraman, Joann J. Ordille:
Querying Heterogeneous Information Sources Using Source Descriptions.
VLDB 1996: 251-262[21]R. J. Miller, Yannis E. Ioannidis, Raghu Ramakrishnan:
The Use of Information Capacity in Schema Integration and Translation.
VLDB 1993: 120-133[22]R. J. Miller, Yannis E. Ioannidis, Raghu Ramakrishnan:
Schema equivalence in heterogeneous systems: bridging theory and practice.
IS 19(1): 3-31(1994)[23]Yannis Papakonstantinou, Hector Garcia-Molina, Jeffrey D. Ullman:
MedMaker: A Mediation System Based on Declarative Specifications.
ICDE 1996: 132-141[24]Yannis Papakonstantinou, Hector Garcia-Molina, Jennifer Widom:
Object Exchange Across Heterogeneous Information Sources.
ICDE 1995: 251-260[25]Gio Wiederhold:
Intelligent Integration of Information - Foreword.
JIIS 6(2/3): 93-98(1996)[26]...
[27]...
|
@inproceedings{DBLP:conf/vldb/MiloZ98, author = {Tova Milo and Sagit Zohar}, editor = {Ashish Gupta and Oded Shmueli and Jennifer Widom}, title = {Using Schema Matching to Simplify Heterogeneous Data Translation}, booktitle = {VLDB'98, Proceedings of 24rd International Conference on Very Large Data Bases, August 24-27, 1998, New York City, New York, USA}, publisher = {Morgan Kaufmann}, year = {1998}, isbn = {1-55860-566-5}, pages = {122-133}, crossref = {DBLP:conf/vldb/98}, bibsource = {DBLP, http://dblp.uni-trier.de} }
|
DBLP: Copyright ©1999 by Michael Ley (ley@uni-trier.de).
|
|