Welcome to DiSC 2002
SIGMOD 2001
 = SIGMOD'01 Website
 = SIGMOD/PODS'01 Plena
<<< = SIGMOD'01 Papers>>>
 = Demos
 = Industrial Sessions
 = Panels
 = Tutorials
PODS 2001
 SIGMOD RECORD 2001
CIKM 2001
CoopIS 2001
DASFAA 2001
DASFAA 2000
DBPL 2001
Data Engineering Bul
DEXA_EC-WEB 2001
DMKD 2001
 DPDJ 2001
HYPERTEXT 2001
ICDE 2001
ICDM 2001
ICDT 2001
JCDL 2001
KDD 2001
 KDD_EXPLORATIONS 20
KRDB 2001
MDM 2001
MIR 2001
MIS 2001
RIDE 2001
SBBD 2001
 SIGIR 2001
 SIGIR FORUM 2001
SSDBM 2001
SSTD 2001
TODS 2001
TIME 2001
VLDB 2001
VLDBJ 2001

VQBD: exploring semistructured data


Sudarshan S. Chawathe, Thomas Baby, and Jihwang Yeo

  View Paper (PDF)  

Return to Demostrations


Abstract

The VQBD (``vee­cubed'') project addresses the following problem: What is the best way to explore an XML document of unknown structure and content? We use data exploration to denote the interactive task of gathering the information needed to use data for purposes such as generating a re­ port, writing queries, building user interfaces, and writing applications. We focus on XML documents that are too large to browse in their entirety, even with the assistance of pretty­printing software (e.g., multi­megabyte or larger XML documents). In a relational or object database, the schema (e.g., table definitions, class definitions, integrity constraints, and stored procedures) provides some of the information necessary for writing queries and applications. However, the schema is rarely sufficient for these tasks. Typ­ ically, one must probe and browse the database to discover data coverage, typical and exceptional values, and other in­ formation required to gain a better understanding of the database. In an XML environment, the need for such data exploration is much greater because it is quite likely that the XML data of interest is not accompanied by a schema. In­ deed, much XML data is semistructured, meaning its struc­ ture is irregular, incomplete, and frequently changing. The rapid adoption of XML as a data exchange standard makes this semistructured data exploration problem increasingly important. The VQBD system allows the structured ex­ ploration of arbitrary XML data. We describe some key features very briefly below; a detailed description appears at http://www.cs.umd.edu/projects/vqbd/.


DiSC'02 © 2003 Association for Computing Machinery