Welcome to D
SIGMOD'00
 = SIGMOD'00 We
 = Plenary Talk
<<< = SIGMOD'00 Pa>>>
PODS'00
SIGMOD Recor
CIKM 2000/CI
COMAD 2000
Data Enginee
DL 2000
DPDJ
EDBT 2000
Hypertext 20
ICDE 2000
KDD 2000
KDD Explorat
KRDB 2000
SBBD 2000
SIGIR 2000
SIGIR Forum
SSDBM 2000
TODS
VLDB'00
VLDBJ

MOCHA: A Self-Extensible Database Middleware System for Distributed Data Sources


Manuel Rodriguez-Martinez and Nick Roussopoulos

  View Paper (PDF)  

Return to Research Sessions


Abstract

We present MOCHA, a new self-extensible database middleware system designed to interconnect distributed data sources. MOCHA is designed to scale to large environments and is based on the idea that some of the user-defined functionality in the system should be deployed by the middleware system itself. This is realized by shipping Java code implementing either advanced data types or tailored query operators to remote data sources and have it executed remotely. Optimized query plans push the evaluation of powerful data-reducing operators to the data source sites while executing data-inflating operators near the client's site. The Volume Reduction Factor is a new and more explicit metric introduced in this paper to select the best site to execute query operators and is shown to be more accurate than the standard selectivity factor alone. MOCHA has been implemented in Java and runs on top of Informix and Oracle. We present the architecture of MOCHA, the ideas behind it, and a performance study using scientific data and queries. The results of this study demonstrate that MOCHA provides a more flexible, scalable and efficient framework for distributed query processing compared to those in existing middleware solutions.


References


Note: References link to DBLP on the Web.

[CGMH+94]
Sudarshan S. Chawathe , Hector Garcia-Molina , Joachim Hammer , Kelly Ireland , Yannis Papakonstantinou , Jeffrey D. Ullman , Jennifer Widom : The TSIMMIS Project: Integration of Heterogeneous Information Sources. IPSJ 1994 : 7-18
[Inf97]
...
[Ora99]
...
[CS96]
Surajit Chaudhuri , Kyuseok Shim : Optimization of Queries with User-defined Predicates. VLDB 1996 : 87-98
[FJK96]
Michael J. Franklin , Björn Þór Jónsson , Donald Kossmann : Performance Tradeoffs for Client-Server Query Processing. SIGMOD Conf. 1996 : 149-160
[GMSvE98]
Michael Godfrey , Tobias Mayr , Praveen Seshadri , Thorsten von Eicken : Secure and Portable Database Extensibility. SIGMOD Conference 1998 : 390-401
[Gra93]
Goetz Graefe : Query Evaluation Techniques for Large Databases. ACM Computing Surveys 25(2) : 73-170(1993)
[HKWY97]
Laura M. Haas , Donald Kossmann , Edward L. Wimmers , Jun Yang : Optimizing Queries Across Diverse Data Sources. VLDB 1997 : 276-285
[HS93]
Joseph M. Hellerstein , Michael Stonebraker : Predicate Migration: Optimizing Queries with Expensive Predicates. SIGMOD Conference 1993 : 267-276
[ML86]
Lothar F. Mackert , Guy M. Lohman : R* Optimizer Validation and Performance Evaluation for Distributed Queries. VLDB 1986 : 149-159
[MS99]
Tobias Mayr , Praveen Seshadri : Client-Site Query Extensions. SIGMOD Conference 1999 : 347-358
[RMR00a]
Manuel Rodriguez-Martinez , Nick Roussopoulos : Automatic Deployment of Application-Specific Metadata and Code in MOCHA. EDBT 2000 : 69-85
[RMR00b]
...
[RS97]
Mary Tork Roth , Peter M. Schwarz : Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources. VLDB 1997 : 266-275
[SAC+79]
Patricia G. Selinger , Morton M. Astrahan , Donald D. Chamberlin , Raymond A. Lorie , Thomas G. Price : Access Path Selection in a Relational Database Management System. SIGMOD Conference 1979 : 23-34
[SLR97]
Praveen Seshadri , Miron Livny , Raghu Ramakrishnan : The Case for Enhanced Abstract Data Types. VLDB 1997 : 66-75
[Sto93]
Michael Stonebraker , James Frew , Kenn Gardels , Jeff Meredith : The Sequoia 2000 Benchmark. SIGMOD Conference 1993 : 2-11
[TRV96]
Anthony Tomasic , Louiqa Raschid , Patrick Valduriez : Scaling Heterogeneous Databases and the Design of Disco. ICDCS 1996 : 449-457

Referenced by

  1. M. Tamer Özsu : Review - MOCHA: A Self-Extensible Database Middleware System for Distributed Data Sources. ACM SIGMOD Digital Review 2 : (2000)
  2. Manuel Rodriguez-Martinez , Nick Roussopoulos : Automatic Deployment of Application-Specific Metadata and Code in MOCHA. EDBT 2000 : 69-85

BIBTEX


@inproceedings{DBLP:conf/sigmod/RodriguezR00,
  author    = {Manuel Rodriguez-Martinez and
                Nick Roussopoulos},
   editor    = {Weidong Chen and
                Jeffrey F. Naughton and
                Philip A. Bernstein},
   title     = {MOCHA: A Self-Extensible Database Middleware System for Distributed
                Data Sources},
   booktitle = {Proceedings of the 2000 ACM SIGMOD International Conference on
                Management of Data, May 16-18, 2000, Dallas, Texas, USA},
   journal   = {SIGMOD Record},
   publisher = {ACM},
   volume    = {29},
   number    = {2},
   year      = {2000},
   isbn      = {1-58113-218-2},
   pages     = {213-224},
   crossref  = {DBLP:conf/sigmod/2000},
   bibsource = {DBLP, http://dblp.uni-trier.de} } },




DiSC'01 Copyright ©2002 ACM Inc.