Welcome to D
SIGMOD'00
PODS'00
 = PODS'00 Webs
 = Plenary Talk
<<< = PODS'00 Pape>>>
SIGMOD Recor
CIKM 2000/CI
COMAD 2000
Data Enginee
DL 2000
DPDJ
EDBT 2000
Hypertext 20
ICDE 2000
KDD 2000
KDD Explorat
KRDB 2000
SBBD 2000
SIGIR 2000
SIGIR Forum
SSDBM 2000
TODS
VLDB'00
VLDBJ

(Almost) Optimal Parallel Block Access for Range Queries


Mikhail J. Atallah and Sunil Prabhakar

  View Paper (PDF)  

Return to Range Queries / Selectivity Estimation


Abstract

Range queries are an important class of queries for several applications including relational and spatial databases, visualization, and GIS applications. For large datasets, the performance of range queries is limited by disk I/O. Performance improvements are achieved by tiling the multi-dimensional data and distributing it among multiple disks or nodes. Consequently, in order to process a range query, it is necessary to access only those tiles or blocks that intersect with the query. Given k disks, a query that accesses m blocks needs a number of parallel block accesses that is at least dm=ke (which is known to be unachievable except for a few special cases [1]). Though several schemes for the allocation of tiles to disks have been developed, no scheme with guaranteed worst-case performance is known. We establish that any range query on a 2q x 2q-block grid of blocks can be performed using k = 2t disks (t less than or equal to q), in at most dm=ke+O(log k) parallel block accesses. We also give two natural generalizations of the scheme to higher dimensions. We achieve this result by judiciously distributing the blocks among the k nodes or disks. Experimental data show that the algorithm achieves very close to dm=ke performance (on average less than 0.5 away from dm=ke, with a worst-case of 3).

Although several declustering schemes for range queries have been developed, prior to our work no additive non-trivial performance bounds were known. Our scheme guarantees performance within a (small) additive deviation from dm=ke. This guarantee is true for any number of dimensions. Subsequent to this work, Bhatiaet al. [4] have proved that such a performance bound is essentially optimal for this kind of scheme, and have also extended our results to the case where the number of disks is a product of the form k1*k2*...*kt where the kis need not be 2.


References


Note: References link to DBLP on the Web.

[1]
Khaled A. S. Abdel-Ghaffar , Amr El Abbadi : Optimal Allocation of Two-Dimensional Data. ICDT 1997 : 409-418
[2]
Stefan Berchtold , Christian Böhm , Bernhard Braunmüller , Daniel A. Keim , Hans-Peter Kriegel : Fast Parallel Similarity Search in Multimedia Databases. SIGMOD Conference 1997 : 1-12
[3]
Randeep Bhatia , Rakesh K. Sinha , Chung-Min Chen : Declustering Using Golden Ratio Sequences. ICDE 2000 : 271-280
[4]
Randeep Bhatia , Rakesh K. Sinha , Chung-Min Chen : Hierarchical Declustering Schemes for Range Queries. EDBT 2000 : 525-537
[5]
David Hung-Chang Du , J. S. Sobolewski : Disk Allocation for Cartesian Product Files on Multiple-Disk Systems. TODS 7(1) : 82-101(1982)
[6]
Christos Faloutsos , Pravin Bhagwat : Declustering Using Fractals. PDIS 1993 : 18-25
[7]
Myoung-Ho Kim , Sakti Pramanik : Optimal File Distribution For Partial Match Retrieval. SIGMOD Conference 1988 : 173-182
[8]
Jianzhong Li , Jaideep Srivastava , Doron Rotem : CMD: A Multidimensional Declustering Method for Parallel Data Systems. VLDB 1992 : 3-14
[9]
Sunil Prabhakar , Khaled A. S. Abdel-Ghaffar , Divyakant Agrawal , Amr El Abbadi : Cyclic Allocation of Two-Dimensional Data. ICDE 1998 : 94-101
[10]
Sunil Prabhakar , Divyakant Agrawal , Amr El Abbadi : Efficient Disk Allocation for Fast Similarity Searching. SPAA 1998 : 78-87
[11]
...

Referenced by

  1. Randeep Bhatia , Rakesh K. Sinha , Chung-Min Chen : Hierarchical Declustering Schemes for Range Queries. EDBT 2000 : 525-537

BIBTEX


@inproceedings{DBLP:conf/pods/AtallahP00,
  author    = {Mikhail J. Atallah and
                Sunil Prabhakar},
   title     = {(Almost) Optimal Parallel Block Access for Range Queries},
   booktitle = {Proceedings of the Nineteenth ACM SIGMOD-SIGACT-SIGART Symposium
                on Principles of Database Systems, May 15-17, 2000, Dallas, Texas,
                USA},
   publisher = {ACM},
   year      = {2000},
   isbn      = {1-58113-214-X},
   pages     = {205-215},
   crossref  = {DBLP:conf/pods/00},
   bibsource = {DBLP, http://dblp.uni-trier.de} } },




DiSC'01 Copyright ©2002 ACM Inc.