Welcome to D
SIGMOD 2003
PODS 2003
SIGMOD-RECOR
ADBIS
CIDR 2003
CIKM 2003
DASFAA 2003
Data Enginee
DEBS
DMKD 2003
DOLAP 2003
DPDJ 2003
ER
GIS 2003
Hypertext 20
ICDE 2003
ICDM 2003
ICDT 2003
JCDL 2003
KRDB 2003
MIR 2003
MIS 2003
MMDB 2003
RIDE 2003
SBBD 2003
SIGIR 2003
SIGIR-FORUM
SIGKDD 2003
SIGKDD-EXP
SSDBM 2003
TIME 2003
TODS
VLDB 2003
VLDB Journal
WIDM 2003
About DiSC 2
Editorial Bo
Acknowledgem
DiSC 2004 Pr
ADVIS
DiSC'04 Feed
DiSC'04 Site
Search DiSC'
Author Index
Copyright No

The Data-Centric Revolution in Networking

Historically, there has been little overlap between the database and networking research communities; they operate on very different levels and focus on very different issues. While this strict separation of concerns has lasted for many years, in this talk I will argue that the gap has recently narrowed to the point where the two fields now have much to say to each other.

Networking research has traditionally focused on enabling communication between network hosts. This research program has produced a myriad of specific algorithms and protocols to solve such problems as error recovery, congestion control, routing, multicast and quality-of-service. It has also led to a set of general architectural principles, such as fate sharing and the end-to-end principle, that provide widely applicable guidelines for allocating functionality among network entities.

This research and design paradigm has been exclusively host-centric; hosts are assumed to know which other hosts (or multicast groups) to contact, and the research focuses on making the resulting host-host communication robust and efficient. However, an increasing number of applications involve accessing particular data objects whose location can't easily be determined within the current Internet architecture. Networking researchers have consequently begun looking at a variety of approaches that are more data-centric than host-centric, in that the basic abstractions refer to the name of the data rather than its location.

This data-centric trend is most visible in two areas of networking research: sensornets and distributed hash tables. Data-centrism is natural for sensornets because the identity of individual nodes is far less important than the data they collect. Traditional networking has toyed with data-centrism in various limited forms (e.g., web redirection, intentional naming), but the recent advent of distributed hash tables has led to a much broader and more explicit engagement with the data-centric paradigm. In both the sensornet and traditional Internet cases, data-centric research initially focused on how to efficiently access data based on logical names. More recent research has used distributed data structures to support more general queries.

Networking researchers have thus unwittingly wandered into the Coddian world of physical data independence, a territory far more familiar to database researchers. This talk will describe our journey to your land.




©2004 Association for Computing Machinery