| |
XV Simpósio Brasileiro de Banco de Dados
Promovido pela SBC, Sociedade
Brasileira de Computação

Em cooperação com o ACM SIGMOD
João Pessoa -- Paraíba -- Brasil
2 a 4 de outubro 2000
| Tutorial
#1 |
| Título: |
Querying
the Web |
| Tipo: |
Avançado |
| Língua: |
Inglês |
| Autores: |
Daniela
Florescu - INRIA - França (Daniela.Florescu@inria.fr)
Juliana Freire - Bell Labs - EUA (juliana@research.bell-labs.com)
Ioana Manolescu - INRIA - França (Ioana.Manolescu@inria.fr)
|
| Resumo: |
The
Web is an ever growing source of information, but the distributed
nature of publishing on the Web has lead to a myriad of disconnected
services and data. Significant effort has been put into helping
people find the information they want, from keyword-based search
engines, to more sophisticated portals and shopping bots, which
provide a more structured and integrated view of Web sources. Given
the size of the Web, its dynamic nature, and the high degree of
heterogeneity of data and services, providing infrastructure for
querying and integrating Web information sources creates new challenges
and exciting opportunities for the database community.
This tutorial describes the key problems in querying the Web,
and presents recent research inspired by it. Its intended audience
includes researchers in database, software and application developers.
|
|
|
| Tutorial
#2 |
| Título: |
Web
Mining: Concepts, Practices and Research |
| Nível: |
Avançado |
| Língua: |
Inglês |
| Autores: |
Osmar
R. Zaiane (zaiane@cs.ualberta.ca)
Department of Computing Science
University of Alberta
Edmonton, Canadá |
| Resumo: |
Many associate Web
Mining with search engine technologies. Web mining is more than
just searching for resources on the Internet. Web mining is
a conglomerate of techniques to discover interesting patterns from
the large dynamic collection of interconnected resources that form
the World-Wide Web. Three distinct research areas constitute the
web mining discipline: Web Usage Mining, discovering patterns and
behaviours from web access logs; Web Content Mining, discovering
implicit knowledge from within web documents; and Web Structure
Mining, which exploits the presence of links to and from documents
to discover pertinent knowledge. This tutorial aims at shedding
light on these web mining techniques and current research, and expose
some open problems and challenges that researchers in industry and
academia are still facing. |
| |
| Tutorial
#3 |
| Título: |
Semistructured Data |
| Nível: |
Avançado |
| Língua: |
Português |
| Autores: |
Ronaldo
dos Santos Mello (ronaldo@inf.ufrgs.br)
Carina Friedrich Dorneles (dorneles@inf.ufrgs.br)
Adrovane Kade (adrovane@vitoria.upf.ufrgs.br)
Carlos Alberto Heuser (heuser@inf.ufrgs.br)
Instituto de Informática
Universidade Federal do Rio Grande do Sul (II/UFRGS) |
| Resumo: |
Semistructured data
are highly heterogeneous data that do not have a predefined associated
schema. An example is data in the Web, varying from text in natural
language to well-formatted records. This tutorial presents the state-of-art
of research on semistructured data, covering the following topics:
modelling, query languages, data extraction and relationship with
ontologies. |
| |
| Tutorial
#4 |
| Título: |
Genome Databases |
| Nível: |
Avançado |
| Língua: |
Português |
| Autores: |
Luiz
Fernando Bessa Seibel, PUC-Rio (seibel@inf.puc-rio.br)
Melissa Lemos, PUC-Rio (melissa@inf.puc-rio.br)
Sérgio Lifschitz, PUC-Rio (lifschitz@inf.puc-rio.br)
|
| Resumo: |
Genome
Databases are nowadays a fundamental tool for molecular biologists
and geneticists. Researchers in this area usually store sequences
and related anotations in databases so to later query them aiming
at specific biological analysis. Sequence comparisons and genes
discovery (besides their function and characteristics) are relevant
examples among all processing needs. In order to make these databases
useful and available, there are many important issues that need
to be discussed, e.g., which data model should be used; the choice
of algorithms for complex analysis on sequences; how to deal with
multiple copies of the same sequences; considerations on I/O optimization;
etc. There is still the problem of integrating all the existing
genome databases. Many different research groups have been sequencing
distinct organisms and each use its own database, often based on
different data models and technologies. The goal of this tutorial
is to present main genome databases approaches, data models and
algorithms being considered. The enphasys will be on research problems
from a databases point of view. |
|
|
|