SBBD 2000 - Tutoriais

[Voltar] [Página inicial]

SBBD'2000 - Tutoriais

XV Simpósio Brasileiro de Banco de Dados
Promovido pela SBC, Sociedade Brasileira de Computação
Em cooperação com o ACM SIGMOD

João Pessoa -- Paraíba -- Brasil
2 a 4 de outubro 2000

Tutorial #1
Título:	Querying the Web
Tipo:	Avançado
Língua:	Inglês
Autores:	Daniela Florescu - INRIA - França (Daniela.Florescu@inria.fr) Juliana Freire - Bell Labs - EUA (juliana@research.bell-labs.com) Ioana Manolescu - INRIA - França (Ioana.Manolescu@inria.fr)
Resumo:	The Web is an ever growing source of information, but the distributed nature of publishing on the Web has lead to a myriad of disconnected services and data. Significant effort has been put into helping people find the information they want, from keyword-based search engines, to more sophisticated portals and shopping bots, which provide a more structured and integrated view of Web sources. Given the size of the Web, its dynamic nature, and the high degree of heterogeneity of data and services, providing infrastructure for querying and integrating Web information sources creates new challenges and exciting opportunities for the database community. This tutorial describes the key problems in querying the Web, and presents recent research inspired by it. Its intended audience includes researchers in database, software and application developers.

Tutorial #2
Título:	Web Mining: Concepts, Practices and Research
Nível:	Avançado
Língua:	Inglês
Autores:	Osmar R. Zaiane (zaiane@cs.ualberta.ca) Department of Computing Science University of Alberta Edmonton, Canadá
Resumo:	Many associate Web Mining with search engine technologies. Web mining is more than just searching for resources on the Internet. Web mining is a conglomerate of techniques to discover interesting patterns from the large dynamic collection of interconnected resources that form the World-Wide Web. Three distinct research areas constitute the web mining discipline: Web Usage Mining, discovering patterns and behaviours from web access logs; Web Content Mining, discovering implicit knowledge from within web documents; and Web Structure Mining, which exploits the presence of links to and from documents to discover pertinent knowledge. This tutorial aims at shedding light on these web mining techniques and current research, and expose some open problems and challenges that researchers in industry and academia are still facing.

Tutorial #3
Título:	Semistructured Data
Nível:	Avançado
Língua:	Português
Autores:	Ronaldo dos Santos Mello (ronaldo@inf.ufrgs.br) Carina Friedrich Dorneles (dorneles@inf.ufrgs.br) Adrovane Kade (adrovane@vitoria.upf.ufrgs.br) Carlos Alberto Heuser (heuser@inf.ufrgs.br) Instituto de Informática Universidade Federal do Rio Grande do Sul (II/UFRGS)
Resumo:	Semistructured data are highly heterogeneous data that do not have a predefined associated schema. An example is data in the Web, varying from text in natural language to well-formatted records. This tutorial presents the state-of-art of research on semistructured data, covering the following topics: modelling, query languages, data extraction and relationship with ontologies.

Tutorial #4
Título:	Genome Databases
Nível:	Avançado
Língua:	Português
Autores:	Luiz Fernando Bessa Seibel, PUC-Rio (seibel@inf.puc-rio.br) Melissa Lemos, PUC-Rio (melissa@inf.puc-rio.br) Sérgio Lifschitz, PUC-Rio (lifschitz@inf.puc-rio.br)
Resumo:	Genome Databases are nowadays a fundamental tool for molecular biologists and geneticists. Researchers in this area usually store sequences and related anotations in databases so to later query them aiming at specific biological analysis. Sequence comparisons and genes discovery (besides their function and characteristics) are relevant examples among all processing needs. In order to make these databases useful and available, there are many important issues that need to be discussed, e.g., which data model should be used; the choice of algorithms for complex analysis on sequences; how to deal with multiple copies of the same sequences; considerations on I/O optimization; etc. There is still the problem of integrating all the existing genome databases. Many different research groups have been sequencing distinct organisms and each use its own database, often based on different data models and technologies. The goal of this tutorial is to present main genome databases approaches, data models and algorithms being considered. The enphasys will be on research problems from a databases point of view.

[Voltar] [Página inicial]