Efficient and Transparent Application Recovery in Client-Server Information Systems
David B. Lomet, Gerhard Weikum
Full Paper (PDF)

Slides (HTML)

Abstract
Database systems recover persistent data, providing high database availability. However, database applications, typically residing on client or middle-tier application-server machines, may lose work because of a server failure. This prevents the masking of server failures from the human user and substantially degrades application availability. This paper aims to enable high application availability with an integrated method for database server recovery and transparent application recovery in a client-server system. The approach, based on application message logging, is similar to earlier work on distributed system fault tolerance. However, we exploit advanced database logging and recovery techniques and request/reply messaging properties to significantly improve efficiency. Forced log I/Os, frequently required by other methods, are usually avoided. Restart time, for both failed server and failed client, is reduced by checkpointing and log truncation. Our method ensures that a server can recover independently of clients. A client may reduce logging overhead in return for dependency on server availability during client restart.

References

References, where available, link to the DBLP on the World Wide Web.

[Alvisi95]
...
[Bartlett81]
Joel F. Bartlett: A NonStop Kernel. SOSP 1981: 22-19
[Bernstein97]
Philip A. Bernstein, Brian Harry, Paul Sanders, David Shutt, Jason Zander: The Microsoft Repository. VLDB 1997: 3-12
[Bernstein90]
Philip A. Bernstein, Meichun Hsu, Bruce Mann: Implementing Recoverable Requests Using Queues. SIGMOD Conference 1990: 112-122
[Borg83]
Anita Borg, Jim Baumbach, Sam Glazer: A Message System Supporting Fault Tolerance. SOSP 1983: 90-99
[Borg89]
Anita Borg, Wolfgang Blau, Wolfgang Graetsch, Ferdinand Herrmann, Wolfgang Oberle: Fault Tolerance Under UNIX. TOCS 7(1): 1-24(1989)
[Borr81]
Andrea J. Borr: Transaction Monitoring in ENCOMPASS: Reliable Distributed Transaction Processing. VLDB 1981: 155-165
[Bulterman95]
...
[Chandy85]
K. Mani Chandy, Leslie Lamport: Distributed Snapshots: Determining Global States of Distributed Systems. TOCS 3(1): 63-75(1985)
[Elmagarmid92]
...
[Elnozahy96]
...
[Georgakopoulos95]
Dimitrios Georgakopoulos, Mark F. Hornick, Amit P. Sheth: An Overview of Workflow Management: From Process Modeling to Workflow Automation Infrastructure. Distributed and Parallel Databases 3(2): 119-153(1995)
[Gray93]
Jim Gray, Andreas Reuter: Transaction Processing: Concepts and Techniques. Morgan Kaufmann 1993, ISBN 1-55860-190-2
Contents
[Huang95]
...
[Johnson87]
...
[Kaiser97]
Gail E. Kaiser, Jim Whitehead: Collaborative Work: Distributed Authoring and Versioning. IEEE Internet Computing 1(2): 76-77(1997)
[Kim84]
Won Kim: Highly Available Systems for Database Applications. Computing Surveys 16(1): 71-98(1984)
[Lomet92]
David B. Lomet: MLR: A Recovery Method for Multi-level Systems. SIGMOD Conference 1992: 185-194
[Lomet95]
David B. Lomet, Mark R. Tuttle: Redo Recovery after System Crashes. VLDB 1995: 457-468
[Lomet97]
...
[Lomet98]
David B. Lomet: Persistent Applications Using Generalized Redo Recovery. ICDE 1998: 154-163
[Lomet98a]
...
[Mohan92]
C. Mohan, Donald J. Haderle, Bruce G. Lindsay, Hamid Pirahesh, Peter Schwarz: ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial Rollbacks Using Write-Ahead Logging. TODS 17(1): 94-162(1992)
[Mohan93]
C. Mohan: A Cost-Effective Method for Providing Improved Data Availability During DBMS Restart Recovery After a Failure. VLDB 1993: 368-379
[Ramamritham96]
...
[Strom85]
Robert E. Strom, Shaula Yemini: Optimistic Recovery in Distributed Systems. TOCS 3(3): 204-226(1985)
[Strom88]
...
[Weikum90]
Gerhard Weikum, Christof Hasse, Peter Brössler, Peter Muth: Multi-Level Recovery. PODS 1990: 109-123
[Weikum93]
Gerhard Weikum, Christof Hasse: Multi-Level Transaction Management for Complex Objects: Implementation, Performance, Parallelism. VLDB Journal 2(4): 407-453(1993)
BIBTEX

@inproceedings{DBLP:conf/sigmod/LometW98,
author = {David B. Lomet and
Gerhard Weikum},
editor = {Laura M. Haas and
Ashutosh Tiwary},
title = {Efficient and Transparent Application Recovery in Client-Server
Information Systems},
booktitle = {SIGMOD 1998, Proceedings ACM SIGMOD International Conference
on Management of Data, June 2-4, 1998, Seattle, Washington, USA},
publisher = {ACM Press},
year = {1998},
isbn = {0-89791-955-5},
pages = {460-471},
crossref = {DBLP:conf/sigmod/98},
bibsource = {DBLP, http://dblp.uni-trier.de}
}


DBLP: Copyright ©1999 by Michael Ley (ley@uni-trier.de).