skip to main content
article
Free access

An application of fault tolerance patterns and coordinated atomic actions to a problem in railway scheduling

Published: 01 October 2000 Publication History

Abstract

Developing and applying advanced approaches for system structuring is vital for fighting ever-increasing complexity of modern and future software systems. The concept of Coordinated Atomic (CA) actions has been developed at Newcastle University for designing and structuring complex concurrent and distributed applications. Certain successful experience has been gained in applying them in several application areas. The purpose of the research, some initial results of which we report here, is twofold: to show how CA actions can be used in a new application area (a railway control system) and to analyse how the design patterns which have been developed using our previous experience can help in designing such system using CA actions.

References

[1]
D. Beder, C. Rubira, "A Meta-Level Software Architecture based on Patterns for Developing Dependable Collaboration-based Designs", Proc. 2nd Brazilian Workshop on Fault Tolerance (WTF'00), Curitiba, Brazil, 2000.
[2]
F. Buschmann, R. Meunier, H. Rohnert, P. Sommerlad, M. Stal, "A System of Patterns: Patterns-Oriented Software", John Wiley & Sons, 1996.
[3]
R. Campbell, B. Randell, "Error Recovery in Asynchronous Systems", IEEE TSE, SE-12, 8, 1986, 811-826.
[4]
E. Gamma, R. Helm, R. Johnson, J. Vlissides, "Design Patterns: Elements of Reusable Object-Oriented Software", Addison-Wesley, 1995.
[5]
A. Garcia, D. Beder, C. Rubira, "A Software Architecture Based on Patterns for Exceptional Condition Handling", Submitted to 5th IEEE Int. Symp. on High Assurance Systems Engineering (ISHASE'2000).
[6]
J. Gray & A. Reuter, "Transaction Processing: Concepts and Techniques", Morgan Kaufmann Publ., 1993.
[7]
W. Janssen, D. Severson, M. Spreitzer, "ILU 1.8 reference Manual", Xerox Corporation, 1995.
[8]
A. Oliva & L. Buzato, "Composition of Meta-Objects in Guaranáá", Proc. OOPSLA Workshop on Reflective Programming in C++ and Java, Vancouver, Canada, 86-90, 1998.
[9]
A. Romanovsky, J. Xu, B. Randell, "Coordinated Exception Handling in Real-Time Distributed Object Systems", Computer Systems Science and Engineering, 14, 4, 1999, 197-208.
[10]
A. Romanovsky, A. Zorzo, "Coordinated Atomic Actions as a Technique for Implementing Distributed Gamma Computation", Journal of Systems Architecture, 45, 15, 1999, 1357-1374.
[11]
C. Rubira, "Structuring Fault-Tolerant Object-Oriented Systems Using Inheritance and Delegation", PhD Thesis, Department of Computing Science, University of Newcastle upon Tyne, 1994.
[12]
C. Snow, "Distributed Real-Time Control of a Distributed Model Railway Layout", in preparation.
[13]
J. Vachon, D. Buchs, M. Buffo, G. Serugendo, B. Randell, A. Romanovsky, R. Stroud, J. Xu, "COALA --- A Formal Language for Co-ordinated Atomic Actions", 3rd Year Report, ESPRIT LTR Project on Design for Validation, 1998.
[14]
J. Xu, B. Randell, A. Romanovsky, C. Rubira, R. Stroud, Z. Wu, "Fault Tolerance in Concurrent Object-Oriented Software through Co-ordinated Error Recovery", Proc. 25th Int. Symp. on Fault-Tolerant Computing, Pasadena, USA, 1995, 499-508.
[15]
J. Xu, A. Romanovsky, B. Randell, "Coordinated Exception Handling in Distributed Object Systems: from Model to System Implementation", Proc. 18th Int. Conf. Distributed Computing Systems, Amsterdam, 1998, 26-29.
[16]
J. Xu, B. Randell, A. Romanovsky, R. J. Stroud, A. Zorzo, E. Canver, F. von Henke, "Rigorous Development of a Safety-Critical System Based on Coordinated Atomic Actions", Proc. 29th Int. Symp. on Fault-Tolerant Computing, Madison, USA, 1999, 68-75.
[17]
A. Zorzo, A. Romanovsky, J. Xu, B. Randell, R. Stroud, I. Welch, "Using Coordinated Atomic Actions to Design Complex Safety-Critical Systems: The Production Cell Case Study", Software: Practice & Experience, 29, 7, 1999, 1-21.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGOPS Operating Systems Review
ACM SIGOPS Operating Systems Review  Volume 34, Issue 4
October 2000
90 pages
ISSN:0163-5980
DOI:10.1145/506106
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2000
Published in SIGOPS Volume 34, Issue 4

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)8
Reflects downloads up to 15 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2017)A Formal Approach to implement java exceptions in cooperative systemsJournal of Systems and Software10.1016/j.jss.2016.07.033131:C(475-490)Online publication date: 1-Sep-2017
  • (2013)Special Issue on Dependable Software SystemsThe Computer Journal10.1093/comjnl/bxt01656:6(671-673)Online publication date: 1-Jun-2013
  • (2012)Coordinating Exceptions of Java SystemsProceedings of the 2012 Eighth International Conference on the Quality of Information and Communications Technology10.1109/QUATIC.2012.26(108-113)Online publication date: 2-Sep-2012
  • (2012)Scheduling and Rescheduling of Railway Operations: A Review and Expository AnalysisTechnology Operation Management10.1007/s13727-012-0006-x2:2(102-122)Online publication date: 7-Jun-2012
  • (2010)Formalization of an architectural model for exception handling coordination based on CA action conceptsScience of Computer Programming10.1016/j.scico.2009.12.00675:5(333-349)Online publication date: 1-May-2010
  • (2009)Improving reliability of cooperative concurrent systems with exception flow analysisJournal of Systems and Software10.1016/j.jss.2008.12.01582:5(874-890)Online publication date: 1-May-2009
  • (2009)Global-to-local approach to rigorously developing distributed system with exception handlingJournal of Computer Science and Technology10.1007/s11390-009-9220-924:2(238-249)Online publication date: 1-Mar-2009
  • (2007)COPA-CASEProceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems - Volume Part I10.5555/1780909.1780965(255-264)Online publication date: 25-Nov-2007
  • (2007)A pattern-based approach for modeling and analyzing error recoveryArchitecting dependable systems IV10.5555/1768283.1768291(115-141)Online publication date: 1-Jan-2007
  • (2007)Measuring Reliability as a Mean Failure CostProceedings of the 10th IEEE High Assurance Systems Engineering Symposium10.1109/HASE.2007.47(403-404)Online publication date: 14-Nov-2007
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media