Publication Details

Healing Data Races On-The-Fly

KŘENA Bohuslav, LETKO Zdeněk, TZOREF-BRILL Rachel, UR Shmuel and VOJNAR Tomáš. Healing Data Races On-The-Fly. In: Proceedings of 5th International Workshop on Parallel and Distributed Systems: Testing and Debugging Modelling - PADTAD'07. London: Association for Computing Machinery, 2007, pp. 54-64. ISBN 978-1-59593-734-6.
Czech title
Healing Data Races On-The-Fly
Type
conference paper
Language
english
Authors
Křena Bohuslav, Ing., Ph.D. (DITS FIT BUT)
Letko Zdeněk, Ing. (FIT BUT)
Tzoref-Brill Rachel (IBM)
Ur Shmuel (IBM)
Vojnar Tomáš, prof. Ing., Ph.D. (DITS FIT BUT)
Keywords

Concurrency, Testing, Self-Healing

Abstract

Testing of concurrent software is extremely difficult. Despite all the progress in the testing and verification technology, concurrent bugs, the most common of which are deadlocks and races, make it to the field. This paper describes a set of techniques, implemented in a tool called ConTest, allowing concurrent programs to self-heal at run-time.

Concurrent bugs have the very desirable property for healing that some of the interleaving produce correct results while in others bugs manifest. Healing concurrency problems is about limiting, or changing the probability of interleaving, such that bugs will be seen less.  When healing concurrent programs, if a deadlock does not result from limiting the interleaving, we are sure that the result of the healed program could have been in the original program and therefore no new functional bug has been introduced.

In this initial work which deals with different types of data races, we suggest three types of healing mechanisms:  (1)changing the probability of interleaving by introducing sleep or yield statements or by changing thread priorities, (2) removing interleaving using synchronisation commands like locking and unlocking certain mutexes or waits and notifies, and (3) removing the result of "bad interleaving" by replacing the value of variables by the one that "should" have been taken. We also classify races according to the relevant healing strategies to apply.

Published
2007
Pages
54-64
Proceedings
Proceedings of 5th International Workshop on Parallel and Distributed Systems: Testing and Debugging Modelling - PADTAD'07
Conference
PADTAD - Parallel and Distributed Systems: Testing and Debugging, Seattle, US
ISBN
978-1-59593-734-6
Publisher
Association for Computing Machinery
Place
London, GB
BibTeX
@INPROCEEDINGS{FITPUB8372,
   author = "Bohuslav K\v{r}ena and Zden\v{e}k Letko and Rachel Tzoref-Brill and Shmuel Ur and Tom\'{a}\v{s} Vojnar",
   title = "Healing Data Races On-The-Fly",
   pages = "54--64",
   booktitle = "Proceedings of 5th International Workshop on Parallel and Distributed Systems: Testing and Debugging Modelling - PADTAD'07",
   year = 2007,
   location = "London, GB",
   publisher = "Association for Computing Machinery",
   ISBN = "978-1-59593-734-6",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/8372"
}
Back to top