...challenge in all of this is testing to ensure that the implementations are correct. Since most of the more interesting errors involve race conditions, we need to find ways of stopping each side at just the right time, sending the messages on the...
http://blogs.sun.com/scalinggames/entry/miracle_4
...toward hard (i.e., permanent) and soft (i.e., transient) malfunctions. System-level solutions for hard errors involve redundancy, and thus require the on-line connection of a stand-by unit and disconnection of the faulty unit...
http://www.ddj.com/embedded/197003159