“Node Failure Detection and Membership in CANELy”
in Proceedings of the IEEE International Conference on Dependable Systems and Networks (DSN03). San Francisco, California, USA, June 2003., Jun. 2003.
Abstract: Fault-tolerant distributed systems based on fieldbuses may benefit to a great extent from the availability of semantically rich communication services, such as those provided by group communication, clock synchronization, membership and failure detection. This is specially true of distributed critical control applications. However, the migration of those services to the realm of simple fieldbuses, such as the native Controller Area Network (CAN) protocol family, presents non-negligible problems, since it lacks most of the functionality required of a fault-tolerant distributed system, such as reliable message broadcast guarantees, distributed node failure detection, and site membership services. As part of our endeavor to design a CAN-based infrastructure support for extremely reliable distributed computer control, dubbed CAN Enhanced Layer (CANELy), we have been addressing the problem of fault-tolerant communications on fieldbuses in a comprehensive way. In this paper, we show that node failure detection and site membership services can be efficiently supported by a simple software layer built on top of an exposed CAN controller interface.
Research line(s): Timeliness and Adaptation in Dependable Systems (TADS)