Browse wiki

Abstract In today's world where distributed systems form many of our critical infrastructures, dependability outages are becoming increasingly common. In many situations, it is necessary to not just detect a failure, but also to diagnose the failure, i.e., to identify the source of the failure. Diagnosis is challenging since high throughput applications with frequent interactions between the different components allow fast error propagation. It is desirable to consider applications as black-boxes for the diagnosis process. In this paper, we propose a Monitor architecture for diagnosing failures in large-scale network protocols. The Monitor only observes the message exchanges between the protocol entities (PEs) remotely and does not access internal protocol state. At runtime, it builds a causal graph between the PEs based on their communication and uses this together with a rule base of allowed state transition paths to diagnose the failure. The tests used for the diagnosis are based on the rule base and are assumed to have imperfect coverage. The hierarchical Monitor framework allows distributed diagnosis handling Byzantine failures at individual Monitors. The framework is implemented and applied to a reliable multicast protocol executing on our campus-wide network. Fault injection experiments are carried out to evaluate the accuracy and latency of the diagnosis.
Author Gunjan Khanna, Mike Yu Cheng, Padma Varadharajan, Saurabh Bagchi, Miguel Correia, Paulo Verissimo
Journal IEEE Transactions on Dependable and Secure Computing, vol. 4, no. 4, pp. 266-279, Oct-Dec, 2007
Key Gunjan-khanna2007automated-rule-based-193
Missing ResearchLine
Month oct
NumPubDate 2,007.1
Title Automated Rule-Based Diagnosis Through A Distributed Monitor System
Type article
Url
Year 2007
Categories Publication
Modification date 14 January 2013 18:15:50