A distributed system generally has a common goal, such as solving a computational or logical problem, and each autonomous system in it has an individual user working on a separate part of that problem. Resource sharing is a key factor in any distributed system, and the literature discusses several techniques for sharing resources among these autonomous systems.

The individual computational nodes, or computers, are monitored across the distributed system so that the overall problem can be solved, and this monitoring includes recording every fault that may arise in the individual autonomous systems.
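As an illustration of the monitoring described above, the following is a minimal sketch that assumes a heartbeat scheme: each node periodically reports that it is alive, and the monitor records a fault for any node that stays silent longer than a timeout. The names `NodeMonitor`, `FaultRecord`, `heartbeat`, `report_fault` and `check`, the fault label `missed_heartbeat`, and the 5-second timeout are all hypothetical choices made for this example and do not come from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class FaultRecord:
    """One recorded fault: which node, what kind, and when it was noticed."""
    node: str
    kind: str
    time: float


@dataclass
class NodeMonitor:
    """Hypothetical monitor; timeout is the allowed silence in seconds."""
    timeout: float
    last_seen: dict = field(default_factory=dict)
    faults: list = field(default_factory=list)

    def heartbeat(self, node, now):
        # Remember the most recent time this node reported in.
        self.last_seen[node] = now

    def report_fault(self, node, kind, now):
        # Append a fault to the log kept by the monitor.
        self.faults.append(FaultRecord(node, kind, now))

    def check(self, now):
        # Flag every node whose last heartbeat is older than the timeout.
        silent = [n for n, t in self.last_seen.items() if now - t > self.timeout]
        for n in silent:
            self.report_fault(n, "missed_heartbeat", now)
        return silent


monitor = NodeMonitor(timeout=5.0)
monitor.heartbeat("node-a", now=0.0)
monitor.heartbeat("node-b", now=0.0)
monitor.heartbeat("node-a", now=4.0)   # node-a keeps reporting, node-b does not
silent_nodes = monitor.check(now=6.0)
```

Timestamps are passed in explicitly rather than read from a clock, which keeps the sketch deterministic. A real monitor would also have to decide how to avoid logging the same silent node on every check.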

A distributed system should be designed so that it can tolerate the faults caused by the individual systems that form the group of autonomous systems.
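One common way to tolerate the failure of a single node is failover: if a task fails on one node, it is retried on another. The sketch below assumes this technique purely for illustration, since the paper does not name a specific fault-tolerance mechanism here. The function `run_with_failover`, the task `flaky_task` and the node names are hypothetical.

```python
def run_with_failover(task, nodes):
    """Try the task on each node in order; return the first success.

    Returns (node, result, errors), where errors maps each failed node
    to the exception it raised. Raises RuntimeError if every node fails.
    """
    errors = {}
    for node in nodes:
        try:
            result = task(node)
        except Exception as exc:
            # Record the fault and move on to the next node.
            errors[node] = exc
            continue
        return node, result, errors
    raise RuntimeError(f"all nodes failed: {sorted(errors)}")


def flaky_task(node):
    # Hypothetical task: pretend that node-a is down.
    if node == "node-a":
        raise ConnectionError("node-a unreachable")
    return f"done on {node}"


used_node, result, errors = run_with_failover(flaky_task, ["node-a", "node-b"])
```

The fault on `node-a` is recorded rather than hidden, so the caller gets the result from `node-b` and still has a record of what went wrong.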

This paper was written and submitted by Sai.