Fault-tolerant system - Fault-tolerance by duplication
Duplication can give fault-tolerance in three ways:
- Replication: Providing multiple identical instances of the same system, directing tasks or requests to all of them in parallel, and choosing the correct result on the basis of a quorum;
- Redundancy: Providing multiple identical instances of the same system and switching to one of the remaining instances in case of a failure (fall-back or backup);
- Diversity: Providing multiple different implementations of the same specification, and using them like replicated systems to cope with errors in a specific implementation.
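The three approaches above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each "instance" is modelled as a plain callable, results are assumed to be hashable, and the names `replicate` and `failover` are hypothetical helpers rather than part of any library.

```python
from collections import Counter


def replicate(instances, request):
    """Replication: send the request to every instance and return the
    result backed by a strict majority (the quorum)."""
    results = [instance(request) for instance in instances]
    value, votes = Counter(results).most_common(1)[0]
    if votes * 2 <= len(results):
        raise RuntimeError("no quorum among the instances")
    return value


def failover(instances, request):
    """Redundancy: use the first instance and switch to one of the
    remaining instances only if it fails."""
    last_error = None
    for instance in instances:
        try:
            return instance(request)
        except Exception as exc:  # treat any exception as a failure
            last_error = exc
    raise RuntimeError("all instances failed") from last_error


# Diversity: different implementations of the same specification
# ("square the input"), used like replicated systems.
def square_by_multiplying(x):
    return x * x


def square_by_power(x):
    return x ** 2


def square_with_bug(x):
    return x * x + 1  # deliberately wrong implementation


def always_down(x):
    raise IOError("instance unavailable")
```

For example, `replicate([square_by_multiplying, square_by_power, square_with_bug], 3)` outvotes the faulty implementation, while `failover([always_down, square_by_multiplying], 4)` falls back to the second instance.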
A redundant array of independent disks (RAID) is an example of a fault-tolerant storage device that uses redundancy.
A lockstep fault-tolerant machine uses replicated elements operating in parallel. At any time, all the replications of each element should be in the same state. The same inputs are provided to each replication, and the same outputs are expected. The outputs of the replications are compared using a voting circuit. A machine with two replications of each element is termed dual modular redundant (DMR). With only two replications, the voting circuit can detect a mismatch but cannot tell which replication is wrong, so recovery relies on other methods. A machine with three replications of each element is termed triple modular redundant (TMR). The voting circuit can determine which replication is in error when a two-to-one vote is observed. In this case, the voting circuit can output the correct result and discard the erroneous version. After this, the internal state of the erroneous replication is assumed to be different from that of the other two, and the voting circuit can switch to a DMR mode. This model can be applied to any larger number of replications.
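The voting behaviour described above can be modelled in software as a rough sketch. The class `LockstepVoter` and the exception `Mismatch` are hypothetical names introduced for illustration; replications are modelled as callables with hashable outputs, whereas a real voting circuit is hardware comparing signals on every clock edge.

```python
from collections import Counter


class Mismatch(Exception):
    """Raised when the outputs disagree and no majority exists."""


class LockstepVoter:
    def __init__(self, replications):
        # Replications currently trusted to be in the same state.
        self.active = list(replications)

    def step(self, value):
        # Feed the same input to every active replication.
        outputs = [replication(value) for replication in self.active]
        winner, votes = Counter(outputs).most_common(1)[0]
        if votes == len(outputs):
            return winner  # unanimous: nothing to do
        if votes * 2 <= len(outputs):
            # DMR case (one against one) or no majority at all:
            # the mismatch is detected but cannot be resolved here.
            raise Mismatch("outputs disagree and no majority exists")
        # Majority case (two-to-one in TMR): output the majority result
        # and discard the dissenting replication, e.g. TMR becomes DMR.
        self.active = [
            replication
            for replication, output in zip(self.active, outputs)
            if output == winner
        ]
        return winner


def good(x):
    return x + 1


def faulty(x):
    return x + 2  # stands in for a replication that has gone wrong
```

Starting with `LockstepVoter([good, good, faulty])`, the first `step` returns the majority result and leaves two active replications, after which the voter behaves as a DMR machine.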
Lockstep fault-tolerant machines are most easily made fully synchronous, with each gate of each replication making the same state transition on the same edge of the clock, and the clocks to the replications being exactly in phase. However, it is possible to build lockstep systems without this requirement.
Bringing the replications into synchrony requires making their internal stored states the same. They can be started from a fixed initial state, such as the reset state. Alternatively, the internal state of one replication can be copied to another.
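Both ways of bringing replications into synchrony can be illustrated with a toy model. The `Replica` class, its `counter` state, and the method names are assumptions made for this sketch; real hardware would copy registers and memory rather than a Python dictionary.

```python
import copy


class Replica:
    # Fixed initial state shared by every replication (the reset state).
    RESET_STATE = {"counter": 0}

    def __init__(self):
        self.state = copy.deepcopy(self.RESET_STATE)

    def step(self, value):
        # Deterministic transition: same state + same input -> same output.
        self.state["counter"] += value
        return self.state["counter"]

    def reset(self):
        # Option 1: restart from the fixed initial state.
        self.state = copy.deepcopy(self.RESET_STATE)

    def copy_state_from(self, other):
        # Option 2: copy the internal state of another replication.
        self.state = copy.deepcopy(other.state)
```

After `b.copy_state_from(a)`, feeding both replicas the same input yields the same output again, which is the precondition for lockstep comparison.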
One variant of DMR is pair-and-spare. Two replicated elements operate in lockstep as a pair, with a voting circuit that detects any mismatch between their operations and outputs a signal indicating that there is an error. A second pair operates in exactly the same way. A final circuit selects the output of the pair that does not report an error. Pair-and-spare requires four replications rather than the three of TMR, but has been used commercially.
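A minimal sketch of the pair-and-spare selection logic follows. The function names are hypothetical, and the sketch evaluates the pairs one after the other as a simplification; in a real machine both pairs run concurrently in hardware.

```python
def run_pair(first, second, value):
    """Run two lockstep elements and return (output, error_flag).
    The error flag is set when the two outputs do not match."""
    out_first, out_second = first(value), second(value)
    return out_first, out_first != out_second


def pair_and_spare(primary_pair, spare_pair, value):
    """Select the output of a pair that does not report an error."""
    for pair in (primary_pair, spare_pair):
        output, error = run_pair(pair[0], pair[1], value)
        if not error:
            return output
    raise RuntimeError("both pairs report an error")


def good(x):
    return x + 1


def faulty(x):
    return x + 2  # stands in for an element that has gone wrong
```

With `pair_and_spare((good, faulty), (good, good), 1)`, the primary pair flags a mismatch and the selector takes the spare pair's output instead.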
Other related archives: Byzantine fault tolerance, Cluster, Defence in depth, Diversity, HTML, Redundancy, Replication, Transaction, Transmission Control Protocol, availability, checkpointing, fault tolerant design, forward compatible, idempotent, life-critical, lockstep, packet-switched, parallel, quorum, redundant array of independent disks, self-stabilization, storage device, system
Adapted from the Wikipedia article "Fault-tolerance by duplication", under the GNU Free Documentation License. Please also see http://en.wikipedia.org/wiki