Implementing highly available, highly reliable virtual processors

dc.contributor.authorMacdonald, Robert Noëlen_US
dc.date.accessioned2024-08-14T22:33:20Z
dc.date.available2024-08-14T22:33:20Z
dc.date.copyright1994en_US
dc.date.issued1994
dc.degree.departmentDepartment of Computer Science
dc.degree.levelMaster of Science M.Sc.en
dc.description.abstractA fault-tolerant distributed facility called a Halt on Failure Processor (HFP) and its performance in a network of workstations are described. Process replication and n-modular redundancy are used to achieve fault tolerance in a general purpose workstation environment. A blacklisting mechanism is used to differentiate between slow and crashed workstations. The system achieves high availability by keeping a list of healthy work­stations. The HFP will halt rather than deliver the results from an erroneous cal­culation to its users. The design of the HFP is presented along with the type and number of errors it is capable of hand ling. The implementation using the existing Remote Execution Manager is discussed. Extensive performance studies were carried out within a network of Sun SPARC workstations running UNIX. Performance results are presented and the costs of performing fault management at various levels are exposed. Flaws in the way UNIX reports load information and their implication on load-balancing are pointed out. It is shown that IIFPs can achieve high availability and fault-tolerance using the idle cycles of workstations in a local area network with little performance degrada­tion.en
dc.format.extent95 pages
dc.identifier.urihttps://hdl.handle.net/1828/18792
dc.rightsAvailable to the World Wide Weben_US
dc.titleImplementing highly available, highly reliable virtual processorsen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MACDONALD_ROBERT_MSc_1994_676787.pdf
Size:
1.98 MB
Format:
Adobe Portable Document Format