Fault Tolerance Model for Hadoop Distributed System

Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For t...

Full description

Saved in:
Bibliographic Details
Main Authors: Soraya Setti Ahmed, Yahya Slimani, Riadh Frefita
Format: Article
Language:English
Published: Graz University of Technology 2025-01-01
Series:Journal of Universal Computer Science
Subjects:
Online Access:https://lib.jucs.org/article/120840/download/pdf/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832581890798780416
author Soraya Setti Ahmed
Yahya Slimani
Riadh Frefita
author_facet Soraya Setti Ahmed
Yahya Slimani
Riadh Frefita
author_sort Soraya Setti Ahmed
collection DOAJ
description Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For this purpose, it defines 3 nodes status: safety, faulty and potentially faulty. With respect of classical node status (safety, faulty), it introduces a new status that we call potentially faulty. This last node allows to enhance the availability of a distributed system. Second, it discusses the efficiency of the proposed model on two types of architectures: virtual multi-node cluster and a physical multi-node cluster with WIFI connection. Experiments have showed that proposed approach increases the system performance throughput and its fault tolerance level.
format Article
id doaj-art-455e991e55974ad496206a8d13716582
institution Kabale University
issn 0948-6968
language English
publishDate 2025-01-01
publisher Graz University of Technology
record_format Article
series Journal of Universal Computer Science
spelling doaj-art-455e991e55974ad496206a8d137165822025-01-30T08:31:23ZengGraz University of TechnologyJournal of Universal Computer Science0948-69682025-01-01311729210.3897/jucs.120840120840Fault Tolerance Model for Hadoop Distributed SystemSoraya Setti Ahmed0Yahya Slimani1Riadh Frefita2Mustapha Stambouli UniversityISAMM, Manouba UniversityEsprit School, Pôle Technologique,Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For this purpose, it defines 3 nodes status: safety, faulty and potentially faulty. With respect of classical node status (safety, faulty), it introduces a new status that we call potentially faulty. This last node allows to enhance the availability of a distributed system. Second, it discusses the efficiency of the proposed model on two types of architectures: virtual multi-node cluster and a physical multi-node cluster with WIFI connection. Experiments have showed that proposed approach increases the system performance throughput and its fault tolerance level.https://lib.jucs.org/article/120840/download/pdf/Distributed SystemsHadoopFault ToleranceNetw
spellingShingle Soraya Setti Ahmed
Yahya Slimani
Riadh Frefita
Fault Tolerance Model for Hadoop Distributed System
Journal of Universal Computer Science
Distributed Systems
Hadoop
Fault Tolerance
Netw
title Fault Tolerance Model for Hadoop Distributed System
title_full Fault Tolerance Model for Hadoop Distributed System
title_fullStr Fault Tolerance Model for Hadoop Distributed System
title_full_unstemmed Fault Tolerance Model for Hadoop Distributed System
title_short Fault Tolerance Model for Hadoop Distributed System
title_sort fault tolerance model for hadoop distributed system
topic Distributed Systems
Hadoop
Fault Tolerance
Netw
url https://lib.jucs.org/article/120840/download/pdf/
work_keys_str_mv AT sorayasettiahmed faulttolerancemodelforhadoopdistributedsystem
AT yahyaslimani faulttolerancemodelforhadoopdistributedsystem
AT riadhfrefita faulttolerancemodelforhadoopdistributedsystem