Fault Tolerance Model for Hadoop Distributed System
Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For t...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Graz University of Technology
2025-01-01
|
Series: | Journal of Universal Computer Science |
Subjects: | |
Online Access: | https://lib.jucs.org/article/120840/download/pdf/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832581890798780416 |
---|---|
author | Soraya Setti Ahmed Yahya Slimani Riadh Frefita |
author_facet | Soraya Setti Ahmed Yahya Slimani Riadh Frefita |
author_sort | Soraya Setti Ahmed |
collection | DOAJ |
description | Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For this purpose, it defines 3 nodes status: safety, faulty and potentially faulty. With respect of classical node status (safety, faulty), it introduces a new status that we call potentially faulty. This last node allows to enhance the availability of a distributed system. Second, it discusses the efficiency of the proposed model on two types of architectures: virtual multi-node cluster and a physical multi-node cluster with WIFI connection. Experiments have showed that proposed approach increases the system performance throughput and its fault tolerance level. |
format | Article |
id | doaj-art-455e991e55974ad496206a8d13716582 |
institution | Kabale University |
issn | 0948-6968 |
language | English |
publishDate | 2025-01-01 |
publisher | Graz University of Technology |
record_format | Article |
series | Journal of Universal Computer Science |
spelling | doaj-art-455e991e55974ad496206a8d137165822025-01-30T08:31:23ZengGraz University of TechnologyJournal of Universal Computer Science0948-69682025-01-01311729210.3897/jucs.120840120840Fault Tolerance Model for Hadoop Distributed SystemSoraya Setti Ahmed0Yahya Slimani1Riadh Frefita2Mustapha Stambouli UniversityISAMM, Manouba UniversityEsprit School, Pôle Technologique,Fault tolerance approaches in distributed systems are essentially based on replication and checkpointing. Each of these approaches has its advantages and limitations. This paper has two objectives: first, it proposes a fault tolerance approach based on the nodes status of a distributed system. For this purpose, it defines 3 nodes status: safety, faulty and potentially faulty. With respect of classical node status (safety, faulty), it introduces a new status that we call potentially faulty. This last node allows to enhance the availability of a distributed system. Second, it discusses the efficiency of the proposed model on two types of architectures: virtual multi-node cluster and a physical multi-node cluster with WIFI connection. Experiments have showed that proposed approach increases the system performance throughput and its fault tolerance level.https://lib.jucs.org/article/120840/download/pdf/Distributed SystemsHadoopFault ToleranceNetw |
spellingShingle | Soraya Setti Ahmed Yahya Slimani Riadh Frefita Fault Tolerance Model for Hadoop Distributed System Journal of Universal Computer Science Distributed Systems Hadoop Fault Tolerance Netw |
title | Fault Tolerance Model for Hadoop Distributed System |
title_full | Fault Tolerance Model for Hadoop Distributed System |
title_fullStr | Fault Tolerance Model for Hadoop Distributed System |
title_full_unstemmed | Fault Tolerance Model for Hadoop Distributed System |
title_short | Fault Tolerance Model for Hadoop Distributed System |
title_sort | fault tolerance model for hadoop distributed system |
topic | Distributed Systems Hadoop Fault Tolerance Netw |
url | https://lib.jucs.org/article/120840/download/pdf/ |
work_keys_str_mv | AT sorayasettiahmed faulttolerancemodelforhadoopdistributedsystem AT yahyaslimani faulttolerancemodelforhadoopdistributedsystem AT riadhfrefita faulttolerancemodelforhadoopdistributedsystem |