Abstract
Background: Cloud computing refers to the computing capacities of remote computers, where the user has considerable computing power without having to own power units. The probability of failures, which can occur during execution, increases in the number of nodes. Since failures cannot be completely avoided, one solution is to use failure tolerance mechanisms. Predicting failures has become a major task for engineers and software developers, as failure increases resource usage costs.
Objectives and Methods: This paper presents a hybrid method of predicting failures in a cloudcomputing environment based on the passive replication technique and multi-agent systems. It detects failures, improves the average response time and minimizes lost time. This method makes it possible to efficiently and transparently guarantee the continuity of cloud computing services in the presence of failures.
Results: The results show that the proposed method performs well in the presence of failures, improves the response time and minimizes the additional costs caused by the failures.
Conclusion: This paper proposes a hybrid method of predicting failures in cloud-computing based on the passive replication technique and multi-agent systems to detect failures and minimize lost time. The replication technique works by duplicating some system components, which are deployed simultaneously across different resources. This technique aims to make the system robust, increase availability and guarantee the execution of jobs. In addition, it is suitable for long-running tasks.
Keywords: Failure prediction, cloud-computing, multi-agent system, replication, controller agent, replication agent.
Graphical Abstract
[http://dx.doi.org/10.1109/ACCESS.2019.2941145]
[http://dx.doi.org/10.1145/2735399.2735405]
[http://dx.doi.org/10.1016/j.jksuci.2018.09.021]
[http://dx.doi.org/10.1109/JSYST.2012.2221934]
[http://dx.doi.org/10.1109/IAdCC.2014.6779432]
[http://dx.doi.org/10.1016/j.jnca.2015.10.004]
[http://dx.doi.org/10.1109/TIE.2015.2417501]
[http://dx.doi.org/10.1002/cpe.3190]
[http://dx.doi.org/10.1109/TPDS.2012.96]
[http://dx.doi.org/10.1109/TPDS.2016.2571281]
[http://dx.doi.org/10.1007/s11107-014-0463-1]
[http://dx.doi.org/10.1109/TC.2013.225]
[http://dx.doi.org/10.1109/SC.2014.79]
[http://dx.doi.org/10.1109/ACCESS.2016.2623633]
[http://dx.doi.org/10.1109/FiCloud.2016.58]
[http://dx.doi.org/10.1016/j.procs.2020.03.106]
[http://dx.doi.org/10.1109/JCN.2020.000023]
[http://dx.doi.org/10.1109/ACCESS.2019.2945499]
[http://dx.doi.org/10.1002/spe.995]
[http://dx.doi.org/10.1109/WORDS.1999.806554]
[http://dx.doi.org/10.1007/978-0-387-35565-8_30]