Checkpointing: Fault Tolerance Mechanism

Neeraj Kumar Rathore*
Assistant Professor, Department of Computer Science & Engineering, Jaypee University of Engineering & Technology, Guna, M.P., India.
Periodicity:January - June'2017
DOI : https://doi.org/10.26634/jcc.4.1.13756

Abstract

Checkpointing is a technique for inserting fault tolerance into computing systems. It basically consists on storing a snapshot of the current application state, and uses it for restarting the execution in case of failure. It is saving the program state, usually to stable storage, so that it may be reconstructed later in time. Checkpointing provides the backbone for rollback recovery (fault-tolerance), playback debugging, process migration, and job swapping. It mainly focuses on fault-tolerance, process migration, and the performance of checkpointing on all computational platforms from uniprocessors to supercomputers.

Checkpointing and restart has been one of the most widely used techniques for fault tolerance in large parallel applications. By periodically saving application status to permanent storage (disk or tape), the execution can be restarted from the last checkpoint if system faults occur. It is an effective approach to tolerating both hardware and software faults. For example, a user who is writing a long program at a terminal can save the input buffer occasionally to minimize the rewriting caused by failures that affect the buffer.

Keywords

Checkpointing, Fault-Tolerance, Alchemi.NET, Grid Computing, GridSim.

How to Cite this Article?

Rathore. N (2017). Checkpointing: Fault Tolerance Mechanism. i-manager's Journal on Cloud Computing, 4(1), 28-34. https://doi.org/10.26634/jcc.4.1.13756

References

[1]. Chohan, R., & Rathore, N. K. (2012). Comparison of Load Balancing Technique in Grid. 17th Annual Conference of Gwalior Academy of Mathematical Science and National Symposium on Computational Mathematics & Information Technology (pp. 7-9). JUET, Guna, M.P.
[2]. Rathore, N. (2015). Efficient agent based priority scheduling and load balancing using fuzzy logic in grid computing. i-manager's Journal on Computer Science, 3(3), 11-22.
[3]. Rathore, N. (2015). Ethical Hacking and Security against Cyber Crime. i-manager's Journal on Information Technology, 5(1), 7-11.
[4]. Rathore, N. (2015). GridSim installation and implementation process. i-manager's Journal on Cloud Computing, 2(4), 29-40.
[5]. Rathore, N. (2015). MapReduce architecture for grid. i-manager's Journal on Software Engineering, 10(1), 21- 30.
[6]. Rathore, N. (2016). A Review towards: Load balancing Techniques. i-manager's Journal on Power Systems Engineering, 4(4), 47-60.
[7]. Rathore, N. (2016). Dynamic Threshold Based Load Balancing Algorithms.Wireless Personal Communications, 91(1), 151-185.
[8]. Rathore, N. (2016). Installation of Alchemi.Net in Computational Grid. i-manager's Journal on Computer Science, 4(2), 1-5.
[9]. Rathore, N. (2017). Performance of Hybrid Load Balancing Algorithm in Distributed Web Server System. National Academy Science Letters.
[10]. Rathore, N. An Efficient Load Balancing Algorithm in Distributed Network. Journal of Computer & Electrical Engineering, IOS Press Publication-Netherland (in revisions).
[11]. Rathore, N. K. (2016). Faults in Grid. International Journal of Software and Computer Science Engineering. 1(1), 1-19.
[12]. Rathore, N. K., & Chana, I. (2010). Checkpointing algorithm in Alchemi.NET. Pragyaan: Journal of Information Technology, 8(1), 32-38.
[13]. Rathore, N. K., & Chana, I. (2010). Fault tolerance algorithm in Alchemi.NET middleware. In National Conference on Education & Research (ConFR10), Third CSI National Conference of CSI Division V, Bhopal Chapter, IEEE Bombay, and MPCST Bhopal, organized by JUIT, India.
[14]. Rathore, N. K. (2011, December). Checkpointing: Fault Tolerance Mechanism. Journal on Cloud Computing, 4(1), 28-35.
[15]. Rathore, N. K., & Chana, I. (2011, December). A cogitative analysis of load balancing technique with job migration in grid environment. In World Congress on Information and Communication Technology (WICT), Mumbai, IEEE Proceedings Paper (pp. 77-82).
[16]. Rathore, N. K., & Chana, I. (2016). Job migration policies for grid environment. Wireless Personal Communications, 89(1), 241-269.
[17]. Rathore, N. (2008). Comparative Analysis of Checkpointing. PIMR Third National IT Conference, IT Enabled Practices and Emerging Management Paradigm book and category is Communication Technologies and Security Issues (pp no.-32-35). Prestige Management and Research, Indore, (MP) India.
[18]. Rathore, N. (2014). Efficient Hierarchical Load Balancing Technique based on Grid. In 29th M.P. Young Scientist Congress (p.55), Bhopal, M.P.
[19]. Rathore, N. (2015). Efficient Load Balancing Algorithm in Grid. In 30th M.P. Young Scientist Congress (p. 56). Bhopal, M.P.
[20]. Rathore, N. (2016). An Efficient Load Balancing Algorithm in Distributed Networks. Lambert Academic Publication House.
[21]. Rathore, N., & Chana, I. (2013). Report on hierarchal load balancing technique in grid environment. i-manager's Journal on Information Technology, 2(4), 21- 35.
[22]. Rathore, N., & Chana, I. (2013, September). A sender initiate based hierarchical load balancing technique for grid using variable threshold value. In Signal Processing, Computing and Control (ISPCC), 2013 IEEE International Conference on (pp. 1-6). IEEE.
[23]. Rathore, N., & Chana, I. (2014). Job migration with fault tolerance based QoS scheduling using hash table functionality in social Grid computing. Journal of Intelligent & Fuzzy Systems, 27(6), 2821-2833.
[24]. Rathore, N., & Chana, I. (2014). Load balancing and job migration techniques in grid: A survey of recent trends. Wireless Personal Communications, 79(3), 2089-2125.
[25]. Rathore, N., & Chana, I. (2015). Variable thresholdbased hierarchical load balancing technique in Grid. Engineering with Computers, 31(3), 597-615.
[26]. Rathore, N. K., & Chohan, R. (2016). An Enhancement of GridSim Architecture with Load Balancing. Scholars' Press.
[27]. Rathore, N. K., & Sharma, A. (2015). Efficient Dynamic Distributed Load Balancing Technique. Lambert Academic Publication House.
[28]. Rathore, N. K., Jain, N., & Mishra, A. (2017). An efficient image forgery detection using Biorthogonal Wavelet Transform and Singular Value Decomposition. In 5th International Conference on Advance Research Applied Science, Environment, Agriculture & Entrepreneurship Development (ARASEAED) (pp. 274- 281).
[29]. Sharma, V., Kumar, R., & Rathore, N. (2015). Topological Broadcasting using Parameter Sensitivity- Based Logical Proximity Graphs in Coordinated Ground- Flying Ad Hoc Networks. JoWUA, 6(3), 54-72.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.