ARCHIVES
Adaptive Fault Tolerance in Machine Learning Systems: A Self-Healing Framework
¹ ² ³ ⁴ ⁵ ⁶ Computer Science and Engineering and Business Systems, Vimal Jyothi Engineering College, Kannur, Kerala, India.
Published Online: January-April 2026
Pages: 214-220
Cite this article
↗ https://www.doi.org/10.59256/indjcst.20260501030This project presents a plugin-based self-healing machine learning framework aimed at improving the reliability and robustness of deployed ML models in dynamic real-world environments. The framework autonomously detects, classifies, and mitigates runtime errors by continuously monitoring model behavior using reliability signals such as data drift, prediction confidence, entropy, and label consistency. Detected anomalies are semantically classified into error types including data drift, overconfidence, and label noise, enabling a policy-driven healing mechanism to select appropriate corrective actions such as safe retraining or protective blocking. Safety guards and validation checks ensure that learning and adaptation occur only under reliable conditions, preventing harmful self-updates. Through a gated feedback loop and selective learning strategy, the framework maintains long-term model stability while reducing performance degradation. Its modular, plugin- based design allows seamless integration with existing machine learning models without modifying core model logic, thereby minimizing human intervention and providing a practical approach to fault-tolerant machine learning systems suitable for real-world deployment.
Related Articles
2026
Artificial Intelligence in Learning and Teaching
2026
Admin Assist: An AI – Driven Configuration and Orchestration for Enterprise Application
2026
Enhancing Blood Group Identification using pigeon inspired optimization: An Innovative Approach
2026
Eco-Genius: Power Up Smart, Power Down Waste
2026
Crowd-Sourced Disaster Response and Rescue Assistant
2026
Unveiling Deepfake Detection Using Vision Transformers: A Survey and Experimental Study
2026
A Novel Stateful Orchestration Pattern for Data Affinity and Transactional Integrity in Sharded Backend Architectures
2026
Legal Challenges of Agentic AI Systems in Education and Employment Decision-Making
2026
New-Hybrid Soft Computing Model for Stock Market Predictions
2026
Human Emotion Distribution Learning from Face Images Using CNN


