ARCHIVES

Original Article

Adaptive Fault Tolerance in Machine Learning Systems: A Self-Healing Framework

Faiza Fathima1Adwaith Raj2Jojo George3Nandana Babu4Ashmila K.P5Dr. S. Vadhana Kumari6

¹ ² ³ ⁴ ⁵ ⁶ Computer Science and Engineering and Business Systems, Vimal Jyothi Engineering College, Kannur, Kerala, India.

Published Online: January-April 2026

Pages: 214-220

Abstract

This project presents a plugin-based self-healing machine learning framework aimed at improving the reliability and robustness of deployed ML models in dynamic real-world environments. The framework autonomously detects, classifies, and mitigates runtime errors by continuously monitoring model behavior using reliability signals such as data drift, prediction confidence, entropy, and label consistency. Detected anomalies are semantically classified into error types including data drift, overconfidence, and label noise, enabling a policy-driven healing mechanism to select appropriate corrective actions such as safe retraining or protective blocking. Safety guards and validation checks ensure that learning and adaptation occur only under reliable conditions, preventing harmful self-updates. Through a gated feedback loop and selective learning strategy, the framework maintains long-term model stability while reducing performance degradation. Its modular, plugin- based design allows seamless integration with existing machine learning models without modifying core model logic, thereby minimizing human intervention and providing a practical approach to fault-tolerant machine learning systems suitable for real-world deployment.

Related Articles

2026

Artificial Intelligence in Learning and Teaching

2026

Admin Assist: An AI – Driven Configuration and Orchestration for Enterprise Application

2026

Enhancing Blood Group Identification using pigeon inspired optimization: An Innovative Approach

2026

Eco-Genius: Power Up Smart, Power Down Waste

2026

Crowd-Sourced Disaster Response and Rescue Assistant

2026

Unveiling Deepfake Detection Using Vision Transformers: A Survey and Experimental Study

2026

A Novel Stateful Orchestration Pattern for Data Affinity and Transactional Integrity in Sharded Backend Architectures

2026

Legal Challenges of Agentic AI Systems in Education and Employment Decision-Making

2026

New-Hybrid Soft Computing Model for Stock Market Predictions

2026

Human Emotion Distribution Learning from Face Images Using CNN