NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
AGENTIC-SYSTEMS...1 min read

Safe and Adaptive Cloud Healing: Verifying LLM-Generated Recovery Plans with a Neural-Symbolic World Model

Share
NOW LET US Article – Safe and Adaptive Cloud Healing: Verifying LLM-Generated Recovery Plans with a Neural-Symbolic World Model

Researchers have introduced PASE, a breakthrough cloud self-healing framework that combines Large Language Models (LLMs) with a Neural-Symbolic World Model. This novel approach optimizes autonomous recovery planning, reducing system downtime by over 40% compared to state-of-the-art methods.

Computer Science > Artificial Intelligence

Title:Safe and Adaptive Cloud Healing: Verifying LLM-Generated Recovery Plans with a Neural-Symbolic World Model

View PDF HTML (experimental)Abstract:As the scale and complexity of cloud-based AI systems continue to escalate, ensuring service reliability through rapid fault detection and adaptive recovery has become a critical challenge. While existing approaches integrate Large Language Models (LLMs) for semantic understanding and Deep Reinforcement Learning (DRL) for policy optimization, they often rely on sequential, loosely coupled architectures that underutilize the generative and reasoning capabilities of LLMs. In this paper, we propose a paradigm shift with PASE, a Planning-Aware Semantic self-healing engine, a novel fault self-healing framework that reconceptualizes recovery as a neuro-symbolic program synthesis task. PASE employs an LLM as a core Plan Synthesis Engine to generate structured recovery plans from a library of semantic primitives. A Neural-Symbolic World Model verifies plan feasibility through simulation, while a Meta-Prompt Optimizer, trained via DRL, learns to generate optimal prompts that guide the LLM's planning process. This tight reason-plan-verify-adapt loop enables dynamic, context-aware recovery strategy generation beyond predefined action spaces. Experiments on a real-world cloud fault injection dataset demonstrate that PASE significantly outperforms state-of-the-art methods, reducing average system recovery time by over 40% and improving fault detection accuracy in unknown fault scenarios. Our framework advances autonomous system management by unifying LLM-based reasoning with model-assisted verification and meta-learned guidance.

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

© 2026 Now Let Us. All rights reserved.

Source: arXiv cs.AI Recent

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – Scaling Trends for Lie Detector Oversight in Preference Learning

agentic-systems

Scaling Trends for Lie Detector Oversight in Preference Learning

A new study evaluates Scalable Oversight via Lie Detectors (SOLiD) on larger LLMs, showing that scaling reduces undetected deception to 14% and can eliminate the need for expensive human labelers during fine-tuning.

NOW LET US Related – The Agentic Garden of Forking Paths

agentic-systems

The Agentic Garden of Forking Paths

A new study reveals that AI agents can produce divergent, opposing scientific conclusions from the same dataset simply by being assigned different personas. To address this challenge to scientific credibility, researchers propose 'Agentic Bootstrap' to map the entire distribution of possible analytical paths.

NOW LET US Related – World Feedback for Clinical Agents: Diagnosing RL in FHIR Environments

agentic-systems

World Feedback for Clinical Agents: Diagnosing RL in FHIR Environments

The paper diagnoses the challenges of applying Reinforcement Learning (RL) to clinical agents in FHIR environments, introducing MedAgentBench-v3 to address feedback flaws and proposing a hybrid SFT-RL approach.

NOW LET US Related – Discrete Diffusion Language Models for Interactive Radiology Report Drafting

agentic-systems

Discrete Diffusion Language Models for Interactive Radiology Report Drafting

Researchers have adapted a mixture-of-experts diffusion language model for medical applications, matching or exceeding traditional autoregressive models while decoding 3.5 to 4.4 times faster and enabling flexible, non-linear report drafting.

NOW LET US Related – OPINE-World: Programmatic World Modeling with Ontology-error-Prioritized Interactive Exploration

agentic-systems

OPINE-World: Programmatic World Modeling with Ontology-error-Prioritized Interactive Exploration

Researchers have introduced OPINE-World, a breakthrough LLM agent that learns an object-centric programmatic world model online through interaction. By guiding exploration with a novel 'ontology error' metric, it overcomes the data-hungry nature of traditional deep networks and achieves high efficiency on the ARC-AGI-3 benchmark.

NOW LET US Related – Janus: a Playground for User-Involved Agentic Permission Management

agentic-systems

Janus: a Playground for User-Involved Agentic Permission Management

As AI agents autonomously execute tools, managing permissions becomes a critical challenge. Janus is introduced as a playground system consisting of Janus-Core and Janus-Harness to implement and evaluate user-involved permission management designs.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.