NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
AGENTIC-SYSTEMS...1 min read

Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models

Share
NOW LET US Article – Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models

Researchers introduce Narration-of-Thought (NoT), a zero-training inference-time scaffolding that structures LLM reasoning to dramatically improve ethical decision-making and reduce cognitive biases like stakeholder collapse and uncertainty suppression.

Computer Science > Artificial Intelligence

Title:Narration-of-Thought: Inference-Time Scaffolding for Defeasible Ethical Reasoning in Large Language Models

View PDF HTML (experimental)Abstract:Standard chain-of-thought on moral dilemmas exhibits two failure modes: stakeholder collapse (the trace names at most one party with a stake in the outcome) and uncertainty suppression (no explicit unknowns or hedges before committing to an action). We introduce narration-of-thought (NoT), a system prompt that structures chain-of-thought into five sections: protagonist, stakeholders, two-step consequences, uncertainty, then commitment. NoT adds no training, parameters, or fine-tuning. On 100 DailyDilemmas scenarios across four generators from three vendors, NoT cuts stakeholder collapse from up to 31% to under 1% and uncertainty suppression from up to 72% to 1-24% on every model. A matched-budget verbose-CoT control rules out token spend as the active ingredient; NoT retains Cliff's delta advantages of +0.79 to +0.90 on stakeholder count and +0.65 to +0.93 on uncertainty score for three of four generators, and a section ablation attributes each shift to its specific sub-instruction. Textual-gradient descent initialised at NoT improves the scaffold further; a cross-family training judge (different vendor from the generator) dominates an in-family one on every measured axis. Extended to a five-round multi-stakeholder debate protocol, the scaffold converts a 6% standoff into 95% full consensus on a calibration set and 100% combined convergence on a DailyDilemmas replication. The resulting traces externalise the stakeholders, consequences, and uncertainty grounding each commitment, providing an auditable substrate for dependable agentic deployment.

Current browse context:

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

© 2026 Now Let Us. All rights reserved.

Source: arXiv cs.AI Recent

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – AlgoEvolve: LLM-driven Meta-evolution of Algorithmic Trading Programs

agentic-systems

AlgoEvolve: LLM-driven Meta-evolution of Algorithmic Trading Programs

Researchers have introduced AlgoEvolve, an LLM-driven evolutionary framework that automatically generates, evaluates, and optimizes Python-based algorithmic trading strategies. The system demonstrates emergent regime-adaptive logic and utilizes a meta-evolutionary loop to optimize prompts, outperforming human-designed instructions.

NOW LET US Related – COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami

agentic-systems

COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami

Researchers have developed COrigami, an end-to-end AI pipeline that generates flat-foldable origami crease patterns from natural language descriptions. By combining algorithmic optimization with reinforcement learning, the system serves as a collaborative assistant for human artists.

NOW LET US Related – Accelerating Skill Assessment in Chess: A Drift-Diffusion-Enhanced Elo Rating System

agentic-systems

Accelerating Skill Assessment in Chess: A Drift-Diffusion-Enhanced Elo Rating System

Researchers have developed DD-Elo, a new chess rating system based on the drift-diffusion model from cognitive neuroscience. By analyzing move-by-move data rather than just match outcomes, DD-Elo updates player ratings much faster and more accurately than the traditional Elo system.

NOW LET US Related – Knowledge-augmented Agentic AI for Mental Health Medication Information Seeking

agentic-systems

Knowledge-augmented Agentic AI for Mental Health Medication Information Seeking

Researchers have developed a knowledge-augmented multi-agent AI framework that integrates regulatory FDA records with patient narratives from Reddit and WebMD, offering a safer and more traceable way to seek mental health medication information.

NOW LET US Related – Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols

agentic-systems

Agentic Analysis for Agentic Infrastructure: An LLM-Powered Pipeline for Comparative Governance of DAO and Corporate AI Protocols

Researchers have introduced an LLM-powered comparative pipeline to analyze the governance structures of AI agent protocols, comparing decentralized (DAO) and corporate-led standards.

NOW LET US Related – Geometry-Aware MCTS for Extremal Problems in Combinatorial Geometry

agentic-systems

Geometry-Aware MCTS for Extremal Problems in Combinatorial Geometry

Researchers have proposed a Geometry-Aware MCTS framework to solve complex extremal problems in combinatorial geometry. This new approach overcomes the limitations of traditional RL and Transformer models, establishing new best-known computational results on five out of six tested problems.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.