NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
AGENTIC-SYSTEMS...1 min read

Definitional alignment before capability alignment: a Design-Science framework for adjudicating claims about AGI

Share
NOW LET US Article – Definitional alignment before capability alignment: a Design-Science framework for adjudicating claims about AGI

A new research paper proposes DAF-AGI, a design-science framework to resolve conflicting claims about the arrival of Artificial General Intelligence (AGI) by prioritizing definitional alignment over capability alignment.

Computer Science > Artificial Intelligence

Title:Definitional alignment before capability alignment: a Design-Science framework for adjudicating claims about AGI

View PDF HTML (experimental)Abstract:Claims that artificial general intelligence has already arrived and claims that it remains decades away are often defended from overlapping evidence. "AGI" lacks a single shared and stable referent and competing operationalizations can return different verdicts on the same system. This article treats that under-specification as a design and governance problem. Following Design Science Research Methodology, it develops DAF-AGI, a second-order conceptual artifact with two coupled components: five ordinal criteria for assessing the adjudicative fitness of candidate definitions and a structured governance audit of authorship, interest, certification, external verification and revision authority. The artifact is demonstrated on five prominent measurement families and one deflationary boundary position in a documented corpus and then stress-tested against a stylized strong arrival claim: that current generative systems constitute AGI because they outperform a well-educated adult on many cognitive tasks. On evidence from the cited 2024-2025 sources, the claim was certifiable only under a performance-based operationalization; capability-ontology, psychometric and skill-acquisition approaches did not certify it, the economic family remains indeterminate and the deflationary position refuses binary adjudication. The contribution is a novel integration and operationalization, not an empirical validation: independent application, inter-rater testing and author-external cases remain necessary. The paper further proposes definitional sovereignty as an enabling component of algorithmic sovereignty: the institutional capacity to contest, certify and revise imported technological categories under public accountability.

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

© 2026 Now Let Us. All rights reserved.

Source: arXiv cs.AI Recent

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – "Did you lie?" Evaluating Lie Detectors across Model Scale and Belief-Verified Model Organisms

agentic-systems

"Did you lie?" Evaluating Lie Detectors across Model Scale and Belief-Verified Model Organisms

A new study evaluates lie detectors for language models, revealing that while detector performance scales with model capability on prompted lies, current detectors fail sharply when tested on sophisticated, belief-verified model organisms.

NOW LET US Related – Deployment-Centered Evaluation: Predicting Query-Level Rejection Risk in a Clinical LLM System

agentic-systems

Deployment-Centered Evaluation: Predicting Query-Level Rejection Risk in a Clinical LLM System

A new study proposes a deployment-centered evaluation approach to predict the risk of clinicians rejecting LLM-generated responses in electronic health records. By leveraging deployment-specific context, the prediction model achieves an AUROC of 0.719, paving the way for targeted guardrails in clinical AI systems.

NOW LET US Related – Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents

agentic-systems

Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents

Researchers introduce Evoflux, an inference-time evolutionary search method that treats compact tool use as the repair of executable tool workflows, significantly boosting execution feasibility for small planners.

NOW LET US Related – Strategic Decision Support for AI Agents

agentic-systems

Strategic Decision Support for AI Agents

As AI agents increasingly act on behalf of users, a new research paper proposes a strategic decision-support framework that helps agents optimize when to seek human or tool assistance, balancing operational costs with decision accuracy.

NOW LET US Related – TrajGenAgent: A Hierarchical LLM Agent for Human Mobility Trajectory Generation

agentic-systems

TrajGenAgent: A Hierarchical LLM Agent for Human Mobility Trajectory Generation

Researchers have proposed TrajGenAgent, a hierarchical LLM-agent framework that generates realistic human mobility trajectories without model fine-tuning, addressing privacy and cost constraints in urban planning and epidemic control.

NOW LET US Related – PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

agentic-systems

PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

Researchers have introduced PersonaDrive, a breakthrough AI pipeline that leverages Vision-Language-Action (VLA) models and Retrieval-Augmented Generation (RAG) to simulate diverse human driving styles. This technology promises to revolutionize closed-loop driving simulations by creating highly realistic and varied behavior for non-ego traffic agents.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.