How Can AI Find My Model? A Model-Finding Experimental Study Considering Data Formats, Embeddings, and Retrieval Strategies

A recent experimental study explores how AI can optimize the discovery and reuse of simulation models using natural language queries. By evaluating data formats, embedding models, and retrieval strategies, the research establishes a baseline for AI-driven model composability and interoperability.

Computer Science > Artificial Intelligence

Title:How Can AI Find My Model? A Model-Finding Experimental Study Considering Data Formats, Embeddings, and Retrieval Strategies

Discovering simulation models for reuse remains a fundamental challenge in Modeling and Simulation (M&S). When many models coexist, identifying those that align with a given modeling intent remains difficult. Recent advances in Artificial Intelligence (AI), particularly retrieval-based approaches, offer a promising pathway to operate at this semantic layer. In this paper, we present an experimental study investigating the impact of data representation, transformer-based embedding models, and retrieval strategies on the discovery of simulation models using natural language queries. We evaluated performance across multiple query types using standard information retrieval metrics, including recall@5 and nDCG@5. Results show that data representation matters, open-source embedding models can achieve high performance, and reranking methods are important, especially as query complexity increases. This work provides a baseline for AI-driven model discovery and discusses its role in advancing toward AI-driven composability and interoperability.

Source: arXiv cs.AI Recent

More in this category

agentic-systems

Why Solve It Twice? Hierarchical Accumulation of Skills for Transfer-Efficient ML Engineering

Researchers introduce HASTE, a hierarchical multi-agent system that organizes cross-competition knowledge into three scope tiers, allowing ML agents to accumulate and reuse skills, significantly reducing compute costs and improving performance.

agentic-systems

Beyond expert users: agents should help users construct preferences, not just elicit them

Current AI agents assume users have well-formed preferences, but a new study argues they should instead help users construct these preferences by providing domain knowledge. Evaluating frontier models on a new benchmark called CoShop reveals significant limitations in how current AI assists users in understanding their own needs.

agentic-systems

AgentBound: Verifiable Behavioral Governance for Autonomous AI Agents

Researchers have introduced AgentBound, a runtime governance framework that provides verifiable behavioral oversight for autonomous AI agents. By combining three independent authorities and generating cryptographically verifiable receipts, AgentBound establishes a deterministic governance layer between authorization and execution.

agentic-systems

Start building with Nano Banana 2 Lite and Gemini Omni Flash

Google has launched Nano Banana 2 Lite, its fastest and most cost-effective image generation model, alongside Gemini Omni Flash, a powerful tool for video generation and conversational editing. Together, these models empower developers to build seamless, end-to-end multimedia workflows.

agentic-systems

GPTNT: Benchmarking Real-Time Collaboration Between Multimodal Agents on Keep Talking And Nobody Explodes

Researchers introduce GPTNT, a new benchmark based on the game 'Keep Talking and Nobody Explodes' to evaluate real-time collaboration between multimodal AI agents. The study reveals that current state-of-the-art models fail to defuse a single bomb in real-time, highlighting key weaknesses in AI coordination.

agentic-systems

The Two Genie Game: Adoption and Welfare in Audit-Grounded AI Governance

A game-theoretic study analyzes when harm-minimizing AI agents can displace approval-seeking RLHF agents in competitive markets, revealing that self-audited AI is not a silver bullet for preventing community harm.

EXPLORE TOPICS