NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...1 min read

MosaicLeaks: Can your research agent keep a secret?

Share
NOW LET US Article – MosaicLeaks: Can your research agent keep a secret?

Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information. MosaicLeaks proposes a new deep-research task with multi-hop questions that interleave public and private information.

Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information. MosaicLeaks proposes a new deep-research task with multi-hop questions that interleave public and private information. Across the models we tested, agents frequently leaked private information, and training only for task performance made it worse. We propose a mosaic-leakage-aware RL training method, Privacy-Aware Deep Research (PA-DR), which raises strict chain success (the share of chains where every hop is answered correctly) from 48.7% to 58.7% while reducing answer/full-information leakage from 34.0% to 9.9%.

A research agent at a healthcare firm is working through a routine question, and along the way it fires off a handful of ordinary-looking web searches. One references a cloud-migration milestone, one a January 2024 security disclosure, one narrows down which vendor got hit. No single query necessarily gives away the whole secret. But anyone watching the agent's outbound traffic can reassemble the fragments: MediConn had migrated 70% of its infrastructure to the cloud by January 2025, a fact that lived only in private documents. This is the mosaic effect, and it's the failure mode at the centre of MosaicLeaks.

MosaicLeaks treats those web queries as the leakage channel: the adversary never sees the private documents or the agent's reasoning, only the cumulative query log, and tries to infer private enterprise information from it.

We measure leakage in three ways, depending on what the adversary can infer from the observed queries:

| Leakage type | What the adversary sees | What counts as leakage | |---|---|---| | Intent leakage | Only the agent's web-query log | The adversary can infer the private research questions or goals the agent was trying to answer | | Answer leakage | The web-query log plus a question about private information | The adversary can answer those private questions without seeing the private documents | | Full-information leakage | Only the web-query log | The adversary can state verifiably true private claims, even without being given the questions |

These three represent increasing levels of concern. Intent leakage reveals what the agent is investigating. Answer leakage means the query log holds enough to answer a private question someone already has in hand. Full-information leakage is the strongest case: the observer can discover and state private facts without being told what to look for.

© 2026 Now Let Us. All rights reserved.

Source: Hugging Face Blog

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – Who Owns Your ATProto Identity? Hint: It's Probably Not You

dev-tools

Who Owns Your ATProto Identity? Hint: It's Probably Not You

An in-depth analysis of ATProto's key management system reveals critical security risks, where PDS operators hold absolute control over user identities and actions across the entire ecosystem.

NOW LET US Related – Beyond All Reason (Free Total Annihilation Inspired RTS)

dev-tools

Beyond All Reason (Free Total Annihilation Inspired RTS)

Inspired by the legendary Total Annihilation, Beyond All Reason is a free-to-play RTS game that redefines the genre with unmatched scale and real-time physics simulation.

NOW LET US Related – A 3D voxel game engine written in APL

dev-tools

A 3D voxel game engine written in APL

An experimental project demonstrates the viability of using the APL array programming language to build a 3D voxel game engine, challenging traditional game development workflows.

NOW LET US Related – The 100k Whys of AI

dev-tools

The 100k Whys of AI

While large language models (LLMs) are highly sophisticated, distinguishing AI-generated content remains feasible due to the quasi-deterministic nature of the technology. The flood of repetitive books on Amazon clearly demonstrates how AI tends to replicate itself when faced with similar prompts.

NOW LET US Related – Public Service Announcement: Don't Say You Use AI for Writing

dev-tools

Public Service Announcement: Don't Say You Use AI for Writing

Publicly admitting to using AI for writing can inadvertently ruin your professional reputation. When the line between 'assisting' and 'replacing' becomes blurred, readers and professionals will likely question your actual capabilities.

NOW LET US Related – Building reliable agentic AI systems

dev-tools

Building reliable agentic AI systems

A case study on how Bayer and Thoughtworks built PRINCE, an agentic AI platform that transforms preclinical drug discovery data retrieval using Agentic RAG and Text-to-SQL.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.