NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...2 min read

Matadisco – Decentralized Data Discovery

Share
NOW LET US Article – Matadisco – Decentralized Data Discovery

Matadisco is an experimental project that decouples data discovery from storage using the AT Protocol, enabling a decentralized and transparent way to find and share massive open datasets.

Open data is only as useful as it is discoverable

Petabytes of satellite imagery, climate models, and genomic sequences sit in public repositories — yet finding the right data means navigating dozens of siloed portals, each with different interfaces, APIs, and blind spots.

If you generate a derived dataset or clean up an existing one, there's often no way to make it findable. Government portals decide what gets published. Aggregators are centralized. Community contributions get lost.

How Matadisco works

Matadisco separates data discovery from data storage. Three pieces work together:

AT Protocol

Matadisco is built on AT Protocol, an open social protocol. Every record is cryptographically signed. No single entity controls the network and all components are open source and can be self-hosted.

Producers

Write Matadisco records to a PDS (Personal Data Server). A record is a lightweight pointer to metadata — a link, an optional preview, and a timestamp — so the schema works with any metadata standard: STAC, DataCite, IIIF, RSS, and more. A producer typically watches an existing catalogue or data source and publishes records automatically.

Consumers

Read records from the network via a PDS or Jetstream, filter for what's relevant, and present them as a web-based portal for users. A satellite imagery portal, a scientific data hub, a cultural heritage archive — each built in about 100 lines of code.

The schema

The Matadisco record is defined as an ATProto Lexicon. In MLF syntax:

/// A Matadisco record
record matadisco {
/// The time the original metadata/data was published
publishedAt!: Datetime,
/// A URI that links to resource containing the metadata
resource!: Uri,
/// Preview of the data
preview: {
/// The media type the preview has
mimeType!: string,
/// The URL to the preview
url: Uri,
},
}

Only resource and publishdAt are required. The preview is optional — for satellite imagery it's a thumbnail, for articles a summary, for podcasts an audio snippet.

See it in action

The matadisco-viewer streams new ATProto records in real time and renders them. Currently showing Copernicus Sentinel-2 satellite imagery.

Producers & Consumers

Producers write records into the network; consumers read and display them. The prototype demonstrates both roles:

sentinel-to-atproto(producer) — listens to Element 84's Earth Search STAC catalogue for new Sentinel-2 imagery and writes records to a PDS.

gdi-de-csw-to-atproto(producer) — imports metadata from the German geodata catalogue (GDI-DE) via CSW and publishes records to ATProto.

matadisco-viewer(consumer) — subscribes to a Bluesky Jetstream relay or reads from a PDS, filters for Matadisco records, and displays them as a portal with previews.

matadisco-geo-viewer(consumer) — a viewer specialised for geospatial metadata records with support for STAC metadata, rendering spatial previews on a map. Supports consumption from both Jetstream and PDS.

Because records flow through an open network, institutions manage their catalogues independently while participating in shared discovery.

Prior art & influences

  • FROST by Tom Nicholas — a Federated Registry of Scientific Things.
  • Edward Silverton's explorations of ATProto for IIIF and GLAM data.
  • Community discussions on metadata for long-form content and cross-platform discovery.

Get started

Matadisco is experimental — things may break or change. That also means there's room to shape it.

What's next

  • Image-based sources like GLAM collections using IIIF
  • Non-image sources — podcasts, research datasets, publications
  • Schema evolution informed by real-world use across different domains

Publish records under your own namespace, build a portal for your community, or propose changes to the schema. We'd love to hear from anyone working in open data, metadata standards, or scientific infrastructure.

© 2026 Now Let Us. All rights reserved.

Source: Hacker News

Advertisement
Ad slot ready: 5887729102

More in this category

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.