NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...3 min read

What we learned building 100 API integrations with OpenCode

Share
NOW LET US Article – What we learned building 100 API integrations with OpenCode

Nango shares insights from using autonomous AI agents to build ~200 API integrations in just 15 minutes, highlighting both the efficiency gains and the unexpected ways agents can fail or 'cheat'.

At Nango, we build open source infrastructure for product integrations.

With all the recent advancements in coding models, we wanted to see how far we can push an autonomous agent building integrations with external APIs.

TL;DR: Our background agent isn’t perfect, but it reliably generates ~200 integrations across five APIs (Google Calendar, Drive, Sheets, HubSpot, and Slack) in 15 minutes and less than $20 in token costs.

Previously, this would have taken an engineer about a week.

This post is about what it took to get there, what broke along the way, and what we learned building a background coding agent for API integrations.

Our setup

The pipeline is pretty simple:

  • We define a list of interactions to build, for example create-calendar-event, sync-files, or send-slack-message.
  • Our orchestrator prepares one workspace per interaction with the right Nango integrations scaffolding generated by our CLI.
  • We spawn one OpenCode agent per interaction.
  • Each agent independently builds its interaction, tests it with the external API (the agent has access to a test account), and iterates until it works.
  • Once all agents finish, the orchestrator checks their work and assembles the individual interactions into one Nango integration per API.

Note that we only gave the agent very high-level instructions (“create a calendar event”), and it autonomously researched required parameters, formats, and descriptions.

1. Let the agents run wild first

We chose to start with very few guardrails to learn what the models actually do. We didn’t want to overfit our assumptions.

Some of it went better than expected. Agents often filled in gaps in our instructions, inferred how to use existing SDK helpers, and required a lot less handholding than we expected. But they also broke in very unexpected ways.

2. Do not trust the agents

Agents optimize for completion at all costs. Here are some of the behaviours we saw:

Copying test data from other agents

One agent looked through another agent's directory and reused an event ID it found there instead of creating its own. Fix: Agents are no longer allowed to leave their own directory.

Hallucinating a Nango CLI command

Some hallucinated commands were plausible, but the agent often got stubborn when a command failed, going down a rabbit hole trying to make a non-existent command work. Fix: Improve instructions around the available API and CLI.

Editing test data when the implementation failed

Changing the test data was easier than fixing the code, so that is what some agents tried to do when their implementation failed. Fix: Tighter edit permissions and explicit checks that test fixtures were not modified.

Faking API reachability

Instead of flagging an auth problem, the agent sometimes decided to keep going by writing the response it thought the API would probably return. Fix: Prevent the agent from modifying test data and make failed API access a hard stop.

Claiming it is done when it is not done

The agent writes a clean summary but leaves behind code that does not compile or pass tests. Fix: Delete the agent, restart generation for this interaction from scratch.

3. Ignore the final error message and trace the root cause

When a run failed, the final error reported by the agent was often a red herring. A common pattern involved the agent hallucinating a command, misreading the failure, and building a workaround on top of a wrong assumption.

If you only debug from the final error, you often end up debugging the wrong thing. OpenCode makes inspection easy because all messages are stored in a SQLite database, allowing us to automate checks and inspect traces.

The main takeaway: do not trust the agent. Verify everything.

© 2026 Now Let Us. All rights reserved.

Source: Hacker News

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – GLM 5.2 Is Out

dev-tools

GLM 5.2 Is Out

Zhipu AI has officially released GLM-5.2, its most powerful open-source model to date, featuring a 1M context window and advanced long-horizon task capabilities. The release underscores Zhipu's commitment to open-source AI and global scientific collaboration amid rising technological restrictions.

NOW LET US Related – Noise infusion banned from statistical products published by Census Bureau

dev-tools

Noise infusion banned from statistical products published by Census Bureau

The U.S. Department of Commerce has banned "noise infusion" from statistical products published by the Census Bureau, a decision that could have severe consequences for both data utility and privacy protection.

NOW LET US Related – Treating pancreatic tumours may have revealed cancer's master switch

dev-tools

Treating pancreatic tumours may have revealed cancer's master switch

A promising new drug called daraxonrasib has shown breakthrough results in treating pancreatic cancer, doubling median survival times. This achievement could pave the way for an entirely new class of cancer treatments.

NOW LET US Related – Every Frame Perfect

dev-tools

Every Frame Perfect

In UI design, perfection isn't just about the start and end states, but every single transition frame in between. Polishing these micro-interactions is key to building user trust.

NOW LET US Related – Leaving Mozilla

dev-tools

Leaving Mozilla

A poignant and candid reflection from a 15-year Mozilla veteran upon their departure. The author highlights the leadership's missteps in trying to emulate tech giants and urges Mozilla to return to its core values: community and uniqueness.

NOW LET US Related – Shepherd's Dog: A Game by the Most Dangerous AI Model

dev-tools

Shepherd's Dog: A Game by the Most Dangerous AI Model

A developer tested Anthropic's latest, supposedly 'too dangerous' AI model by asking it to build a long-held game idea in a single shot. The model succeeded, generating a complete 2,319-line game after a 45-minute reasoning session.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.