NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
STARTUPS-VC...6 min read

Z.ai launches ZCode to challenge Cursor, Claude Code and GitHub Copilot in AI coding

Share
NOW LET US Article – Z.ai launches ZCode to challenge Cursor, Claude Code and GitHub Copilot in AI coding

Z.ai has officially launched ZCode, a free desktop application described as an 'Agentic Development Environment' built for its flagship GLM-5.2 model, directly challenging Western rivals like Cursor and GitHub Copilot.

Z.ai, the Beijing-based artificial intelligence lab formerly known as Zhipu AI, on Wednesday officially launched ZCode, a free desktop application it describes as an "Agentic Development Environment" purpose-built for its flagship GLM-5.2 large language model. The move marks the company's most aggressive push yet into the fast-growing AI-powered coding tool market, where it now competes directly with Cursor, Claude Code, GitHub Copilot, and Google's Antigravity.

"Introducing ZCode, the official development environment for GLM-5.2," the company wrote on X, noting the tool is available on macOS, Windows, and Linux, supports bring-your-own-key (BYOK) configurations for third-party models, and offers a 1.5x usage-quota bonus for subscribers to its GLM Coding Plan.

Read one way, ZCode is simply another entrant in a crowded market. Read another, it is a single product that crystallizes three of the most consequential trends in enterprise software today: the race-to-the-bottom pricing of frontier AI models, the geopolitical balkanization of the AI stack, and the rapid maturation of agentic coding agents into what Gartner now estimates is a roughly $10 billion market.

An AI coding tool designed to think in projects, not prompts

Unlike traditional IDEs that bolt on AI through a chat sidebar or autocomplete extension, ZCode is best understood as an agent-first development environment. Its core design is built around long-horizon tasks: the user describes an outcome, the agent plans the work, edits files, runs checks, reviews progress, and continues across multiple iterations until the goal is met.

ZCode organizes the development experience around the ZCode Agent, deeply tuned for GLM-5.2, with emphasis on deep integration: the model, tools, and execution workflow are tuned together so the Agent fits continuous, multi-step real-world development tasks. The environment supports continuous follow-up across devices: desktop, mobile Remote, and Feishu / WeChat Bot can all keep the same workspace task moving. Sensitive commands, file changes, and high-permission actions go through confirmation before execution.

That remote-control feature — the ability to steer a running coding agent from WeChat, Feishu, or Telegram on a phone — is a differentiator that speaks directly to the Chinese developer market, where those messaging platforms dominate professional communication. You can keep checking progress and adding instructions while long-running work continues, from any device with these messaging apps.

The tool is free to download. Revenue flows through Z.ai's GLM Coding Plan subscription tiers, which start at $16.20 per month for a "Lite" plan and scale to $144 per month for "Max" — prices that undercut Anthropic's Claude Code and Cursor's comparable tiers by significant margins.

Through July 31, ZCode is offering a promotional 1.5x effective quota bonus for Coding Plan subscribers, with off-peak token consumption charged at a 0.67x coefficient. The platform also supports multiple AI models and agents, including Claude Code, Codex, Gemini, and OpenCode — a pragmatic concession to the reality that no single model wins every task.

GLM-5.2, the open-source model trained entirely on Chinese chips, powers the whole experience

ZCode's value proposition is inseparable from GLM-5.2, the model it was designed to showcase. Z.ai released GLM-5.2 on June 16, first to its Coding Plan subscribers and subsequently as open-source weights under the MIT license on Hugging Face — a sequencing decision that prioritized distribution over the traditional benchmark-led launch.

The model's specifications are formidable. GLM-5.2 is a 744-billion-parameter mixture-of-experts architecture with 40 billion active parameters, a genuine one-million-token context window — five times the 200K limit on its predecessor — and training on 28.5 trillion tokens. It ranked second globally on Code Arena as of mid-June, trailing only Anthropic's Claude Fable 5, making it one of the highest-performing publicly available models for coding tasks.

Critically, the model was built entirely without American chips. As Decrypt reported, GLM-5.2 "runs entirely on Huawei silicon." Stability AI founder Emad Mostaque estimated total training costs at roughly $25 million, with 80 percent spent on post-training — a figure that, if accurate, would make GLM-5.2 extraordinarily cheap relative to Western frontier models.

On benchmarks, GLM-5.2 performs within striking distance of the best proprietary systems. It trails Anthropic's Claude Opus 4.8 by just one percentage point on FrontierSWE, a benchmark measuring multi-hour autonomous engineering projects, while edging out OpenAI's GPT-5.5.

Its API pricing — $1.40 per million input tokens and $4.40 per million output — are a cost reduction of up to 82 percent compared to Anthropic's Claude Opus 4.8 at $5 and $25, respectively. Because ZCode is a first-party tool from the same company that makes the model, it requires no manual endpoint configuration — the model is wired in.

The Anthropic export ban gave Chinese AI its biggest opening yet

ZCode's arrival cannot be separated from the geopolitical drama that has roiled the AI industry over the past three weeks. On June 12, the U.S. government, citing national security authorities, issued an export control directive suspending all access to Anthropic's Fable 5 and Mythos 5 models by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. Enterprise clients in finance, healthcare, SaaS, and critical infrastructure found their core intelligence services abruptly disabled, without exception, prior warning, or effective recourse.

While the Trump administration lifted those controls just yesterday — Anthropic confirmed on June 30 that the Department of Commerce had rescinded the directive — the episode sent shockwaves through the developer community and accelerated interest in open-source, self-hostable alternatives. The government's crackdown on Anthropic coincided with a swift rise in Chinese open-source models that are proving to be almost as capable and significantly cheaper than some of the most powerful U.S. models.

Z.ai's timing was surgical. On the same day the Trump administration ordered Anthropic's most advanced models blocked for foreign nationals, Zhipu announced the open-source release of GLM-5.2 with no usage restrictions. The South China Morning Post reported that GLM-5.2 would be available to all users of Zhipu's new GLM Coding Plan subscription, "priced at just a tenth of Anthropic's premium Claude Code and Claude Max tiers."

The market responded accordingly. Zhipu AI's market capitalization crossed HK$1 trillion (US$128 billion) on June 22, driven by a 42 percent intraday share surge. JPMorgan raised its 2026–2030 revenue forecast for Zhipu by between 7 and 16 percent following the launch, projecting an over 534 percent revenue surge for 2026 and expecting the AI firm to turn a profit by 2028.

Why vendor lock-in now carries a geopolitical risk that no SLA can cover

The Fable 5 episode did more than embarrass Anthropic. It introduced a new risk category into enterprise AI procurement: sovereign access risk. When a government can disable a commercially deployed AI model overnight, the traditional evaluation criteria of developer experience, benchmark scores, and pricing become secondary to a more fundamental question: Will this tool still work tomorrow?

The event exposed the inadequacy of standard enterprise contract language. An investigation by FifthRow found that almost all standard Data Processing Addenda, SaaS agreements, and procurement SLAs "relied on vague 'force majeure' or 'compliance with law' catch-alls, not on precise, actionable regulatory suspension or kill-switch clauses."

ZCode's BYOK architecture and GLM-5.2's MIT-licensed open weights offer a partial answer. A development team can download the model, host the model locally, and run ZCode entirely within their own private cloud infrastructure, insulated from the whims of foreign regulators.

© 2026 Now Let Us. All rights reserved.

Source: VentureBeat

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – Trunk Tools' stack cut document review from 60 days to 10 by ditching general-purpose models

startups-vc

Trunk Tools' stack cut document review from 60 days to 10 by ditching general-purpose models

Trunk Tools built a specialized three-layer AI architecture to handle complex construction data, cutting document review times from months to days. By moving away from general-purpose LLMs, the company successfully automated highly-specific workflows with high accuracy.

NOW LET US Related – New Alibaba AI framework skips loading every tool, cutting agent token use 99%

startups-vc

New Alibaba AI framework skips loading every tool, cutting agent token use 99%

Researchers at Alibaba have developed SkillWeaver, a framework that optimizes how AI agents route tasks to specific tools. By utilizing a feedback loop called Skill-Aware Decomposition (SAD), it dramatically improves accuracy while cutting token consumption by over 99%.

NOW LET US Related – DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

startups-vc

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DeepSeek has open-sourced DSpark, an MIT-licensed framework that speeds up LLM inference by up to 85% using speculative decoding. The system aims to solve the high cost of AI deployment by making large models serve users faster and more efficiently.

NOW LET US Related – Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60%

startups-vc

Researchers introduce Self-Harness, a framework that lets AI agents rewrite their own rules, boosting performance up to 60%

Researchers have introduced Self-Harness, a novel framework that enables LLM-based agents to systematically analyze and rewrite their own operating rules. This self-improving approach eliminates manual debugging and boosts agent performance by up to 60%.

NOW LET US Related – 7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes

startups-vc

7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes

Three of the most popular AI agent frameworks—Langflow, LangGraph, and LangChain—are facing severe security vulnerabilities, exposing sensitive API keys and enabling remote code execution. With thousands of servers already under active attack, these classic application security flaws highlight the risks of insecure defaults in rapid AI deployment.

NOW LET US Related – Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand.

startups-vc

Fine-tuning forgets. RAG leaks context. Hypernetworks build the model your agent needs on demand.

Enterprise AI agents often stall in production due to the limitations of fine-tuning and RAG. Hypernetworks offer a breakthrough by generating small, task-specific models on demand, bypassing context limits and retraining costs.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.