NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...2 min read

GLM 5.2 vs. Opus

Share
NOW LET US Article – GLM 5.2 vs. Opus

A head-to-head comparison between the new open-weights model GLM-5.2 and Claude Opus 4.8 in building a 3D platformer from scratch using raw WebGL.

GLM-5.2 just came out, and it's another step forward for what open models can do. The internet promptly freaked out, and it's hard to tell what's real and what's hype.

So we ran it head-to-head against Claude Opus 4.8: same one-shot prompt, build a 3D platformer in raw WebGL from scratch. Here's our take after running the test and digging through the benchmarks and the buzz.

We're not switching our main off Opus. In our test Opus was faster and shipped a cleaner, more correct game, and it can check its own visual output, which the text-only GLM-5.2 can't. But GLM-5.2 earns a permanent spot in the arsenal: it's a genuinely capable model at a fraction of the price, and because it's open weights, it'll always be available. A closed model can be retired or restricted with little warning (Fable was a recent reminder); weights you can download can't be taken away.

You can play both games right now, or grab the source:

**GLM-5.2's game:3dgame-glm.d.ritzademo.comOpus's game:3dgame-opus.d.ritzademo.comSource for both:**github.com/jamesdanielwhitford/glm-5.2-vs-opus-platformers

Both are browser games written from scratch, with no game engine or 3D rendering library like Three.js. The 3D models are free CC0 assets from Kenney.

Here's how the two runs compared:

| Metric | GLM-5.2 (Pi/OpenRouter) | Opus (Claude Code) | |---|---|---| | Wall-clock build time | 1h 10m 40s | 33m 30s | | Output tokens | 131,000 | 216,809 | | Peak context window | 16% of 1M | 19% of 1M | | Tool calls | 128 | 153 | | Cost | $5.39 (real billed) | ~$21.92 (estimate, list pricing) |

GLM-5.2 cost a fraction as much. Opus finished in half the time and shipped a cleaner game.

On paper, the benchmarks put GLM-5.2 just behind the top closed models, and the online buzz is a mix of genuine signal and astroturf. We get into both below, after the game.

What is GLM-5.2â

GLM-5.2 is Z.ai's latest flagship model. It's open weights under an MIT license, so you can download it, run it yourself, or call it through Z.ai's API.

It's built for long-horizon tasks, the kind of long, multi-step coding-agent work that runs for hours. It ships with a 1M-token context window and two thinking effort levels, High and Max, that trade speed for capability.

GLM-5.2 is text-only, not multimodal. It can't read images, so workflows built around screenshots or diagrams still need a model like Claude Opus.

Z.ai positions it roughly between Claude Opus 4.7 and 4.8 at similar token usage. Here's their

© 2026 Now Let Us. All rights reserved.

Source: Hacker News

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – Codex logging bug may write TBs to local SSDs

dev-tools

Codex logging bug may write TBs to local SSDs

A logging bug in Codex continuously writes massive amounts of data to local SQLite databases, potentially reaching 640 TB per year. This high write volume can quickly exhaust the write endurance (TBW) of consumer SSDs, destroying them in less than a year.

NOW LET US Related – Deno Desktop

dev-tools

Deno Desktop

Deno Desktop, introduced in Deno v2.9.0 (canary), compiles Deno projects into self-contained desktop applications with small binaries and native performance.

NOW LET US Related – Sakana Fugu

dev-tools

Sakana Fugu

Fugu-Ultra, a new AI agent system, has demonstrated superior performance over leading frontier models across six complex real-world benchmarks. From autonomously optimizing GPT training recipes and decoding classical Japanese manuscripts to playing blindfold chess and stock trading, Fugu-Ultra showcases the immense potential of orchestrated AI agents.

NOW LET US Related – Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

dev-tools

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

A detailed guide on fine-tuning a tiny local LLM (Qwen 3:0.6B) to categorize household questions for metadata-aware RAG, boosting accuracy from 10% to 92% using Unsloth and opaque ID mapping.

NOW LET US Related – Did my old job only exist because of fraud?

dev-tools

Did my old job only exist because of fraud?

A software engineer reflects on a shocking realization that his early-career startup job, which relocated him to the US, might have only existed as a vehicle for a multi-million dollar venture capital fraud exposed by the SEC.

NOW LET US Related – Apertus – Open Foundation Model for Sovereign AI

dev-tools

Apertus – Open Foundation Model for Sovereign AI

Apertus has officially released Apertus Mini, a set of 16 small language models demonstrating advanced distillation and quantization techniques. Designed to meet EU AI Act requirements and support over 1,000 languages, Apertus aims to redefine open-source and sovereign AI.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.