NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...1 min read

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Share
NOW LET US Article – Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains has released Mellum2, an open 12B Mixture-of-Experts model optimized for low-latency text and code tasks. It delivers competitive performance with over 2x faster inference compared to similar-sized models.

  • Mellum2 is a 12B-parameter Mixture-of-Experts model trained from scratch on natural language and code.
  • The model activates only 2.5B parameters per token, making it efficient for high-throughput, low-latency inference. Mellum2 is can be used for routing, RAG, summarization, sub-agents, high-throughput coding features, and private deployments.
  • It is released under the Apache 2.0 license.
  • Compared with similar-sized models, Mellum2 delivers competitive benchmark performance while achieving more than 2x faster inference.
  • Download the model on Hugging Face: https://huggingface.co/collections/JetBrains/mellum-2
  • For architecture details, training setup, benchmarks, and evaluation methodology, read the full technical report: https://arxiv.org/pdf/2605.31268
© 2026 Now Let Us. All rights reserved.

Source: Hugging Face Blog

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – The 29th International Obfuscated C Code Contest (IOCCC) 2025 Winners

dev-tools

The 29th International Obfuscated C Code Contest (IOCCC) 2025 Winners

The 29th International Obfuscated C Code Contest (IOCCC) has announced its 2025 winners, showcasing historic levels of submission volume and quality alongside mind-bending C programming creations.

NOW LET US Related – I design with Claude more than Figma now

dev-tools

I design with Claude more than Figma now

A designer shares how integrating Claude into their workflow completely transformed their process, shifting from static Figma mockups to building fully functional prototypes directly in the codebase.

NOW LET US Related – Valve P2P networking broken for more than 2 months

dev-tools

Valve P2P networking broken for more than 2 months

A major systemic issue with Valve's Steam Networking protocol has been severely impacting P2P gaming in the Middle East for over two months. Despite players contacting ISPs and Steam Support, this routing issue remains unresolved.

NOW LET US Related – Field of clones: How horse replicas came to dominate polo

dev-tools

Field of clones: How horse replicas came to dominate polo

In Argentina, cloning polo horses has evolved from a wild gamble into a highly lucrative, mature industry. While the technology dominates the sport, it continues to spark intense scientific and ethical debates.

NOW LET US Related – Show HN: Oproxy – inspect and modify network traffic from the browser

dev-tools

Show HN: Oproxy – inspect and modify network traffic from the browser

oproxy is a local HTTP, HTTPS, and SOCKS5 proxy for inspecting, replaying, and modifying traffic.

NOW LET US Related – Human-Like Neural Nets by Catapulting

dev-tools

Human-Like Neural Nets by Catapulting

A speculative proposal to train overparameterized neural networks using high learning rates to trigger 'catapulting' or 'grokking', potentially bridging the gap between artificial and human intelligence.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.