A frontier-class coding model, open and Nvidia-free
Meituan open-sourced LongCat-2.0, a 1.6-trillion-parameter agentic coding model trained entirely on Chinese chips. What open, self-hostable frontier models mean for your dev stack.
The list of AI models you can actually run yourself — not rent through an API — just got a serious new entry. On June 30, China's Meituan open-sourced LongCat-2.0, a 1.6-trillion-parameter coding model that had been quietly topping usage charts under a codename. It was trained without a single Nvidia GPU. For anyone building software, the interesting part isn't the geopolitics — it's that a near-frontier agentic coder is now weights you can download.
What actually happened
Meituan released LongCat-2.0 with 1.6 trillion parameters and a 1-million-token context window, open-sourced under a permissive MIT license (per VentureBeat). It's a Mixture-of-Experts design: 1.6T total parameters but only ~48 billion active per token, which is what makes a model that large affordable to actually serve. Before the reveal, it ran on OpenRouter under the codename "Owl Alpha" and was handling roughly 10 trillion tokens a month — meaning developers were already leaning on it hard before they knew whose it was.
The hardware story is the headline elsewhere: Meituan says it's the first trillion-parameter model to complete full training and inference on a 50,000-card cluster of domestic Chinese AI chips, no Nvidia involved (per the South China Morning Post). That proves frontier-scale training no longer depends on one company's GPUs — which, over time, is what breaks the compute pricing that flows into every AI bill.
Why it matters for your business
Most AI in your tools today is a rented API call. That's fine until the price changes, the model gets deprecated, your data has to leave your building, or a rate limit throttles you mid-workday. Open weights are the alternative: a model you can host on your own infrastructure, pin to a version that won't shift under you, and run against sensitive code or customer data without shipping it to a third party.
For a small shop, the calculus is honest: you probably won't self-host a 1.6T model on a server in the back office — that still needs real hardware. But you don't have to. Open weights this capable mean you can run it on infrastructure you rent and control, or lean on the growing set of hosts that serve open models at a fraction of frontier-API prices. The direction that matters is options. A year ago, "just use the open model" meant accepting a real quality drop for agentic coding work. LongCat-2.0 is another data point that the gap is closing — and every capable open model is leverage on the price you pay the closed ones.
Key takeaways
- Meituan open-sourced LongCat-2.0 on June 30 — a 1.6T-parameter (≈48B active) agentic coding model with a 1M-token context, under an MIT license
- It ran anonymously as "Owl Alpha" on OpenRouter, handling ~10T tokens/month before the reveal — developers were already relying on it
- It's the first trillion-parameter model fully trained and served on domestic Chinese chips with no Nvidia GPUs — frontier scale without the usual hardware dependency
- Capable open weights are leverage: a model you can host, version-pin, and run on your own data — and a check on closed-API pricing
Wondering whether an open model fits your stack? We help you weigh open weights against closed APIs on cost, privacy, and control — and build the system so switching is easy either way. See how we architect it or talk through your use case.
Sources: VentureBeat, South China Morning Post.
- #open-weights
- #coding-agents
- #self-hosting
- #longcat
- #ai-models
Tommy Rush — Founder, Rush Commerce
Operator turned builder. 15+ years running operations — now shipping the systems businesses run on. More
Get The Rush Report weekly — one email, zero fluff.
Keep reading
Straiker's $64M: your AI agents are attack surface
Straiker raised $64M to secure enterprise AI agents. Its research: 36% of coding-agent attacks hit RCE, 91% of productivity-agent attacks led to silent data theft.
Read itLiteLLM RCE: your AI gateway is now attack surface
CVE-2026-42271 turns LiteLLM into unauthenticated remote code execution and API-key theft. What the AI proxy layer means for your security.
Read it