Editorial illustration of open model blocks and mixture-of-experts routing paths
Editorial illustration of open model blocks and mixture-of-experts routing paths
+ Large Language Models News

Mistral 3 raises the open-model bar with Apache-licensed dense and MoE releases

Mistral 3 includes three small dense models and Mistral Large 3, a 675B-parameter sparse MoE with 41B active parameters released under Apache 2.0.

Mistral AI announced Mistral 3, a new model family that includes three small dense models and Mistral Large 3, its most capable model to date. The headline for builders is not only performance. The models are released under the Apache 2.0 license.

Mistral Large 3 is a sparse mixture-of-experts model with 41 billion active parameters and 675 billion total parameters. Mistral says it was trained from scratch on 3,000 NVIDIA H200 GPUs.

What changed

Mistral says Large 3 is its first mixture-of-experts model since the Mixtral series and that it debuts at number two in the OSS non-reasoning models category on LMArena. The company also released smaller Ministral models at 14B, 8B, and 3B.

The availability story is broad: Mistral AI Studio, Amazon Bedrock, Azure Foundry, Hugging Face, Modal, IBM watsonx, OpenRouter, Fireworks, Unsloth AI, Together AI, and others.

Why this matters

Open models matter when teams need customization, local deployment, auditability, or cost control. Apache licensing makes the release more useful for commercial builders than more restrictive open-weight drops.

The smaller models are also important. Not every AI workload needs a frontier-scale model. If the small models are strong enough, they can push more inference to edge devices, private environments, and cost-sensitive applications.

What to watch next

Watch the reasoning version Mistral says is coming soon, plus real deployment reports from teams serving Large 3 with vLLM, TensorRT-LLM, SGLang, and compressed checkpoints. Open model quality only matters if it can be served economically.

Sources

The AI Feed Desk

The AI Feed Desk

Editorial desk

The AI Feed Desk tracks AI provider updates, model releases, agent tooling, and enterprise adoption, turning fast-moving announcements into source-linked context for builders and operators.

Noticed a typo, incorrect information, or translation error?

Tell us so we can fix it.

Help Improve This Article

Related Articles

Anthropic's Stainless acquisition brings Claude closer to the API layer

Anthropic's May 18 acquisition of Stainless gives Claude Platform deeper control over SDKs, CLIs, and MCP server tooling, showing how agent competition is moving into API connectivity.

The AI Feed Desk

By The AI Feed Desk

Google I/O 2026 makes Gemini the operating layer for agents

Google's May 19 I/O updates put Gemini 3.5 Flash, Gemini Spark, Android automation, and Universal Cart into one agent strategy across apps, devices, developer tools, and commerce.

The AI Feed Desk

By The AI Feed Desk

OpenAI says its model disproved an 80-year geometry conjecture

OpenAI's May 20 unit-distance proof gives AI research a concrete milestone: a model-generated result checked by mathematicians, with human verification still central.

The AI Feed Desk

By The AI Feed Desk

Claude for Small Business turns agent workflows into a toggle install

Anthropic's May 13 Claude for Small Business release packages 15 workflows and connectors for QuickBooks, PayPal, HubSpot, Canva, Docusign, Google Workspace, and Microsoft 365.

The AI Feed Desk

By The AI Feed Desk

KPMG is embedding Claude into Digital Gateway and 276,000 employee workflows

Anthropic's May 19 KPMG alliance puts Claude inside Digital Gateway, tax and legal tools, and a global workforce, showing enterprise AI moving into owned workflow software.

The AI Feed Desk

By The AI Feed Desk