#mixture-of-experts

Cohere Releases North Mini Code, a 30B-Parameter MoE Model for Agentic Software Engineering

LLMs Jun 10, 2026

Cohere's first coding-specialized model combines a sparse mixture-of-experts architecture with reinforcement learning for agent-based development tasks.

JetBrains Releases Mellum2, a 12B Sparse Model for Sub-Second Inference

LLMs Jun 1, 2026

JetBrains' new Mixture-of-Experts model achieves a 2x speedup over dense peers while activating just 2.5B parameters per token.