Why it matters
The release of Mellum2 shows that JetBrains, a company best known for developer tools, continues to invest in AI models built for specific applications. As a Mixture-of-Experts (MoE) model, it routes each input through only a subset of its parameters, which can make inference cheaper than in a dense model of the same total size and lets individual experts specialize. Both properties could benefit AI-powered developer tools.
JetBrains has introduced Mellum2, a 12-billion-parameter Mixture-of-Experts (MoE) model and the successor to its earlier Mellum model. Mellum2 is available on Hugging Face, where developers and researchers can explore it and integrate it into their projects. The release suggests that JetBrains is focused on strengthening AI capabilities within its own ecosystem, potentially for code generation, intelligent assistance, or other developer-centric applications.
Featured on AI Radar: JetBrains Introduces Mellum2, a 12B Mixture-of-Experts Model