Mistral Unveils Ministral 3B and 8B AI Models for Edge Computing and On-Device Use

October 17, 2024 at 7:04:41 AM

TL;DR Mistral introduces two new AI models: Ministral 3B and Ministral 8B. These models excel in knowledge, reasoning, and efficiency for on-device and edge use cases, supporting up to 128k context length. Ministral 8B features a sliding-window attention pattern for faster inference. They cater to local, privacy-first applications like on-device translation and autonomous robotics. Both models outperform peers in benchmarks and are available now.

Mistral Unveils Ministral 3B and 8B AI Models for Edge Computing and On-Device Use

Mistral has introduced two new AI models, Ministral 3B and Ministral 8B, designed for on-device computing and edge use cases. These models excel in knowledge, commonsense reasoning, function-calling, and efficiency within the sub-10B category. They support up to 128k context length (currently 32k on vLLM), with Ministral 8B featuring an interleaved sliding-window attention pattern for faster and memory-efficient inference.

Use Cases

Les Ministraux cater to local, privacy-first inference needs for applications such as:

  • On-device translation
  • Internet-less smart assistants
  • Local analytics
  • Autonomous robotics

They are also efficient intermediaries for function-calling in multi-step workflows, handling input parsing, task routing, and API calls with low latency and cost.

The performance of les Ministraux has been benchmarked across multiple tasks, consistently outperforming peers. The models were compared to Gemma 2 2B, Llama 3.2 3B, Llama 3.1 8B, and Mistral 7B.

Pretrained Models

  • Ministral 3B and 8B were compared to other models in multiple categories, showcasing superior performance.

Instruct Models

  • Ministral 3B and 8B Instruct models were compared to Gemma 2 2B, Llama 3.2 3B, Llama 3.1 8B, and Gemma 2 9B, demonstrating significant improvements.

Availability and Pricing

Both models are available immediately with the following pricing:

  • Ministral 8B: $0.1 / M tokens (input and output), available under Mistral Commercial License and Mistral Research License.
  • Ministral 3B: $0.04 / M tokens (input and output), available under Mistral Commercial License.

For self-deployed use, commercial licenses are available upon request, with assistance in lossless quantization for specific use cases. Model weights for Ministral 8B Instruct are available for research use, and both models will soon be available from cloud partners.

Mistral AI continues to push the boundaries of frontier models, with Ministral 3B already outperforming the previous Mistral 7B on most benchmarks. Feedback is encouraged as they continue to innovate.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources πŸ‘‡

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

LinkedIn Launches AI-Powered Hiring Assistant Trending ️‍πŸ”₯

LinkedIn Launches AI-Powered Hiring Assistant

LinkedIn
LinkedIn

Official Source

Official Source

LinkedIn is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Audit your GA4 account in Minutes

Audit your GA4 account in Minutes

Sponsored
GA4 Auditor
GA4 Auditor

Verified Sponsor

Verified Sponsor

GA4 Auditor is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
LinkedIn Launches Free AI Professional Certificates

LinkedIn Launches Free AI Professional Certificates

LinkedIn
LinkedIn

Official Source

Official Source

LinkedIn is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Releases SynthID Text for Watermarking AI-Generated Content

Google Releases SynthID Text for Watermarking AI-Generated Content

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Adobe Launches GenStudio for Performance Marketing with Generative AI

Adobe Launches GenStudio for Performance Marketing with Generative AI

Meta Launches 'Movie Gen' for Creating and Editing Videos from Text Prompts

Meta Launches 'Movie Gen' for Creating and Editing Videos from Text Prompts

Meta
Meta

Official Source

Official Source

Meta is a Official Source. The source has been verified by Swipe Insight team.

Official Source
What AI in Search Means for Advertisers' Future

What AI in Search Means for Advertisers' Future

Think with Google
Think with Google

Official Source

Official Source

Think with Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Cloudflare Launches AI Audit to Help Websites Control AI Scraping

Cloudflare Launches AI Audit to Help Websites Control AI Scraping

Cloudflare
Cloudflare

Official Source

Official Source

Cloudflare is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

GA4 Auditor logo

GA4 Auditor

Verified Tool

Verified Tool

GA4 Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated GA4 audits with actionable insights

Get Featured Here

Showcase your tool in this list.

Contact Us
Formula Bot logo

Formula Bot

AI-powered data analysis and visualization tool

Data Analysis
Thunderbit logo

Thunderbit

No-code AI apps and automations for business users

Workflow Automation