French AI startup Mistral is introducing new AI model customization options, including paid plans, to enable developers and enterprises to fine-tune its generative models for specific use cases.
Self-Service SDK
Mistral has released a software development kit (SDK) named Mistral-Finetune for fine-tuning its models on various hardware setups, including workstations, servers, and small datacenter nodes. The SDK is optimized for multi-GPU setups but can also scale down to a single Nvidia A100 or H100 GPU for smaller models like Mistral 7B. For example, fine-tuning on a dataset like UltraChat, which includes 1.4 million dialogs with OpenAI’s ChatGPT, takes about half an hour using eight H100 GPUs.
Managed Fine-Tuning Services
For those preferring a managed solution, Mistral offers fine-tuning services through its API. Currently, these services are compatible with Mistral Small and Mistral 7B models, with plans to support more models in the coming weeks.
Custom Training Services
Mistral is also debuting custom training services, available only to select customers, to fine-tune any Mistral model using an organization’s specific data. This approach aims to create highly specialized and optimized models tailored to specific domains.
Mistral is reportedly seeking to raise around $600 million at a $6 billion valuation from investors like DST, General Catalyst, and Lightspeed Venture Partners. The company is looking to grow its revenue amidst increasing competition in the generative AI space. Since unveiling its first generative model in September 2023, Mistral has released several more models, including a code-generating model, and has introduced paid APIs. However, the company has not disclosed user numbers or revenue figures.