Meta Releases New AI Models for Audio, Text, and Watermarking

June 19, 2024 at 5:46:55 AM

TL;DR: Meta’s Fundamental AI Research (FAIR) team is releasing new AI models and tools for audio generation, text-to-vision, and watermarking. Key releases include JASCO for text-to-music generation and AudioSeal for watermarking AI-generated speech. Chameleon models for visual and textual tasks will also be available. FAIR aims to inspire further advancements in AI by sharing its research publicly.

Meta’s Fundamental AI Research (FAIR) team has unveiled several new AI models and tools focusing on audio generation, text-to-vision, and watermarking. These releases aim to inspire further research and advance AI responsibly.

JASCO: Text-to-Music Generation

Meta introduced JASCO (Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation), a text-to-music model that accepts conditioning inputs such as chords or beats to give more control over the generated music. Users can adjust features of the generated sound, such as chords, drums, and melodies, through text. The JASCO inference code will be part of the AudioCraft audio model library under an MIT license, while the pre-trained model will be available under a non-commercial Creative Commons license.

Listen to samples here.

AudioSeal: AI-Generated Speech Watermarking

AudioSeal is another tool from Meta. It adds watermarks to AI-generated speech so that the content can later be identified as AI-generated. It supports localized detection of AI-generated segments within longer audio snippets, which Meta says makes detection up to 485 times faster than previous methods. AudioSeal will be released under a commercial license.
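To make "localized detection" concrete, the sketch below turns per-frame watermark scores into time spans flagged as AI-generated. It does not use the AudioSeal library: the function name `watermarked_spans`, the score values, the frame length, and the threshold are all hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def watermarked_spans(
    scores: List[float],
    frame_seconds: float,
    threshold: float = 0.5,
) -> List[Tuple[float, float]]:
    """Merge consecutive frames whose score meets the threshold into spans.

    `scores` stands in for the per-frame output of some watermark detector
    (hypothetical here); each frame covers `frame_seconds` of audio.
    Returns (start_seconds, end_seconds) pairs.
    """
    spans = []
    start = None  # index of the first frame in the span being built
    for i, score in enumerate(scores):
        if score >= threshold and start is None:
            start = i  # a flagged span begins
        elif score < threshold and start is not None:
            spans.append((start * frame_seconds, i * frame_seconds))
            start = None  # the span ended on the previous frame
    if start is not None:  # the audio ended while still inside a span
        spans.append((start * frame_seconds, len(scores) * frame_seconds))
    return spans


# Made-up scores: frames 2-4 and 7-8 look watermarked.
scores = [0.1, 0.2, 0.9, 0.95, 0.8, 0.1, 0.05, 0.7, 0.9]
spans = watermarked_spans(scores, frame_seconds=0.5)
print(spans)
```

With these made-up scores, the flagged spans cover seconds 1.0 to 2.5 and 3.5 to 4.5 of the clip, while the rest is treated as unmarked.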

Get the model here.

Chameleon: Multimodal Text Model

Meta Chameleon is a family of models that can take any combination of text and images as input and produce any combination of text and images as output, using a single unified architecture for both encoding and decoding. While most current late-fusion models use diffusion-based learning, Meta Chameleon uses tokenization for both text and images. This enables a more unified approach and makes the model easier to design, maintain, and scale. Possible uses include generating creative captions for images or combining text prompts and images to create an entirely new scene.
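One way to picture tokenizing both text and images is a single sequence drawn from one shared vocabulary. The sketch below is a toy illustration of that idea, not Chameleon's actual tokenizer: the vocabulary sizes, the offset scheme, and the function names `to_unified` and `from_unified` are all assumptions.

```python
from typing import List, Tuple

TEXT_VOCAB_SIZE = 1000      # hypothetical number of text tokens
IMAGE_CODEBOOK_SIZE = 512   # hypothetical number of discrete image codes

Segment = Tuple[str, List[int]]  # ("text" or "image", modality-local ids)


def to_unified(segments: List[Segment]) -> List[int]:
    """Flatten interleaved text/image segments into one token sequence.

    Text ids keep their value; image codes are shifted past the text
    vocabulary so both modalities share a single id space.
    """
    unified = []
    for modality, ids in segments:
        if modality == "text":
            limit, offset = TEXT_VOCAB_SIZE, 0
        elif modality == "image":
            limit, offset = IMAGE_CODEBOOK_SIZE, TEXT_VOCAB_SIZE
        else:
            raise ValueError(f"unknown modality: {modality}")
        for token_id in ids:
            if not 0 <= token_id < limit:
                raise ValueError(f"{modality} id out of range: {token_id}")
            unified.append(token_id + offset)
    return unified


def from_unified(unified: List[int]) -> List[Segment]:
    """Split a unified sequence back into per-modality segments."""
    segments = []
    for token in unified:
        if token < TEXT_VOCAB_SIZE:
            modality, local = "text", token
        else:
            modality, local = "image", token - TEXT_VOCAB_SIZE
        if segments and segments[-1][0] == modality:
            segments[-1][1].append(local)  # extend the current segment
        else:
            segments.append((modality, [local]))  # start a new segment
    return segments


# A made-up prompt: two text tokens, three image codes, one text token.
prompt = [("text", [12, 7]), ("image", [3, 500, 41]), ("text", [99])]
sequence = to_unified(prompt)
print(sequence)
print(from_unified(sequence) == prompt)
```

Because everything ends up as integer tokens in one sequence, a single sequence model could in principle read and emit either modality, which is the appeal of the unified design described above.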

FAIR will release two sizes of its multimodal text model, Chameleon 7B and 34B, under a research-only license. These models are designed for tasks requiring visual and textual understanding, such as image captioning. However, the Chameleon image generation model will not be released at this time.

Request access to the model here.

Multi-Token Prediction Approach

Meta will also provide researchers access to its multi-token prediction approach, which trains language models to predict multiple future words at once rather than one at a time. This will be available under a non-commercial, research-only license.
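The difference between the two training setups can be seen in how the prediction targets are built. The sketch below is a minimal illustration, not Meta's training code: the function name `multi_token_pairs`, the parameter `n_future`, and the example sentence are hypothetical, and whole words stand in for model tokens.

```python
from typing import List, Tuple


def multi_token_pairs(
    tokens: List[str], n_future: int
) -> List[Tuple[List[str], List[str]]]:
    """Build (context, targets) training pairs.

    For every prefix of `tokens`, the targets are the next `n_future`
    tokens. With n_future=1 this reduces to ordinary next-token
    prediction; prefixes with fewer than `n_future` tokens left are skipped.
    """
    if n_future < 1:
        raise ValueError("n_future must be at least 1")
    pairs = []
    for t in range(1, len(tokens) - n_future + 1):
        context = tokens[:t]               # everything seen so far
        targets = tokens[t:t + n_future]   # the next n_future tokens
        pairs.append((context, targets))
    return pairs


tokens = ["the", "cat", "sat", "on", "the", "mat"]

single = multi_token_pairs(tokens, n_future=1)  # one target per context
multi = multi_token_pairs(tokens, n_future=4)   # four targets per context

print(single[0])
print(multi[0])
```

In the single-token case the context "the" is paired with only "cat"; in the multi-token case the same context is paired with "cat", "sat", "on", "the", so each training step asks the model to look several words ahead.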

Get the model on Hugging Face here.
