BigQuery ML Expands with Advanced Embedding Support and AI Features

September 26, 2024 at 6:43:52 AM

TL;DR Google has updated BigQuery ML with new embedding support and AI features. Embedding features include multimodal embeddings for text, image, and video, structured data embeddings using PCA or autoencoder models, and user/item embeddings via matrix factorization. AI features include document processing with the Document AI API and audio transcription with the Speech-to-Text API. These updates are generally available.

BigQuery ML Expands with Advanced Embedding Support and AI Features

Google has announced updates to BigQuery ML, enhancing its machine learning capabilities with new embedding support features and AI functionalities. These updates aim to provide users with more powerful tools for data analysis and processing across various modalities.

Expanded Embedding Support

BigQuery ML now offers the following new embedding support features:

  1. Multimodal Embeddings: Users can create embeddings for text, image, and video in the same semantic space using the ML.GENERATE_EMBEDDING function with Vertex AI's multimodal embedding large language models (LLMs).

  2. Structured Data Embeddings: The ML.GENERATE_EMBEDDING function now supports embeddings for structured independent and identically distributed (IID) data using principal component analysis (PCA) or autoencoder models.

  3. User/Item Embeddings: Matrix factorization models can be used with the ML.GENERATE_EMBEDDING function to create embeddings for user or item data.

To help users leverage these new capabilities, BigQuery has provided several tutorials:

  • Generating image embeddings
  • Generating video embeddings
  • Generating text embeddings
  • Generating and searching multimodal embeddings

New AI Features

BigQuery ML has introduced two major AI features:

  1. Document Processing:

    • Users can create a remote model based on the Document AI API, specifying a document processor.
    • The ML.PROCESS_DOCUMENT function can be used with this remote model to process documents from BigQuery object tables.
  2. Audio Transcription:

    • A remote model based on the Speech-to-Text API can be created, specifying a speech recognizer.
    • The new ML.TRANSCRIBE function works with this remote model to transcribe audio files from BigQuery object tables.

Tutorials are available for both of these features:

  • Processing documents with the ML.PROCESS_DOCUMENT function
  • Transcribing audio files with the ML.TRANSCRIBE function

Availability

All these new embedding support features and AI capabilities are now generally available (GA) in BigQuery ML.

These updates represent a significant enhancement to BigQuery ML's functionality, offering users more sophisticated tools for handling diverse data types, from structured data to text, images, audio, and video. They mark an important step in integrating advanced AI and machine learning capabilities directly into BigQuery's powerful data warehousing and analytics platform.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources πŸ‘‡

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

AI marketing workflows made simple

AI marketing workflows made simple

Featured
Markifact
Markifact

Verified Sponsor

Verified Sponsor

Markifact is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
BigQuery Introduces Metadata Caching for SQL Translation

BigQuery Introduces Metadata Caching for SQL Translation

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Data Transfer Service adds Google Analytics 4 reporting support Trending ️‍πŸ”₯

BigQuery Data Transfer Service adds Google Analytics 4 reporting support

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Updates BigQuery with New Pipelines, Version Control, and Claude AI Integration

Google Updates BigQuery with New Pipelines, Version Control, and Claude AI Integration

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Makes Analytics Hub Egress Controls and Data Clean Room Subscriptions Widely Available

BigQuery Makes Analytics Hub Egress Controls and Data Clean Room Subscriptions Widely Available

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Data Transfer Service adds custom reports support for Google Ads

BigQuery Data Transfer Service adds custom reports support for Google Ads

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Enhances Gemini in BigQuery with Python Code Completion

Google Enhances Gemini in BigQuery with Python Code Completion

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Enhances Resource Utilization Charts with New Metrics and Configuration Options

BigQuery Enhances Resource Utilization Charts with New Metrics and Configuration Options

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us
Databricks logo

Databricks

Generative AI-powered data intelligence platform

Data Engineering
GA4 SQL logo

GA4 SQL

Verified Tool

Verified Tool

GA4 SQL is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Generate GA4 BigQuery queries easily

Data Analysis
TapClicks logo

TapClicks

Automated marketing solutions powered by your data

Data Engineering
Stitch logo

Stitch

Automated cloud data pipelines, no coding needed

Data Engineering
Akkio logo

Akkio

AI-powered analytics for agencies

Data Analysis
NinjaCat logo

NinjaCat

AI-powered marketing data and analytics platform

Reporting
Funnel logo

Funnel

Aggregate and analyze marketing data seamlessly

Reporting
Fivetran logo

Fivetran

Effortlessly centralize and move data from any source

Data Engineering

Get Featured Here

Showcase your tool in this list.

Contact Us