BigQuery ML Expands with Advanced Embedding Support and AI Features

September 26, 2024 at 6:43:52 AM

TL;DR Google has updated BigQuery ML with new embedding support and AI features. Embedding features include multimodal embeddings for text, image, and video, structured data embeddings using PCA or autoencoder models, and user/item embeddings via matrix factorization. AI features include document processing with the Document AI API and audio transcription with the Speech-to-Text API. These updates are generally available.

Google has announced updates to BigQuery ML that add new embedding support and two AI features. The embedding additions cover text, image, video, and structured data, and the AI features bring document processing and audio transcription to data stored in BigQuery object tables.

Expanded Embedding Support

BigQuery ML now offers the following new embedding support features:

  1. Multimodal Embeddings: Users can create embeddings for text, image, and video in the same semantic space using the ML.GENERATE_EMBEDDING function with Vertex AI's multimodal embedding large language models (LLMs).

  2. Structured Data Embeddings: The ML.GENERATE_EMBEDDING function now supports embeddings for structured independent and identically distributed (IID) data using principal component analysis (PCA) or autoencoder models.

  3. User/Item Embeddings: Matrix factorization models can be used with the ML.GENERATE_EMBEDDING function to create embeddings for user or item data.
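The embedding cases above follow one pattern: create a model, then pass it to ML.GENERATE_EMBEDDING. The Python sketch below only assembles the SQL text for the multimodal and PCA cases, so it runs without a BigQuery client. All dataset, model, table, and connection names are hypothetical placeholders. The `multimodalembedding@001` endpoint and the component count are assumptions that the announcement does not specify, so check the BigQuery ML reference before using them. The matrix factorization case is left out for brevity.

```python
def create_remote_embedding_model_sql(model: str, connection: str, endpoint: str) -> str:
    """Build CREATE MODEL text for a remote Vertex AI embedding model."""
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"REMOTE WITH CONNECTION `{connection}`\n"
        f"OPTIONS (ENDPOINT = '{endpoint}');"
    )


def create_pca_model_sql(model: str, source_table: str, components: int) -> str:
    """Build CREATE MODEL text for a PCA model trained on a structured table."""
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"OPTIONS (MODEL_TYPE = 'PCA', NUM_PRINCIPAL_COMPONENTS = {components}) AS\n"
        f"SELECT * FROM `{source_table}`;"
    )


def generate_embedding_sql(model: str, input_table: str) -> str:
    """Build a query that calls ML.GENERATE_EMBEDDING on a table."""
    return (
        "SELECT *\n"
        "FROM ML.GENERATE_EMBEDDING(\n"
        f"  MODEL `{model}`,\n"
        f"  TABLE `{input_table}`\n"
        ");"
    )


# Hypothetical names, for illustration only.
print(create_remote_embedding_model_sql(
    "my_dataset.multimodal_model", "us.my_connection", "multimodalembedding@001"))
print(create_pca_model_sql("my_dataset.pca_model", "my_dataset.features", 4))
print(generate_embedding_sql("my_dataset.pca_model", "my_dataset.features"))
```

For the multimodal case, the input table would be an object table over the image or video files. For the PCA case, it would be a standard table with the same columns used for training.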

To help users leverage these new capabilities, BigQuery has provided several tutorials:

  • Generating image embeddings
  • Generating video embeddings
  • Generating text embeddings
  • Generating and searching multimodal embeddings

New AI Features

BigQuery ML has introduced two major AI features:

  1. Document Processing:
    • Users can create a remote model based on the Document AI API, specifying a document processor.
    • The ML.PROCESS_DOCUMENT function can be used with this remote model to process documents from BigQuery object tables.

  2. Audio Transcription:
    • A remote model based on the Speech-to-Text API can be created, specifying a speech recognizer.
    • The new ML.TRANSCRIBE function works with this remote model to transcribe audio files from BigQuery object tables.
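Both features use the same two steps: create a remote model that points at a processor or recognizer, then call the matching function on an object table. The Python sketch below assembles the SQL text for each step and does not connect to BigQuery. The model, connection, and table names and the resource paths are hypothetical placeholders. The `REMOTE_SERVICE_TYPE` values and option names reflect the BigQuery ML syntax as understood at the time of writing and should be treated as assumptions to verify against the current reference.

```python
# Maps a feature to (REMOTE_SERVICE_TYPE value, resource option name).
# These values are assumptions to verify against the BigQuery ML reference.
REMOTE_SERVICES = {
    "document": ("CLOUD_AI_DOCUMENT_V1", "DOCUMENT_PROCESSOR"),
    "speech": ("CLOUD_AI_SPEECH_TO_TEXT_V2", "SPEECH_RECOGNIZER"),
}


def create_remote_model_sql(kind: str, model: str, connection: str, resource: str) -> str:
    """Build CREATE MODEL text for a Document AI or Speech-to-Text remote model."""
    service_type, option_name = REMOTE_SERVICES[kind]
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"REMOTE WITH CONNECTION `{connection}`\n"
        f"OPTIONS (\n"
        f"  REMOTE_SERVICE_TYPE = '{service_type}',\n"
        f"  {option_name} = '{resource}'\n"
        f");"
    )


def process_document_sql(model: str, object_table: str) -> str:
    """Build a query that runs ML.PROCESS_DOCUMENT over an object table."""
    return (
        "SELECT *\n"
        "FROM ML.PROCESS_DOCUMENT(\n"
        f"  MODEL `{model}`,\n"
        f"  TABLE `{object_table}`\n"
        ");"
    )


def transcribe_sql(model: str, object_table: str) -> str:
    """Build a query that runs ML.TRANSCRIBE over an object table."""
    return (
        "SELECT *\n"
        "FROM ML.TRANSCRIBE(\n"
        f"  MODEL `{model}`,\n"
        f"  TABLE `{object_table}`\n"
        ");"
    )


# Hypothetical names and resource paths, for illustration only.
print(create_remote_model_sql(
    "document", "my_dataset.invoice_parser", "us.my_connection",
    "projects/PROJECT_NUMBER/locations/LOCATION/processors/PROCESSOR_ID"))
print(process_document_sql("my_dataset.invoice_parser", "my_dataset.invoice_files"))
print(transcribe_sql("my_dataset.transcriber", "my_dataset.audio_files"))
```

Keeping the service type and option name in one lookup table makes the parallel between the two features explicit: only the remote service and the resource it points to differ.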

Tutorials are available for both of these features:

  • Processing documents with the ML.PROCESS_DOCUMENT function
  • Transcribing audio files with the ML.TRANSCRIBE function

Availability

All these new embedding support features and AI capabilities are now generally available (GA) in BigQuery ML.

Together, these updates extend BigQuery ML from structured data to text, images, audio, and video, bringing embedding generation, document processing, and audio transcription directly into BigQuery's data warehousing and analytics platform.
