BigQuery Data Transfer Now Supports Incremental Teradata Migrations

August 30, 2024 at 8:18:08 AM

TL;DR Google's BigQuery Data Transfer Service now supports incremental transfers for Teradata migrations, offering a more efficient way to update BigQuery datasets. Initial transfers create a complete snapshot, while subsequent transfers use timestamps to track and transfer only new or modified data. Limitations include no support for syncing deleted rows. Benefits include efficient data updates, reduced downtime, and flexibility.

BigQuery Data Transfer Now Supports Incremental Teradata Migrations

Google has announced that the BigQuery Data Transfer Service now supports incremental transfers when migrating data from Teradata data warehouses to BigQuery. This feature has reached general availability (GA), offering users a more efficient way to keep their BigQuery datasets updated with changes from their Teradata sources.

Key Features of Incremental Transfers

Initial Transfer

  • The first transfer creates a complete table snapshot in BigQuery

Subsequent Transfers

  • Follow annotations defined in a custom schema file
  • Use timestamps to track and transfer only new or modified data

How Incremental Transfers Work

The incremental transfer process operates on a per-table basis, using the following logic:

  1. Timestamp Tracking

    • Each transfer run saves a timestamp
    • Subsequent runs use the previous run's timestamp (T1) and the current run's start time (T2)
  2. Table-Specific Behavior

    • Tables without a COMMIT_TIMESTAMP column are skipped
    • Tables with only a COMMIT_TIMESTAMP column:
      • Rows with timestamps between T1 and T2 are extracted and appended to the existing BigQuery table
    • Tables with both COMMIT_TIMESTAMP and PRIMARY_KEY columns:
      • Rows with timestamps between T1 and T2 are extracted
      • New rows are appended, and modified rows are updated in the existing BigQuery table

It's important to note that the incremental migration from Teradata does not support syncing deleted rows with BigQuery. Users should be aware of this limitation when planning their data migration strategy.

Benefits for Users

This update offers several advantages for organizations migrating from Teradata to BigQuery:

  1. Efficient Data Updates: Only new or modified data is transferred, reducing processing time and resource usage
  2. Reduced Downtime: Incremental transfers allow for more frequent updates with minimal impact on operations
  3. Flexibility: The custom schema file allows users to define how different tables should be handled during transfers

Conclusion

The addition of incremental transfers to the BigQuery Data Transfer Service for Teradata migration represents a significant improvement in Google's data migration toolset. This feature allows organizations to more easily keep their BigQuery datasets in sync with their Teradata sources, facilitating smoother transitions to cloud-based data warehousing and analytics.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources 👇

Related Posts

BigQuery Launches Vector Search and Vector Index Features

BigQuery Launches Vector Search and Vector Index Features

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Now Supports GROUP BY and SELECT DISTINCT with Arrays and Structs

BigQuery Now Supports GROUP BY and SELECT DISTINCT with Arrays and Structs

The Ultimate Google Analytics Audit Tool

The Ultimate Google Analytics Audit Tool

Sponsored
GA4 Auditor
GA4 Auditor

Verified Sponsor

Verified Sponsor

GA4 Auditor is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
BigQuery ML Integrates Anthropic Claude AI for Generative Text

BigQuery ML Integrates Anthropic Claude AI for Generative Text

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Introduces Python Code Completion with Gemini

BigQuery Introduces Python Code Completion with Gemini

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Enhances Anomaly Detection for Multivariate Time Series Models

BigQuery Enhances Anomaly Detection for Multivariate Time Series Models

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Featured
GA4 Auditor logo

GA4 Auditor

Automated GA4 audits with actionable insights

Data Analysis
GA4 SQL logo

GA4 SQL

Generate GA4 BigQuery queries easily

Data Analysis
TapClicks logo

TapClicks

Automated marketing solutions powered by your data

Data Engineering
Stitch logo

Stitch

Automated cloud data pipelines, no coding needed

Data Engineering
Akkio logo

Akkio

AI-powered analytics for agencies

Data Analysis
Databricks logo

Databricks

Generative AI-powered data intelligence platform

Data Engineering
NinjaCat logo

NinjaCat

AI-powered marketing data and analytics platform

Reporting
Funnel logo

Funnel

Aggregate and analyze marketing data seamlessly

Reporting
Fivetran logo

Fivetran

Effortlessly centralize and move data from any source

Data Engineering