How to Save 90% on BigQuery Storage Costs by Changing Default Configurations

May 27, 2024 at 4:26:25 PM

TL;DR Yerachmiel Feltzman, a senior Big Data Engineer at Tikal, reduced BigQuery storage costs by 90% for a client by changing a default configuration. BigQuery's time travel feature, enabled by default, can increase storage costs. Switching the storage billing model to physical bytes and reducing the time travel window to two days helped achieve these savings. Feltzman also discusses assessing cost savings and the necessity of time travel.

How to Save 90% on BigQuery Storage Costs by Changing Default Configurations

In an actual project, a senior Big Data Engineer at Tikal reduced BigQuery storage costs by 90% by changing a default configuration.

The key points include:

  • Default Configurations: Tools often come with default settings that may not be optimal for specific workflows.
  • BigQuery Time Travel: This feature allows retrieving older data states but increases storage costs.
  • Billing Models:
    • Physical Bytes: Includes time travel and fail-safe storage.
    • Logical Bytes: Excludes time travel and fail-safe storage.

Configuration Changes:

  • Set billing mode to physical bytes:
    ALTER SCHEMA my_landing_zone
    SET OPTIONS(storage_billing_model = 'PHYSICAL');
    
  • Reduce time travel window to 48 hours:
    ALTER SCHEMA my_landing_zone
    SET OPTIONS(max_time_travel_hours = 48);
    

Assessment Steps:

  1. Cost Savings: Use SQL queries to forecast costs and compare different billing models.
  2. Usage Needs: Determine if time travel is necessary for each dataset.
  3. Risks: Weigh the trade-offs between upfront storage costs and re-ingestion costs.
  4. Re-ingest Costs: Utilize free batch ingestion where possible.
  5. Re-ingest Complexity: Design pipelines for idempotent backfilling to simplify re-ingestion.

Conclusion:

  • Read Documentation: Understand vendor pricing structures.
  • Design for Cost Efficiency: Implement changes that balance cost savings with operational needs.

These adjustments resulted in significant cost savings while maintaining data integrity and operational efficiency.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

Gemini Assistant Launches in BigQuery Data Canvas for Enhanced Data Analytics

Gemini Assistant Launches in BigQuery Data Canvas for Enhanced Data Analytics

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Top-Notch Google Ads Audit Tool

Top-Notch Google Ads Audit Tool

Featured
BigQuery ML Now Supports Llama and Mistral Models for Advanced Text Generation

BigQuery ML Now Supports Llama and Mistral Models for Advanced Text Generation

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Introduces Metadata Caching for SQL Translation

BigQuery Introduces Metadata Caching for SQL Translation

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Data Transfer Service adds Google Analytics 4 reporting support Trending ️‍πŸ”₯

BigQuery Data Transfer Service adds Google Analytics 4 reporting support

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Updates BigQuery with New Pipelines, Version Control, and Claude AI Integration

Google Updates BigQuery with New Pipelines, Version Control, and Claude AI Integration

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Makes Analytics Hub Egress Controls and Data Clean Room Subscriptions Widely Available

BigQuery Makes Analytics Hub Egress Controls and Data Clean Room Subscriptions Widely Available

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source
BigQuery Data Transfer Service adds custom reports support for Google Ads

BigQuery Data Transfer Service adds custom reports support for Google Ads

Google Cloud
Google Cloud

Official Source

Official Source

Google Cloud is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us
Databricks logo

Databricks

Generative AI-powered data intelligence platform

Data Engineering
GA4 SQL logo

GA4 SQL

Verified Tool

Verified Tool

GA4 SQL is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Generate GA4 BigQuery queries easily

Data Analysis
TapClicks logo

TapClicks

Automated marketing solutions powered by your data

Data Engineering
Stitch logo

Stitch

Automated cloud data pipelines, no coding needed

Data Engineering
Akkio logo

Akkio

AI-powered analytics for agencies

Data Analysis
NinjaCat logo

NinjaCat

AI-powered marketing data and analytics platform

Reporting
Funnel logo

Funnel

Aggregate and analyze marketing data seamlessly

Reporting
Fivetran logo

Fivetran

Effortlessly centralize and move data from any source

Data Engineering

Get Featured Here

Showcase your tool in this list.

Contact Us