How to Save 90% on BigQuery Storage Costs by Changing Default Configurations

May 27, 2024 at 4:26:25 PM

In an actual project, a senior Big Data Engineer at Tikal reduced BigQuery storage costs by 90% by changing a default configuration.

The key points include:

  • Default Configurations: Tools often come with default settings that may not be optimal for specific workflows.
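  • BigQuery Time Travel: This feature allows retrieving older data states, but the retained data adds to storage costs when a dataset is billed on physical bytes.
  • Billing Models:
    • Physical Bytes: Bills compressed bytes and charges for time travel and fail-safe storage.
    • Logical Bytes: Bills uncompressed bytes; time travel and fail-safe storage are not charged.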
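
To make the time travel feature concrete, here is a minimal sketch of reading a table as it existed one hour ago. The table name `events` is hypothetical and is used only for illustration; the dataset name follows the examples in this article.

```sql
-- Read the state of a table as of one hour ago.
-- `my_landing_zone.events` is a hypothetical table name.
SELECT *
FROM my_landing_zone.events
  FOR SYSTEM_TIME AS OF TIMESTAMP_SUB(CURRENT_TIMESTAMP(), INTERVAL 1 HOUR);
```

The timestamp must fall inside the dataset's time travel window, so a shorter window limits how far back a query like this can reach.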

Configuration Changes:

  • Set billing mode to physical bytes:
    ALTER SCHEMA my_landing_zone
    SET OPTIONS(storage_billing_model = 'PHYSICAL');
    
  • Reduce time travel window to 48 hours (the default is 168 hours, or 7 days, and 48 hours is the minimum):
    ALTER SCHEMA my_landing_zone
    SET OPTIONS(max_time_travel_hours = 48);
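
After applying the changes above, it may be worth confirming that they took effect. The following is a sketch, assuming the dataset lives in the US multi-region (hence the `region-us` qualifier, which is an assumption and should match your dataset's location).

```sql
-- List the two options changed above for the dataset.
-- `region-us` is an assumption; use the region of your dataset.
SELECT
  schema_name,
  option_name,
  option_value
FROM `region-us`.INFORMATION_SCHEMA.SCHEMATA_OPTIONS
WHERE schema_name = 'my_landing_zone'
  AND option_name IN ('storage_billing_model', 'max_time_travel_hours');
```

Options that were never set explicitly may not appear in the result, so an empty result for an untouched dataset is not necessarily an error.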
    

Assessment Steps:

  1. Cost Savings: Use SQL queries to forecast costs and compare different billing models.
  2. Usage Needs: Determine if time travel is necessary for each dataset.
  3. Risks: Weigh the trade-offs between upfront storage costs and re-ingestion costs.
  4. Re-ingest Costs: Utilize free batch ingestion where possible.
  5. Re-ingest Complexity: Design pipelines for idempotent backfilling to simplify re-ingestion.
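
The forecast in step 1 can be sketched with a query over the `INFORMATION_SCHEMA.TABLE_STORAGE` view. This is an illustrative estimate rather than the original author's query: the `region-us` qualifier and the per-GiB monthly prices are assumptions and should be replaced with the values for your region.

```sql
-- Assumed prices in USD per GiB per month; check current pricing for your region.
DECLARE active_logical_price FLOAT64 DEFAULT 0.02;
DECLARE long_term_logical_price FLOAT64 DEFAULT 0.01;
DECLARE active_physical_price FLOAT64 DEFAULT 0.04;
DECLARE long_term_physical_price FLOAT64 DEFAULT 0.02;

WITH storage AS (
  SELECT
    table_schema AS dataset_name,
    -- Logical billing ignores deleted tables and time travel / fail-safe data.
    SUM(IF(deleted, 0, active_logical_bytes)) / POW(1024, 3) AS active_logical_gib,
    SUM(IF(deleted, 0, long_term_logical_bytes)) / POW(1024, 3) AS long_term_logical_gib,
    -- active_physical_bytes already includes time travel bytes.
    SUM(active_physical_bytes) / POW(1024, 3) AS active_physical_gib,
    SUM(long_term_physical_bytes) / POW(1024, 3) AS long_term_physical_gib,
    SUM(time_travel_physical_bytes) / POW(1024, 3) AS time_travel_gib,
    SUM(fail_safe_physical_bytes) / POW(1024, 3) AS fail_safe_gib
  FROM `region-us`.INFORMATION_SCHEMA.TABLE_STORAGE
  WHERE table_type = 'BASE TABLE'
  GROUP BY dataset_name
)
SELECT
  dataset_name,
  ROUND(active_logical_gib * active_logical_price
        + long_term_logical_gib * long_term_logical_price, 2) AS logical_cost_usd,
  -- Fail-safe bytes are billed at the active physical rate.
  ROUND((active_physical_gib + fail_safe_gib) * active_physical_price
        + long_term_physical_gib * long_term_physical_price, 2) AS physical_cost_usd,
  ROUND(time_travel_gib, 2) AS time_travel_gib
FROM storage
ORDER BY logical_cost_usd DESC;
```

Datasets where `physical_cost_usd` is well below `logical_cost_usd` are candidates for the physical billing model, and a large `time_travel_gib` value suggests that shortening the time travel window would reduce the physical bill further.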

Conclusion:

  • Read Documentation: Understand vendor pricing structures.
  • Design for Cost Efficiency: Implement changes that balance cost savings with operational needs.

These adjustments resulted in significant cost savings while maintaining data integrity and operational efficiency.
