Google's Gemini Training Data Sources: A Deep Dive into Tech Companies' Transparency

May 13, 2024 at 10:44:02 PM

TL;DR Google's Gemini uses public sources and Gemini Apps user data to enhance its offerings. The specifics of these sources and personal data protection are unclear. Tech companies often focus on user input data usage, hiding details about AI training datasets. The post calls for stronger transparency obligations on AI companies, suggesting a mandatory information sheet detailing training dataset's sources, composition, and biases.

Google's Gemini Training Data Sources: A Deep Dive into Tech Companies' Transparency

Google's Gemini uses information from publicly accessible sources and Gemini Apps information to improve and develop its products, services, and machine learning technologies. However, the specific sources and Google's approach to "publicly accessible" remain unclear.

The company does not provide details on what is included or excluded or the precautions taken to protect personally identifiable information. The lack of transparency in tech companies, especially concerning the training of their AI systems, raises questions.

These companies often focus on using user input data, sometimes allowing users to use the system anonymously and advising users not to input confidential information. However, they usually hide details about the training dataset, including data sources and the detailed composition of the data used to train their AI systems.

This lack of transparency is not unique to Google. For instance, OpenAI's Mira Murati could not specify Sora's training data sources in an interview. The author suggests a mandatory information sheet detailing the training dataset's sources, compositions, possible biases, etc., accessible from the product.

Luiza argues for stronger transparency obligations on AI companies to overcome this phase of the "AI economy".

Q&A

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources 👇

Luiza Jarovsky
Luiza Jarovsky

Top Creator

Top Privacy & Compliance Creator

Luiza Jarovsky is a Top Privacy & Compliance Creator. Part of Swipe Insight Select, a curated list of top creators.

Top Privacy & Compliance Creator
- Google's Gemini uses public sources and Gemini Apps user data to enhance its offerings. The specifics of these sources a... Visit Source Open external source URL

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

OpenAI launches o3 and o4-mini AI reasoning models with enhanced capabilities

OpenAI launches o3 and o4-mini AI reasoning models with enhanced capabilities

OpenAI
OpenAI

Official Source

Official Source

OpenAI is a Official Source. The source has been verified by Swipe Insight team.

Official Source
OpenAI developing X-like social media platform with ChatGPT integration

OpenAI developing X-like social media platform with ChatGPT integration

Tired of spending too much time creating audits for your clients?

Tired of spending too much time creating audits for your clients?

Featured
Gemini Advanced users can now create and share videos with Veo 2 starting today

Gemini Advanced users can now create and share videos with Veo 2 starting today

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
OpenAI launches GPT-4.1 with significant improvements in coding, instruction, and context

OpenAI launches GPT-4.1 with significant improvements in coding, instruction, and context

OpenAI
OpenAI

Official Source

Official Source

OpenAI is a Official Source. The source has been verified by Swipe Insight team.

Official Source
OpenAI introduces memory feature in ChatGPT to enhance personalized user interactions Trending ️‍🔥

OpenAI introduces memory feature in ChatGPT to enhance personalized user interactions

OpenAI
OpenAI

Official Source

Official Source

OpenAI is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google removes 240 million fake reviews in 2024 thanks to Gemini

Google removes 240 million fake reviews in 2024 thanks to Gemini

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google's Complete March AI Changelog: Gemini 2.5 Pro and Beyond

Google's Complete March AI Changelog: Gemini 2.5 Pro and Beyond

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us