Google Releases SynthID Text for Watermarking AI-Generated Content

October 24, 2024 at 5:20:20 PM

TL;DR Google has released SynthID Text, a technology for watermarking and detecting AI-generated text. Available on Hugging Face and Google's Responsible GenAI Toolkit, it helps identify AI-generated content. SynthID Text embeds digital watermarks into text, making it harder to misuse AI-generated content. The tool is open-sourced and can be integrated into various frameworks. It is robust to minor text changes but less effective on thorough rewrites.

Google Releases SynthID Text for Watermarking AI-Generated Content

Google has released SynthID Text, a technology that allows developers to watermark and detect text generated by AI models. This tool, now generally available, can be downloaded from Hugging Face and Google’s Responsible GenAI Toolkit. SynthID Text embeds digital watermarks into AI-generated content, making it possible to identify such content and mitigate misinformation and misattribution issues.

SynthID Technology

SynthID, developed by Google DeepMind, watermarks AI-generated images, audio, text, and video. The watermarking tool has been open-sourced to help developers and businesses identify AI-generated content. The technology is integrated into Hugging Face Transformers v4.46.0+ and is also available on GitHub.

Watermark Application

SynthID Text functions as a logits processor in the model's generation pipeline, using a pseudorandom g-function to encode watermarking information. Key parameters for watermarking configuration include:

Keys Parameter: A list of unique, random integers used to compute g-function scores.
ngram_len Parameter: Balances robustness and detectability, with a default value of 5.

Additional configurations include:

Sampling Table: Configured by sampling_table_size and sampling_table_seed.
Context History: Repeated n-grams in preceding tokens are not watermarked to improve detectability.

No additional training is required; a watermarking configuration is passed to the model's .generate() method to activate the SynthID Text logit processor.

Google Releases SynthID Text for Watermarking AI-Generated Content

Watermark Detection and Verifiability

Watermark detection is probabilistic, using a Bayesian detector available with Hugging Face Transformers and on GitHub. The detector can output three states: watermarked, not watermarked, or uncertain. Detection thresholds can be tuned to achieve specific false positive and false negative rates.

Detection verification approaches include:

Fully-Private: No release or exposure of the detector.
Semi-Private: Exposes the detector through an API.
Public: Releases the detector for public use.

Limitations

SynthID Text watermarks are robust to minor transformations but have limitations:

Less effective on factual responses.
Reduced detector confidence when text is thoroughly rewritten or translated.

SynthID Text is not designed to stop motivated adversaries but can make malicious use of AI-generated content more difficult and can be combined with other approaches for better coverage.

Licensing

Content is licensed under the Creative Commons Attribution 4.0 License, and code samples under the Apache 2.0 License. For more details, refer to Google Developers Site Policies.