Bluesky Allows Third-Parties to Scrape Data for AI Training

November 27, 2024 at 5:18:20 PM

Bluesky Allows Third-Parties to Scrape Data for AI Training

Bluesky’s open API allows third-parties to scrape user data for AI training. Although Bluesky itself isn't using user content for AI training, others can access and use this data. A report by 404 Media revealed that a machine learning librarian at Hugging Face extracted 1 million public posts from Bluesky via its Firehose API for research purposes. This dataset was later removed due to controversy, highlighting that public posts on Bluesky are accessible to anyone.

Bluesky is exploring ways to let users communicate their consent preferences externally, but it cannot enforce these preferences outside its systems. The company stated that respecting these settings is up to external developers. Bluesky is in discussions with engineers and lawyers and plans to provide updates soon.

As Bluesky gains popularity, it faces the same scrutiny as other major social platforms.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Tools

Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us