Applebot-Extended: Web Publishers Can Opt Out of AI Training via Robots.txt

June 14, 2024 at 10:18:02 PM

TL;DR Apple's new documentation explains how web publishers can block Applebot-Extended via robots.txt to prevent their content from training Apple's AI models. Apple relies on licensed and public data, not private data. Applebot supports meta tags to control indexing and snippets. Applebot-Extended offers additional control but does not crawl pages. Allowing Applebot-Extended can enhance AI model quality.

Applebot-Extended: Web Publishers Can Opt Out of AI Training via Robots.txt

Apple has released new documentation regarding the ability to block Applebot-Extended, allowing web publishers to opt out of having their website content used to train Apple’s foundation models for generative AI features. Apple emphasizes that it does not use private user data or interactions for training, relying instead on licensed materials and publicly available data.

Customizing Indexing Rules for Applebot

Applebot supports various robots meta tags in HTML documents to control indexing:

  • noindex: Prevents the page from being indexed.
  • nosnippet: Prevents generating a description or web answer for the page.
  • nofollow: Prevents following any links on the page.
  • none: Combines noindex, nosnippet, and nofollow.
  • all: Allows indexing, snippet generation, and link following.

Multiple directives can be combined in a single meta tag using a comma-separated list or multiple meta tags.

Controlling Data Usage

Apple provides an additional user agent, Applebot-Extended, which gives web publishers more control over how their content is used. To opt out, add the following rule in robots.txt:

User-agent: Applebot-Extended
Disallow: /private/

Applebot-Extended does not crawl webpages but determines how the data crawled by Applebot is used. Allowing Applebot-Extended can help improve Apple’s generative AI models.

About Search Rankings

Apple Search considers several factors for ranking web search results:

  • Aggregated user engagement
  • Relevancy and matching of search terms
  • Number and quality of links
  • User location-based signals
  • Webpage design characteristics

For more details, check out Apple Documentation.

Q&A

Have more questions on this topic? Ask our AI assistant for in-depth insights.

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

AI marketing workflows made simple

AI marketing workflows made simple

Featured
Markifact
Markifact

Verified Sponsor

Verified Sponsor

Markifact is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
Google updates Merchant Center documentation on energy efficiency and certification

Google updates Merchant Center documentation on energy efficiency and certification

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google to Drop Support for Special Announcement Structured Data by July 2025

Google to Drop Support for Special Announcement Structured Data by July 2025

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google to Redirect Country Code Domains to Google.com Trending ️‍🔥

Google to Redirect Country Code Domains to Google.com

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Expands Availability for Structured Data Carousels in EEA Countries

Google Expands Availability for Structured Data Carousels in EEA Countries

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Search Console API Adds Support for Hourly Data Trending ️‍🔥

Google Search Console API Adds Support for Hourly Data

Google Search Central
Google Search Central

Official Source

Official Source

Google Search Central is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Discover will expand to desktop

Google Discover will expand to desktop

Google Search Console Renames Shopping Tab Report to Merchant Opportunities

Google Search Console Renames Shopping Tab Report to Merchant Opportunities

Google Search Central
Google Search Central

Official Source

Official Source

Google Search Central is a Official Source. The source has been verified by Swipe Insight team.

Official Source

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us
Ahrefs logo

Ahrefs

SEO tools to boost traffic and rank higher

SEO
Lighthouse logo

Lighthouse

Automated insights for web performance and SEO

SEO
Surfer SEO logo

Surfer SEO

SEO content creation and optimization made easy

SEO
Sitebulb logo

Sitebulb

Efficient website crawler for better SEO audits

SEO
Screpy logo

Screpy

AI-Powered SEO and Web Analysis Simplified

SEO
Blogify logo

Blogify

Convert multimedia to SEO-optimized blogs fast

SEO
Answer the Public logo

Answer the Public

Unlock Consumer Insights for Content Creation

SEO
SEO Writing AI logo

SEO Writing AI

AI-powered SEO content in 1 click

SEO

Get Featured Here

Showcase your tool in this list.

Contact Us