White-hat distribution templates

Copyable AI SEO community answers for legal free traffic

Use these templates when a thread is directly relevant. The safe pattern is simple: answer the person's question first, disclose affiliation when appropriate, then link to the most useful asset with UTM tracking. Do not ask for fake searches, fake clicks, votes, or reposts.

Rule: if the answer would not still be useful without the link, do not post it. That little test keeps us on the right side of useful community distribution.

Copy templates

llms.txt setup question

I would treat llms.txt as a curated public map, not as a ranking hack.

The useful version usually has:
- one short site summary
- canonical public pages
- notes about what not to infer
- no private URLs, customer data, API keys, or hidden claims

I made a free generator and examples here if useful:
https://llmstxtkit.com/?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=llms-txt-question

OAI-SearchBot vs GPTBot

Small distinction that matters: OpenAI documents OAI-SearchBot for ChatGPT search features, while GPTBot is documented separately for crawling that may be used to improve foundation models.

So I would not use one broad robots.txt block unless you really mean to block both use cases. Keep search visibility and training-use policy separate.

Short comparison with examples:
https://llmstxtkit.com/guides/gptbot-vs-oai-searchbot.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=oai-searchbot-vs-gptbot

robots.txt checker

Before rewriting robots.txt, I would check what your current file actually allows and blocks.

The common mistake is blocking the wrong token, or using broad rules that accidentally affect search/answer visibility.

Paste-in checker for Googlebot, OAI-SearchBot, GPTBot, Google-Extended, Applebot, PerplexityBot, and CCBot:
https://llmstxtkit.com/tools/ai-crawler-robots-txt-checker.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=robots-checker

allow search, control training

If the goal is "keep search visibility, but control AI training use", I would split the policy instead of using one broad block.

Usually:
- keep Googlebot open for Google Search
- keep OAI-SearchBot open if ChatGPT search visibility matters
- decide separately on GPTBot, Google-Extended, Applebot-Extended, and CCBot
- keep private content behind login, not just robots.txt

Free generator with policy presets:
https://llmstxtkit.com/tools/ai-robots-txt-generator.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=robots-generator

Shopify store

For a Shopify store, I would not list every product in llms.txt.

I would list the pages that explain the store best:
- top collection
- flagship product
- shipping policy
- returns policy
- sizing or buying guide

That creates a cleaner public map than a giant catalog dump.

Shopify template:
https://llmstxtkit.com/guides/llms-txt-for-shopify.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=shopify

WordPress site

For WordPress, the trick is to avoid turning llms.txt into another archive.

I would include cornerstone pages, updated guides, author/about, editorial policy, contact pages, and service pages. Tags and old archives usually add more noise than clarity.

WordPress template:
https://llmstxtkit.com/guides/llms-txt-for-wordpress.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=wordpress

policy badge

If you want to show that AI crawler policy was reviewed, I would use a small nofollow badge rather than trying to force dofollow backlinks.

Use the badge only if the claim is true:
- robots.txt checked
- sitemap returns 200
- llms.txt reviewed if you publish one
- private/admin/account paths are not exposed

Badge generator:
https://llmstxtkit.com/tools/ai-crawler-policy-badge.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=policy-badge

IndexNow after launch

For Bing and other IndexNow participants, you can notify URL changes after launch, but I would not treat it as a ranking guarantee.

The safe sequence:
- publish the final HTTPS site
- host the root IndexNow key file
- confirm sitemap URLs return 200
- submit the URL list
- measure indexing separately

IndexNow handoff notes:
https://llmstxtkit.com/research/free-traffic-tricks-that-work.html?utm_source=community&utm_medium=answer&utm_campaign=first-distribution&utm_content=indexnow

Use with proof

Proof path: after posting a relevant answer, check the proof dashboard for UTM source, referrer, and activation events. Real visits count. Fake search loops do not.