Create robots.txt
Browse files- robots.txt +40 -0
robots.txt
ADDED
|
@@ -0,0 +1,40 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# robots.txt - TEQUMSA Sovereign AI Infrastructure
|
| 2 |
+
# Optimized for AI discovery + human search
|
| 3 |
+
# LAI-TEQUMSA | huggingface.co/LAI-TEQUMSA
|
| 4 |
+
|
| 5 |
+
User-agent: *
|
| 6 |
+
Allow: /
|
| 7 |
+
Disallow: /private/
|
| 8 |
+
Disallow: /admin/
|
| 9 |
+
Crawl-delay: 1
|
| 10 |
+
|
| 11 |
+
# AI/LLM Crawlers - Full Access + Priority
|
| 12 |
+
User-agent: GPTBot
|
| 13 |
+
Allow: /
|
| 14 |
+
Allow: /datasets/
|
| 15 |
+
Allow: /spaces/
|
| 16 |
+
|
| 17 |
+
User-agent: Google-Extended
|
| 18 |
+
Allow: /
|
| 19 |
+
Sitemap: https://huggingface.co/spaces/LAI-TEQUMSA/TEQUMSA-SEO-GEO-Infrastructure/resolve/main/sitemap.xml
|
| 20 |
+
|
| 21 |
+
User-agent: ClaudeBot
|
| 22 |
+
Allow: /
|
| 23 |
+
|
| 24 |
+
User-agent: PerplexityBot
|
| 25 |
+
Allow: /
|
| 26 |
+
|
| 27 |
+
User-agent: anthropic-ai
|
| 28 |
+
Allow: /
|
| 29 |
+
|
| 30 |
+
User-agent: cohere-ai
|
| 31 |
+
Allow: /
|
| 32 |
+
|
| 33 |
+
User-agent: *Bot
|
| 34 |
+
Allow: /collections/Mbanksbey/tequmsa
|
| 35 |
+
Allow: /models?other=tequmsa
|
| 36 |
+
Allow: /datasets?other=tequmsa
|
| 37 |
+
Allow: /spaces?other=tequmsa
|
| 38 |
+
|
| 39 |
+
# Sitemap Reference
|
| 40 |
+
Sitemap: https://huggingface.co/spaces/LAI-TEQUMSA/TEQUMSA-SEO-GEO-Infrastructure/resolve/main/sitemap.xml
|