What Is llms.txt? The AI Crawler Convention Explained
llms.txt is a proposed plain-text file at the root of your site that points AI models and crawlers at your most important content in a clean, readable form. Here is what it is, how it works, and why it matters for AEO.
The Short Answer
llms.txt is a proposed convention: a plain-text Markdown file placed at the root of your website, at yoursite.com/llms.txt, that points AI models and crawlers at your most important content in a clean, readable form. It works like a curated map for language models, the way robots.txt is a map for search crawlers and sitemap.xml is a map for indexers. It matters for AEO because it lowers the friction for AI engines to find and understand the pages you most want cited.
What the File Is
The llms.txt file is a simple Markdown document you host at your domain root. It typically opens with your site's name and a short description, then lists your key pages as links with brief notes on what each covers. Because it is Markdown, both humans and language models can read it easily, and because it lives at a predictable path, tools know where to look for it.
The proposal emerged in 2024 as a way to help large language models work with website content, which is often cluttered with navigation, scripts, and markup that get in the way of the actual information. llms.txt strips that away and hands the model a clean, curated index.
How It Relates to robots.txt and sitemap.xml
It is easiest to understand llms.txt by comparison:
| File | Audience | Purpose |
|---|---|---|
| robots.txt | Search and other crawlers | Tells crawlers what they may and may not access |
| sitemap.xml | Search indexers | Lists all URLs so they can be discovered and indexed |
| llms.txt | AI models and crawlers | Curates and describes your most important content for language models |
robots.txt controls access. sitemap.xml lists everything. llms.txt curates and explains the subset that matters most, in a format a model can read directly.
Why It Matters for AEO
AI engines cite sources they can find and parse cleanly. An llms.txt file helps on both counts:
- Discovery. It points crawlers straight at your most important pages instead of leaving them to infer priority from a sprawling site.
- Clarity. The Markdown format and short descriptions remove the ambiguity of a page wrapped in navigation and scripts.
- Curation. You decide which pages represent you, so the model sees your best, most citable content first.
It is not a magic switch. Support across engines is still uneven and evolving, and llms.txt is one input among the larger citation drivers of structure, credibility, freshness, and schema. But it is low-cost to add and it removes friction, which is why it has become a standard part of AEO checklists.
What a Basic llms.txt Looks Like
The format is deliberately simple. It starts with a top-level heading for your site name, an optional short summary, and then sections of links with descriptions. A minimal version might list your homepage, your core product or service pages, your key guides, and your contact or about page, each as a Markdown link with a one-line note. Some sites also publish an expanded version that includes fuller content for models that want more detail.
Should You Add One?
For most sites pursuing AEO, yes. It is inexpensive to create, it cannot hurt your traditional SEO, and it signals that you have thought about how AI systems consume your content. Keep it current as your important pages change, and treat it as one part of a broader AEO program rather than a standalone fix.
Frequently Asked
Is llms.txt an official standard?
It is a proposed convention that gained real adoption, not a formally ratified web standard. It emerged in 2024 and has been taken up by many sites and tools, but engine support varies and is still evolving. Treat it as a widely used best practice rather than a guaranteed, universally honored rule.
Do AI engines actually read llms.txt?
Support is uneven and changing. Some tools and crawlers look for it; others do not yet. Because it is cheap to add and removes parsing friction, it is worth including even while adoption matures, but do not rely on it as your only AEO move.
How is llms.txt different from robots.txt?
robots.txt tells crawlers what they are allowed to access. llms.txt does the opposite kind of job: it curates and describes your most important content for language models in a clean, readable format. One controls access, the other guides attention.
Where do I put the llms.txt file?
At the root of your domain, so it is reachable at yoursite.com/llms.txt, the same location convention as robots.txt. That predictable path is how tools know where to find it.
The Bottom Line
llms.txt is a plain-text, root-level file that hands AI models a clean, curated map of your best content. It is the AI-era companion to robots.txt and sitemap.xml, cheap to add and useful for reducing the friction that keeps engines from finding and citing your key pages. Add one, keep it current, and treat it as one part of a full AEO program.
Are AI assistants recommending you?
Run a free AI visibility audit and see how often ChatGPT, Claude, and Gemini recommend your brand when buyers ask.
Run your free audit