A new standard for AI: llms.txt
Traditional files like robots.txt and sitemap.xml help search engines understand your site structure — but they weren’t built with large language models (LLMs) in mind. That’s where llmstxt.org comes in. It's a growing standard tailored specifically for LLMs, offering content in a format that’s easier for AI to read and reason about.
As LLMs become a more common way developers and users access documentation, clarity and structure are more important than ever. Parsing raw HTML often leads to messy results: cluttered navigation, JavaScript, styling tags — all noise from the perspective of an AI model.
In fact, we may already be at the point where LLMs are consuming developer docs more than humans do. Andrej Karpathy even called this shift out in a recent post.
The llms.txt spec introduces two files:
/llms.txt – a lightweight, structured index of your docs, similar in spirit to sitemap.xml, but more markdown-friendly.
/llms-full.txt – a single, comprehensive text dump of all your documentation, ready for ingestion.
In addition, the specification recommends that websites offering content potentially useful to LLMs also provide a clean Markdown version of each page. This version should be accessible at the same URL as the original page, with .md appended.
By using these, documentation updates become much easier to manage, especially for tools that rely on LLMs to serve answers and insights.