28 points ericciarla | 1 comments | 21 Nov 24 19:33 UTC | HN request time: 0.201s
 | source Hey HN! It’s Eric from Firecrawl (
https://firecrawl.dev).
I just launched llms.txt Generator, a tool that transforms any website into a clean, structured text file optimized for feeding to LLMs. You can learn more about the standard at https://llmstxt.org.
Here’s how it works under the hood:
1. We use Firecrawl, our open-source scraper, to fetch the full site, handling JavaScript-heavy pages and complex structures.
2. The markdown content is parsed and then the title and description are extracted using GPT-4o-mini.
3. The everything is combined and the result is a lightweight llms.txt file that you can paste into any LLM.
Let me know what you think!