Prefix any URL with `pure.md/` to reliably convert web pages into markdown
In 2025, you'll frequently encounter 401s and 403s when requesting an HTML page with fetch or curl.
pure.md is a REST API that lets developers reliably access web content. With pure.md, you can:
Avoid bot detection by mimicking real user behavior
Render JavaScript-heavy websites, PDFs, images, and files
Scrape web pages into markdown optimized for an LLM
Crawl search engines for up-to-date knowledge
Extract JSON from web pages using natural language
It's almost too easy to get started: simply prefix a given URL with `https://pure.md/`. I use it myself to easily pipe webpage content directly into an LLM, knowing that all the superfluous HTML will be stripped out so that the input token count remains low. You can use it reliably in production -- behind the scenes, traffic gets emulated as a real browser and routed through a dynamic proxy pool.
What do you think? Happy to answer any questions you may have!
Replies
Blink Copilot
Have seen pure.md demonstrated live and it blew my mind. Congrats @jasonbarry!
FeaturePeek
@johnson_brad you're too kind! 💛