Jason Barry

Prefix any URL with `pure.md/` to reliably convert web pages into markdown

In 2025, you'll frequently encounter 401s and 403s when requesting an HTML page with fetch or curl.

pure.md is a REST API that lets developers reliably access web content. With pure.md, you can:

  • Avoid bot detection by mimicking real user behavior

  • Render JavaScript-heavy websites, PDFs, images, and files

  • Scrape web pages into markdown optimized for an LLM

  • Crawl search engines for up-to-date knowledge

  • Extract JSON from web pages using natural language

It's almost too easy to get started: simply prefix a given URL with `https://pure.md/`. I use it myself to easily pipe webpage content directly into an LLM, knowing that all the superfluous HTML will be stripped out so that the input token count remains low. You can use it reliably in production -- behind the scenes, traffic gets emulated as a real browser and routed through a dynamic proxy pool.

What do you think? Happy to answer any questions you may have!

15 views

Add a comment

Replies

Best
Brad Johnson

Have seen pure.md demonstrated live and it blew my mind. Congrats @jasonbarry!

Jason Barry

@johnson_brad you're too kind! 💛