PromptPerf

Data-driven AI tuning. Stop guessing, save time/$.

4.0 • 1 review

121 followers

LLMs • ChatGPT Prompts • A/B testing tools
LLMs change fast: GPT-4 updates silently, models vanish, and prompts break. PromptPerf helps you stay ahead by testing a prompt across GPT-4o, GPT-4, and GPT-3.5 and comparing each output to your expected result using similarity scoring.
✅ 3 test cases per run, unlimited runs
✅ CSV export
✅ Built-in scoring
More models and batch runs coming soon. One feature per 100 users. Built solo. Feedback welcome 🙏 promptperf.dev
Company Info: promptperf.dev
Launched in 2025
Forum: p/promptperf
© 2025 Product Hunt
Similar Products

ChatGPT by OpenAI
Get answers. Find inspiration. Be more productive.
4.8 (1.2K reviews)
AI • LLMs
OpenAI
APIs and tools for building AI products
4.9 (657 reviews)
LLMs • AI Chatbots
Claude by Anthropic
A family of foundational AI models
4.9 (586 reviews)
LLMs • AI Chatbots
PostHog
The open source product OS
4.9 (165 reviews)
Data analysis tools • Website analytics
Gemini
Google's answer to GPT-4
4.8 (136 reviews)
LLMs • AI Chatbots
Free
Launch tags: A/B Testing • Artificial Intelligence • Data & Analytics
Launch Team / Built With
Harshil Siyani
Supabase
Vercel
Laravel

Harshil Siyani
PromptPerf

Maker
📌

As an AI developer, I spend a lot of time running prompts across different models and configs, tweaking temperature, comparing outputs, and manually checking which one gets it right.

It’s repetitive. Time-consuming. And easy to mess up.


So I built PromptPerf, a tool that tests a single prompt across GPT-4o, GPT-4, and GPT-3.5, runs it multiple times, and compares the results to your expected output using similarity scoring.
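To make the idea of similarity scoring concrete, here is a minimal sketch in Python. It is an illustration only: the post does not say which similarity metric PromptPerf uses, so this assumes a simple character-level ratio from the standard library's `difflib`. The function names and the hard-coded sample outputs are hypothetical stand-ins for real API responses.

```python
from difflib import SequenceMatcher


def similarity(expected: str, actual: str) -> float:
    """Return a 0..1 ratio describing how closely `actual` matches `expected`.

    Assumption: a case-insensitive character-level ratio. PromptPerf's real
    metric is not described in the post and may differ.
    """
    return SequenceMatcher(None, expected.strip().lower(), actual.strip().lower()).ratio()


def rank_models(expected: str, outputs: dict[str, str]) -> list[tuple[str, float]]:
    """Score each model's output against the expected text, best match first."""
    scored = [(model, similarity(expected, text)) for model, text in outputs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical outputs standing in for responses from each model.
sample_outputs = {
    "gpt-4o": "Paris",
    "gpt-4": "The capital of France is Paris.",
    "gpt-3.5": "I think it is Lyon.",
}

ranking = rank_models("Paris", sample_outputs)
for model, score in ranking:
    print(f"{model}: {score:.2f}")
```

A character-level ratio rewards outputs that match the expected text almost verbatim. For longer or free-form answers, an embedding-based similarity would likely be a better fit, at the cost of an extra model call.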


⚡ No more guessing which prompt or model is better
⚡ No more switching between tabs
⚡ Just clean, fast feedback and a CSV if you want it


This started as a scratch-my-own-itch tool, but now I’m opening it up to anyone building with LLMs.


Unlimited free runs. More models coming soon. Feedback shapes the roadmap.


I would love to hear what you think! I'm keen on feedback and help to make sure I build a product that solves your problems.
👉 promptperf.dev

4mo ago
Neel Patel 🦕

Whoa! This looks interesting!

4mo ago
Harshil Siyani
PromptPerf

Maker

@neelptl2602 Thanks, Neel. I plan to add models from Claude, Gemini, and others soon, so prompts can be evaluated across models and at different temperatures.

4mo ago
Chris Pitchford
Brev

This is super useful, thanks for building this.

4mo ago
Harshil Siyani
PromptPerf

Maker
@seepitch Thanks. I'm planning to get user feedback on whether it's easier for users to add their own API key or whether I should provide credits for running tests. So far, a lot of the users signing up are not running an evaluation, because it requires an extra step: getting their API key and coming back (friction).
3mo ago

Forum Threads

p/promptperf
Harshil Siyani • 3mo ago

A Big Thank You! and a Big Ask

Thank you, everyone, for the support. I have received nearly 40 signups and 1 paid user, which is massive for me as I am still in the early stages of validating the product.
Next steps: even though signups are coming in, I am tracking usage of the app and I don't see many users running evaluations, so I need help. How should I get you to try and test the product?
My current thoughts are:
- Requiring users to bring their own API keys adds friction, and they leave the platform because they can't test it immediately. So perhaps offer a free trial on my API key, with unlimited runs of 3 test cases.

- Create an onboarding guide, like the ones in enterprise software that say "Click here" and "Next steps". What tool can I use for this?

For both options above, I will still need to inform users about the updates and hope they sign in again.

- Reach out to all 40 users for a 1:1 15-minute call and show them the product. Assuming 30% respond, that's 12 calls booked.
Do you have any suggestions? I'm keen on feedback. This is critical, as I need to solve these issues before building the next features (i.e., adding more models and multi-model runs).

Reviews
4.0 (based on 1 review)
Ajay Sahoo
Launching soon!
•350 reviews
An insightful interface. I'm hoping for more new features as it evolves.
3mo ago
Harshil Siyani
PromptPerf

Thank you, Ajay, for the review. The next phase is to add support for multiple models, so prompts can be tested against them and compared. After that comes the ability to automatically run prompts against multiple models and temperatures with 3, 5, or 10 runs each, to check that the same prompt at the same temperature on the same model gives consistent results (giving you an accuracy/consistency score for your prompt).
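One way such a consistency score could be computed is sketched below in Python. This is an assumption, not the planned implementation: it treats consistency as the mean pairwise similarity between repeated outputs of the same prompt, using `difflib` from the standard library. The function name `consistency_score` and the sample runs are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations
from statistics import mean


def consistency_score(outputs: list[str]) -> float:
    """Mean pairwise similarity (0..1) across repeated runs of one prompt.

    Assumption: consistency means "how alike are the runs to each other".
    Fewer than two runs are trivially consistent.
    """
    if len(outputs) < 2:
        return 1.0
    return mean(SequenceMatcher(None, a, b).ratio() for a, b in combinations(outputs, 2))


# Hypothetical outputs from three runs at the same model and temperature.
stable_runs = ["42", "42", "42"]
unstable_runs = ["abc", "xyz"]

print(consistency_score(stable_runs))
print(consistency_score(unstable_runs))
```

Running the same function over 3, 5, or 10 outputs per model and temperature would give one comparable number per configuration. An accuracy score against the expected output would be a separate measurement.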

3mo ago