Product Hunt logo dark
  • Launches
    Launch archive
    Most-loved launches by the community
    Launch Guide
    Checklists and pro tips for launching
  • Products
  • News
    Newsletter
    The best of Product Hunt, every day
    Stories
    Tech news, interviews, and tips from makers
    Changelog
    New Product Hunt features and releases
  • Forums
    Forums
    Ask questions, find support, and connect
    Streaks
    The most active community members
    Events
    Meet others online and in-person
  • Advertise
Subscribe
Sign in
HomeRecent commentsSearch all threadsStart new thread

Topic Forums

Forum General categoryp/generalForum Vibecoding categoryp/vibecodingForum AMA categoryp/amaForum Introduce yourself categoryp/introduce-yourselfForum Self-Promotion categoryp/self-promotion

Product Forums

Y Combinatorp/ycMigma AIp/migma-aiAprilp/april-yc-s25Applep/applep/meet-tingWarpp/warpGitArsenalp/gitarsenalDatastripesp/datastripesDevDiaryp/flowmev0 by Vercelp/v0Socials by DevVoidp/socials-by-devvoidDad Replyp/dad-replyWispr Flowp/wisprflowSlashit Appp/slashit-appZedp/zedReaddit Laterp/readdit-laterConfe.iop/confe-ioBroxi AIp/broxi-aiSuperhumanp/superhumanp/albato
recent
GPT-5

p/gpt-5

by

Aaron O'Leary

Aaron O'Leary

•17d ago
GPT-5: Not the AGI Messiah, but still pretty impressive
... like button. Benchmarks, benchmarks, benchmarks Benchmarks should always be taken with a grain of salt . They are effectively a snapshot of a models capabilities under near perfect conditions. Sort of like the Big Mac you see on the ads vs the Big Mac you get in the bag. It gives you a good idea, but they're far from a perfect measure of real world usage. Math (AIME 2025, no tools): 94.6 percent Real-world coding (SWE-bench Verified ... ... Multilingual programming (Aider Polyglot): 88.0 percent Multimodal understanding (MMMU): 84.2 percent Medical reasoning (HealthBench Hard): 46.2 percent Graduate-level logic (GPQA without tools, via GPT-Thinking Pro): 88.4 percent In production, GPT-5 Thinking cuts hallucinations by 45 percent versus

11

Subscribe
Sign in
Top Product Categories

Engineering & Development

  • Vibe Coding Tools
  • AI Coding Assistants
  • No-code platforms
  • AI coding agents

AI

  • AI Chatbots
  • LLMs
  • AI Infrastructure Tools
  • AI Voice Agents
  • AI Generative Art

Work & Productivity

  • AI notetakers
  • Note and writing apps
  • Team collaboration software
  • Search

Marketing & Sales

  • Lead generation software
  • Marketing automation platforms
  • Business intelligence software

Design & Creative

  • Video editing
  • Design resources
  • Graphic design tools

Social & Community

  • Social Networking
  • Professional networking platforms
  • Community management

Finance

  • Accounting software
  • Fundraising resources
  • Investing

Product add-ons

  • Figma Plugins
  • Chrome Extensions
See All Categories >>

Trending categories

  • Vibe Coding Tools
  • AI coding agents
  • AI Dictation Apps
  • AI notetakers
  • Code Review Tools
  • No-code platforms
  • Figma Plugins
  • Static site generators

Top reviewed

  • Lovable
  • n8n
  • Attio
  • PostHog
  • Vapi
  • Granola
  • Raycast
  • Supabase

Trending products

  • Lovable
  • Screen Studio
  • bolt.new
  • Wispr Flow
  • Framer
  • Replit
  • Vapi
  • Granola

Top forum threads

  • Cursor or Claude Code?
  • POLL: Domain or product first?
  • YC deadline in <2 weeks; Who's applying?
  • We Got into YC, Got Kicked Out, and Fought Our Way Back
  • How Wispr Flow found PMF through pivot
  • Best Vibe Coding tool so far?
  • Landing page roast - 48 hours only
  • Fix your tagline with the PH CEO
© 2025 Product Hunt
BlogNewsletterAppsAboutFAQTermsPrivacy & CookiesAdvertise