Product Hunt logo dark
  • Launches
    Coming soon
    Upcoming launches to watch
    Launch archive
    Most-loved launches by the community
    Launch Guide
    Checklists and pro tips for launching
  • Products
  • News
    Newsletter
    The best of Product Hunt, every day
    Stories
    Tech news, interviews, and tips from makers
    Changelog
    New Product Hunt features and releases
  • Forums
    Forums
    Ask questions, find support, and connect
    Streaks
    The most active community members
    Events
    Meet others online and in-person
  • Advertise
Subscribe
Sign in
Subscribe
Sign in
Trieve

Trieve

All-in-one AI Infrastructure Suite

5.0
•2 reviews•

285 followers

All-in-one AI Infrastructure Suite

5.0
•2 reviews•

285 followers

Visit website
Search
•
AI Infrastructure Tools
All-in-one solution for building search, discovery, and RAG combining leading search language models + tools for tuning quality
  • Overview
  • Launches3
  • Reviews2
  • Team
  • Awards
  • More
Company Info
trieve.aiGitHub
Trieve Info
Y Combinator
Launched in 2024View 3 launches
Forum
p/trieve-site-search-chat
  • Blog
  • •
  • Newsletter
  • •
  • Questions
  • •
  • Forums
  • •
  • Product Categories
  • •
  • Apps
  • •
  • About
  • •
  • FAQ
  • •
  • Terms
  • •
  • Privacy and Cookies
  • •
  • X.com
  • •
  • Facebook
  • •
  • Instagram
  • •
  • LinkedIn
  • •
  • YouTube
  • •
  • Advertise
© 2025 Product Hunt
SocialX
This is the 3rd launch from Trieve. View more
Trieve Vector Inference

Trieve Vector Inference

Deploy fast, unmetered embedding inference in your own VPC
TVI is an in-VPC solution for fast, unmetered embedding inference. Get fastest-in-class embeddings using any private, custom, or open-source models from dedicated embedding servers hosted in your own cloud. Battle-tested by billions of documents and queries.
Trieve Vector Inference gallery image
Trieve Vector Inference gallery image
Trieve Vector Inference gallery image
Trieve Vector Inference gallery image
Trieve Vector Inference gallery image
Free Options
Launch tags:
API•Developer Tools•Artificial Intelligence
Launch Team / Built With
Federico Chávez TorresDenzell F
Mintlify
AWS
kubernetes

What do you think? …

Federico Chávez Torres
Federico Chávez Torres
Trieve

Trieve

Maker
📌
Hello y'all, My name is Fede, I am the least technical member of Trieve and proud to announce the launch of our standalone embedding and reranking inference product, Trieve Vector Inference, on Product Hunt. We've been building AI applications together since late 2022. As we matured and eventually pivoted hard into building infrastructure, we quickly learned what we could and could not control. There were two major bottlenecks to being the performant end-to-end API we are today. The most important one of these was embedding and reranking inference. Building AI features at scale exposes two critical limitations of cloud embedding APIs: high latency and rate limits. Modern AI applications require better infrastructure. The platform supports any embedding model, whether it’s your own custom model, a private model, or popular open-source options. You get the flexibility to choose the right model for your use case while maintaining complete control over your infrastructure. We put together TVI to eliminate these bottlenecks for our own core product. It’s served billions of queries across billions of documents. After requests from others, we’ve sanded it down, wrote up some docs, and are now making it available for all. You can even get it on AWS Marketplace! Sincerely, Fede P.S If you're curious about the other bottleneck, we have a sister launch going on right now, today! as well for PDF2MD, a lightweight and powerful OCR service. Just click on our company profile to check it out (and support it!)
Report
9mo ago
Nevo David
Nevo David
Postiz

Postiz

Amazing product, I love that it's open-source :)
Report
9mo ago
Rodrigo Mendoza-Smith
Rodrigo Mendoza-Smith
This problem is so Trieve! As I read about "solving bottlenecks" and "building fast APIs for embedding and reranking inference", I couldn't think of any other team that could be behind this. I'm really curious to know how you made the reranking inference so quick—I'll be checking out your repo soon :)
Report
9mo ago
Federico Chávez Torres
Federico Chávez Torres
Trieve

Trieve

Maker
@r0dms hahaha thank you, yes it's very "Trieve" indeed. It works extremely well and it's something we're looking to push heavily. It's crazy how much you lose to cloud products in regards to control, quality, and speed.
Report
9mo ago
Appwrite
Appwrite — The open-source Vercel alternative
The open-source Vercel alternative
Promoted

Trieve Launches

Trieve Vector Inference
Trieve Vector Inference Deploy fast, unmetered embedding inference in your own VPC

Launched on November 21st, 2024

PDF2MD
PDF2MD Convert your PDFs to markdown With AI OCR

Launched on November 21st, 2024

Do you use Trieve?

5.0
Based on 2 reviews
Review Trieve?
Reviews
Helpful
Philip Miglinci
Philip Miglinci
Glasskube

Glasskube

•3 reviews
We use their site search and they are blazing fast!
Report
9mo ago
finbar
Edward Huang, CFA
Edward Huang, CFA
used Trieve to buildfinbarfinbar
(86 points)
Amazingly powerful search that works with minimal effort. Founders are super on-the-ball.
Report
7mo ago