Viraat Das

Exla FLOPs - On-Demand GPU Clusters - The Cheapest H100s Anywhere

Exla FLOPs is the only service where you can instantly spin up 64, 128, or more GPUs - no waitlists, no commitments. Just clusters at your command.

Viraat Das

Hello!

I'm very excited to launch this product!

We built this because during our own AI training and fine‑tuning, we hit a wall when trying to scale past 8 GPUs. We were manually stitching together nodes across different clouds, so we decided to productize the solution.

Exla FLOPs has the lowest H100 pricing of any cloud provider.

The clusters are built for developers, with insane availability across all GPU types.

We’re thrilled to see what you build with it! Your feedback means a lot!

We're also giving out free credits. Please fill this out and we'll deposit credits soon after: https://tally.so/r/meGzgE

Roop Pal

@viraat_das This is solving such a desperate need as training is so expensive! Congrats on the launch and thank you for making this!

Viraat Das

@roop_pal Appreciate it Roop! Excited to see this being used to solve real problems!

Olga Shiryaeva

Congrats on the launch! This looks promising.

How do you handle node failures during long training runs? Any automatic recovery mechanisms?

What storage options do you provide for datasets and model checkpoints?

You mentioned "The clusters are built for developers, with insane availability across all GPU types." - how do you get around chip shortages?

Viraat Das

@olga_s52 thank you!

  1. Node failures:

    Right now, Exla FLOPs gives you direct SSH access to your own bare metal nodes, with no scheduler in the middle. This means you have full control over your setup (Slurm, DeepSpeed, Kubernetes, or otherwise). We don't currently handle automatic job recovery ourselves; most users bring their own checkpointing or orchestration setup to manage long training runs (see the checkpoint/resume sketch after this list).

  2. Storage options:
    We provide fast local NVMe on each node for high-speed I/O. For persistence across runs, users typically mount their own S3/GCS buckets or connect to remote storage (see the S3 sketch below the list). Shared storage and NFS-style volumes are in the works.

  3. Chip availability:
    Rather than relying on a single cloud, we dynamically source idle GPU capacity across a network of bare metal providers, resale markets, and underutilized datacenter inventory. This allows us to provision 64+ GPUs seamlessly, even when traditional clouds are maxed out or heavily reserved.
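
Since recovery is left to you, here's a minimal checkpoint/resume sketch for long runs on our nodes. It's illustrative only, assuming a PyTorch job; the model, optimizer, and paths are placeholders, not anything Exla FLOPs provisions for you:

```python
# Minimal checkpoint/resume sketch for long training runs on a bare metal node.
# Paths and training objects are hypothetical placeholders.
import os
import torch

CKPT_PATH = "/nvme/checkpoints/latest.pt"  # local NVMe on the node

def save_checkpoint(model, optimizer, step):
    # Persist enough state to resume after a node failure.
    torch.save(
        {"model": model.state_dict(),
         "optimizer": optimizer.state_dict(),
         "step": step},
        CKPT_PATH,
    )

def load_checkpoint(model, optimizer):
    # Returns the step to resume from (0 if no checkpoint exists yet).
    if not os.path.exists(CKPT_PATH):
        return 0
    ckpt = torch.load(CKPT_PATH, map_location="cpu")
    model.load_state_dict(ckpt["model"])
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt["step"]

# Usage in a training loop (hypothetical):
#   start_step = load_checkpoint(model, optimizer)
#   for step in range(start_step, total_steps):
#       ...train...
#       if step % 500 == 0:
#           save_checkpoint(model, optimizer, step)
```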
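
And for persistence across runs, a rough sketch of pushing that checkpoint from local NVMe to your own S3 bucket with boto3. The bucket name and keys are made up for illustration; you'd bring your own credentials and storage:

```python
# Sketch: copy a checkpoint between local NVMe and an S3 bucket for
# persistence across runs. Bucket and key names are placeholders.
import boto3

s3 = boto3.client("s3")

def persist_checkpoint(local_path="/nvme/checkpoints/latest.pt",
                       bucket="my-training-bucket",
                       key="runs/my-finetune/latest.pt"):
    # Upload the local checkpoint to S3.
    s3.upload_file(local_path, bucket, key)

def restore_checkpoint(local_path="/nvme/checkpoints/latest.pt",
                       bucket="my-training-bucket",
                       key="runs/my-finetune/latest.pt"):
    # Pull the last persisted checkpoint back onto a fresh node.
    s3.download_file(bucket, key, local_path)
```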

Do reach out with any feedback or feature requests: viraat@exla.ai

Prathamesh Ware

All the best for the launch.

Sawaram Suthar
All the best for the launch.
Viraat Das

@sawarams Thank you!

Aarush Kukreja

curious to know how you're able to do this? sick product

Youssef Kallel

This is gonna be huge

Jesse Li

Cheaper than every major cloud provider is actually insane. Congrats!

Viraat Das

@jesseliii Thanks! Ez money!

Aaron Aftab

Great looking piece of software and even better looking CEO

Viraat Das

@aaronaftab takes a good-looking CEO to make good-looking software!

Kawin Nikomborirak

Incredible service at an incredible price point!

Viraat Das

@knikomborirak Thank you thank you!

Viraat Das

The form link has been corrected: https://tally.so/r/meGzgE