1.4 EXAFLOP CLUSTER
CHAI | PALO ALTO
[ CHAI Revenue Growth ]
Incentives & Scale
We have built a consumer platform where users can build their own AI characters and stories. Our B2C business has grown exponentially, reaching $70M/yr in revenue in 2026.
1.4 EXAFLOPS GPU CLUSTER FOR AI INFERENCE
At CHAI, we serve hundreds of in-house trained LLMs across several GPU chip types from both AMD and NVIDIA. Open-source solutions such as vLLM work well for simple workloads, but we have found that optimizations such as custom kernels and compute-efficient attention approximations improve on vLLM by almost an order of magnitude.
Demand for CHAI's AI models has grown exponentially, exceeding the capacity and capabilities of off-the-shelf providers. Out of necessity, we have had to verticalize and bring inference in-house. Starting small in 2023 with a cluster of A5000s rented on demand from CoreWeave, we have grown to a cluster of thousands of GPUs spread across 4 regions. Multi-region inference brings its own challenges, and it has pushed CHAI to the cutting edge of technology.
TOKENS PROCESSED PER DAY
[ Product ]
Building a Platform for Social AI
We believe in platforms. There is huge demand for AI that is not only factually correct but also entertaining and social.