As one of the original creators and core maintainers of vLLM, the most popular open-source LLM inference engine, he co-founded Inferact in 2025 to commercialize vLLM and make AI inference cheaper and faster.
Studied at
UC Berkeley
Before
Original Creator / Core Maintainer at vLLM (open-source project)