Inferact jobs
14 open roles discovered and AI-scored on Swoopd
Member of Technical Staff, AMD GPU Performance Engineering
San FranciscoMember of Technical Staff, TPU Performance Engineering
SingaporeMember of Technical Staff, TPU or AMD GPU Performance Engineering
SingaporeMember of Technical Staff, Performance and Scale
SingaporeMember of Technical Staff, Kernel Engineering
SingaporeMember of Technical Staff, Inference
SingaporeMember of Technical Staff, Cloud Orchestration
SingaporeMember of Technical Staff, Inference
San FranciscoMember of Technical Staff, Developer Relations
San FranciscoMember of Technical Staff, TPU & AMD GPU Performance Engineering
San FranciscoMember of Technical Staff, Exceptional Generalist (Remote)
RemoteInferact, founded by the creators of vLLM, seeks an exceptional generalist engineer to work across the full vLLM inference stack—from GPU kernels to distributed systems and cloud orchestration. The role is globally remote, asynchronous-first, and requires deep expertise in systems programming, GPU/accelerator programming, or distributed systems, with proficiency in CUDA, Rust/Go/C++, Python/PyTorch, and Kubernetes. Compensation is not specified.
Member of Technical Staff, Cloud Orchestration
San FranciscoMember of Technical Staff, Kernel Engineering
San FranciscoMember of Technical Staff, Performance and Scale
San Francisco
Want Inferact roles matched to you?
Swoopd scores fresh postings against your résumé so you only see the matches that matter.