Inferact jobs

14 open roles discovered and AI-scored on Swoopd

Member of Technical Staff, AMD GPU Performance Engineering
San Francisco
Member of Technical Staff, TPU Performance Engineering
Singapore
Member of Technical Staff, TPU or AMD GPU Performance Engineering
Singapore
Member of Technical Staff, Performance and Scale
Singapore
Member of Technical Staff, Kernel Engineering
Singapore
Member of Technical Staff, Inference
Singapore
Member of Technical Staff, Cloud Orchestration
Singapore
Member of Technical Staff, Inference
San Francisco
Member of Technical Staff, Developer Relations
San Francisco
Member of Technical Staff, TPU & AMD GPU Performance Engineering
San Francisco
Member of Technical Staff, Exceptional Generalist (Remote)
Remote
Inferact, founded by the creators of vLLM, seeks an exceptional generalist engineer to work across the full vLLM inference stack, from GPU kernels to distributed systems and cloud orchestration. The role is globally remote and asynchronous-first, and it requires deep expertise in systems programming, GPU/accelerator programming, or distributed systems, with proficiency in CUDA, Rust/Go/C++, Python/PyTorch, and Kubernetes. Compensation is not specified.
Member of Technical Staff, Cloud Orchestration
San Francisco
Member of Technical Staff, Kernel Engineering
San Francisco
Member of Technical Staff, Performance and Scale
San Francisco
