Model Serving Engineer
Description
Bright Vision Technologies is a software development company looking for a skilled Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production.
Requirements
- Bachelor’s or Master’s degree in Computer Science or a related field.
- Six or more years of experience in distributed systems, infrastructure, or ML platform engineering.
- Strong proficiency in Python and a systems language such as Go, Rust, or C++.
- Deep experience operating high-throughput, low-latency services in production.
- Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM.
- Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization.
- Familiarity with Kubernetes, autoscaling, and modern cloud platforms.
- Experience with observability stacks including metrics, tracing, and structured logging.
- Solid grounding in performance engineering and capacity planning.
- Strong communication and incident response skills.
Benefits
- Competitive base salary commensurate with experience, plus benefits.
Originally posted on Himalayas