Back to search
Bright Vision Technologies Himalayas · Posted 2d ago

Model Serving Engineer

USD Full time Remote

Developer Mid-level Model-Serving-Engineering, Model-Serving, Modeling-Engineer, AI-ML-Services-Engineer, Model-Platform-Engineering, Software-Engineer Himalayas
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Bright Vision Technologies is a software development company looking for a skilled Model Serving Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production.

Requirements

  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • Six or more years of experience in distributed systems, infrastructure, or ML platform engineering.
  • Strong proficiency in Python and a systems language such as Go, Rust, or C++.
  • Deep experience operating high-throughput, low-latency services in production.
  • Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM.
  • Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization.
  • Familiarity with Kubernetes, autoscaling, and modern cloud platforms.
  • Experience with observability stacks including metrics, tracing, and structured logging.
  • Solid grounding in performance engineering and capacity planning.
  • Strong communication and incident response skills.

Benefits

  • Competitive base salary commensurate with experience, plus benefits.

Originally posted on Himalayas

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.

Unlock free search