Staff Engineer, Inference Optimizations
DigitalOcean
Posted 2 hours ago
In-Office: Seattle, WA, USA
191K-239K Annually
Senior level
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)

Lead design and implementation of scalable, multi-tenant serverless inference services focused on throughput, GPU utilization, resiliency, observability, and operational tooling. Partner with platform and GPU teams, debug production performance issues, mentor engineers, drive incident response, and create reusable patterns, standards, and automation for inference workloads.

Top Skills: API Gateway, Cloud-Native, Go, GPU, Kubernetes, Microservices, Service Mesh, TensorRT-LLM, Triton, vLLM