genesis Ashby · Posted 2mo ago

Member of Technical Staff, Inference (Bay Area)

Bay Fulltime

Continue to application Add your email once, then Caio opens the original posting.

Indexed description

What You’ll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation) Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks What You’ll Bring Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years) Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go) Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search

Want help applying to roles like this? Search Caio for free. If CV tailoring and application tracking get heavy, Full Caio Agent adds a human specialist.

View Full Agent

genesis Company profile preview

Source: Ashby
Location: Bay
Compensation: Not listed
Open on Caio: 64 roles

Salary insight

Compensation not indexed

Caio highlights salary ranges whenever the original posting exposes them. Compare similar roles as the index fills in.

Similar role details

Fulltime roles Location flexible matches Ashby postings

Company stats

Current index details for genesis, based on roles Caio has indexed from public sources.

64open roles 6sources 8markets Posted 8d agolatest role