Credflow AI · LinkedIn · Posted 2mo ago

Sr. ML Engineer

India

Indexed description

We are hiring a Senior ML Systems Engineer to own the execution and optimisation of PrimaLabs' platform on real customer hardware. This role sits at the core of our delivery engine, focusing on tuning inference runtimes, running large-scale benchmarks, and integrating optimisation pipelines. You will work directly on customer deployments across modern accelerators and act as a key technical counterpart in performance-critical engagements.

Responsibilities

Own and execute optimisation of ML workloads on customer hardware (NVIDIA and AMD accelerators, including CUDA-based stacks).
Tune and optimise inference runtimes such as vLLM and SGLang.
Design and run large-scale benchmarking and performance evaluation pipelines.
Build and manage configuration sweep infrastructure for performance exploration.
Integrate and extend optimisation pipelines (DeepHyper or similar frameworks).
Profile system performance and identify bottlenecks across compute, memory, and I/O.
Work closely with customers to deliver measurable performance improvements.
Collaborate with research and infrastructure teams to productionise optimisations.

Requirements

5+ years of experience in ML infrastructure, ML systems, or performance engineering.
Strong experience with model inference systems and runtime optimisation.
Hands-on experience with profiling tools and performance tuning.
Deep understanding of GPU/accelerator-based systems and ML workloads.
Proficiency in Python and system-level debugging.
Experience working with large-scale benchmarking or performance testing systems.
Ability to work directly with customers and translate requirements into solutions.

Good To Have

Experience with large-scale distributed systems or model serving platforms.
Familiarity with low-level performance optimisation (memory, compute, and I/O bottlenecks).
Experience working with hardware-software co-design or system-level tuning.
Background in high-performance computing (HPC).
Contributions to open-source ML systems or infrastructure projects.
Experience in customer-facing or solution engineering roles.

This job was posted by Rakshit Singh Rawat from CredFlow AI.

Credflow AI Company profile preview

Source: LinkedIn
Location: India
Compensation: Not listed
Open on Caio: 2 roles

Company stats

Current index details for Credflow AI, based on roles Caio has indexed from public sources.

2 open roles · 1 source · 1 market · Latest role posted 2mo ago