Back to search
Jobright.ai Linkedin · Posted 10d ago

Machine Learning Engineer

United States

Linkedin
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Jobright is your personal AI job search agent that transforms the job search process into a fast, expert-guided journey. They are seeking a Machine Learning Engineer to build and scale business-facing AI agents, managing the entire lifecycle from prototype to production.


Why Join Us


  • Ship production-grade AI systems that directly impact how users navigate their careers
  • Own meaningful infrastructure decisions with real consequences at scale
  • Operate at the intersection of ML research, systems engineering, and live product
  • Help define the technical foundation of an AI-first job search experience


Responsibilities


  • Architect and maintain the scalable infrastructure needed to reliably serve production AI agents at scale
  • Build and continuously optimize LLM inference pipelines, with a focus on minimizing latency, maximizing throughput, and managing compute costs
  • Develop automated monitoring, testing, and CI/CD workflows that keep our agents performant and production-ready
  • Optimize data ingestion and processing pipelines to enable real-time agent responsiveness and support robust RAG architectures
  • Design and maintain the APIs and backend services that connect AI model outputs to the user-facing product


Qualifications


Required


  • Recent grad or early-career engineer (0–2 years) with a degree in Computer Science, Software Engineering, or a closely related field
  • Strong Python skills and hands-on experience with backend frameworks such as FastAPI, Flask, or Django
  • Working experience with ML frameworks like PyTorch or TensorFlow, paired with solid engineering fundamentals - version control, CI/CD, and unit testing
  • Exposure to LLM deployment and a practical understanding of the infrastructure required to support autonomous agent systems
  • Must be based in and authorized to work in the United States


Preferred


  • Prior internship or project experience in MLOps, backend engineering, or distributed systems at an AI-focused company
  • Hands-on familiarity with containerization tools (Docker, Kubernetes) and cloud platforms such as AWS, GCP, or Azure
  • Experience working with vector databases - Pinecone, Milvus, Weaviate, or similar - in a production context
  • Solid grasp of SQL and NoSQL database management for high-throughput, large-scale data workloads
Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent