Senior Machine Learning Software Development Engineer, AI Ops Integration
As a Machine Learning Software Development Engineer, you'll design and deploy production-ready systems that combine traditional ML with modern agentic architectures to solve impactful operational problems at scale.
This role combines the excitement of a startup environment with the scale of Amazon Operations. You'll research state-of-the-art open source and internal tools and tackle highly ambiguous problems. If you thrive on ownership and ambiguity, are passionate about AI, and want to fundamentally influence how Amazon Operations leverages AI, this role offers an extraordinary opportunity to make your mark.
Key job responsibilities
What You'll Do
- Lead the technical design and architecture of production ML/LLM systems end-to-end, from data pipelines and model serving to scalable user-facing applications
- Architect and build agentic AI solutions that orchestrate complex operational workflows across multiple systems, APIs, and decision points
- Define the technical strategy for internal tooling, designing front-end platforms (dashboards, products) that serve non-technical operations users at worldwide scale
- Own the integration architecture across internal systems, databases, and MCP servers, establishing patterns that enable modular multi-system orchestration
- Drive engineering excellence across the ML lifecycle: set standards for experimentation, deployment, monitoring, evaluation, and incident response
- Design guardrails, evaluation frameworks, and human-in-the-loop architectures that ensure production AI systems operate safely and reliably at scale
- Mentor junior engineers, conduct design reviews, and raise the technical bar across the team
- Partner with scientists, product managers, and operations leaders to translate ambiguous business problems into well-scoped technical solutions with clear delivery milestones
Our charter is to identify high-impact automation opportunities, build AI agents that can handle them reliably, and deploy these systems into production where they process real decisions daily.
The team operates with a build-measure-learn cycle. We work closely with operations partners to understand their problems, prototype solutions quickly, measure impact rigorously, and iterate based on real-world performance.
Basic Qualifications
- Bachelor's degree
- Experience as a mentor, tech lead or leading an engineering team
- Experience in professional, non-internship software development
- Experience leading the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
- Experience programming with at least one modern language such as Java, C++, or C# including object-oriented design
Preferred Qualifications
- Master's degree in computer science or equivalent
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Company - Amazon EU SARL, Irish Branch
Job ID: A10423496