AI Engineer
Indexed description
Recognized for the third consecutive year as a Leader by Gartner, we are part of ASMPT, the world's largest supplier of best-in-class equipment, and technological process partner for the electronics and semiconductor industries.
The role:
You will join an existing AI engineering team focused on building reliable AI infrastructure for manufacturing systems. This is hands-on work developing MCP servers, creating tooling for model observability, telemetry, and retraining pipelines—no leadership required, just solid execution within a collaborative team.
This role is based at our headquarters in Porto, Portugal, where collaboration, experimentation, and rigorous engineering standards are essential. You're expected to stay closely connected - actively participating in technical design reviews, architecture discussions, and engaging with teams across Product, Data, and Platform Engineering. This is a role for someone who cares about building AI systems that are not just smart, but observable, debuggable, and continuously improving.
What you'll do:
Develop MCP Servers
- Implement and maintain Model Context Protocol (MCP) servers that connect language models to manufacturing domain tools and data sources
- Optimize server performance and define clear interfaces for tool integration, ensuring models have safe, reliable access to business logic
- Collaborate with team leads to map complex manufacturing workflows into structured tools and prompts
- Design and implement comprehensive telemetry systems to track model behavior, token usage, latency, and cost in production
- Create dashboards and alerting systems that give real-time visibility into model performance and anomalies
- Instrument models to capture structured traces: prompts/system context, tool invocations, inputs/outputs, intermediate artifacts, and decision metadata
- Contribute to standards for logging, tracing, and distributed observability across all AI systems
- Build data collection pipelines that capture production interactions, model failures, and edge cases for retraining
- Implement automated systems for evaluating model improvements and managing safe rollouts
- Contribute to feedback loops that allow the platform to learn from real-world usage without manual intervention
- Write clean, testable code and contribute to team codebases, documentation, and CI/CD processes
- Participate in code reviews, technical design reviews, and troubleshooting production issues
- Experiment with new tools and techniques under team guidance to improve AI system reliability
- Promote the adoption of agentic coding across teams to accelerate delivery and increase throughput while maintaining quality and security standards
- Design repositories, CI, and developer tooling that make agent-driven changes safe (linting, typed APIs, contract tests, golden tests, eval gates)
- Implement robust error handling, fallback strategies, and graceful degradation for AI systems
- Monitor and tune AI systems for performance, uptime, and safety in manufacturing environments
- Gather feedback from operations and product teams to refine tooling and server implementations
- Deployed production MCP servers handling real manufacturing workloads
- Built and iterated on observability tools used daily by engineering and ops teams
- Contributed to retraining pipelines that reduce model staleness and improve prediction accuracy
- Established clear patterns and best practices that help the team scale AI systems reliably
- Delivered robust tooling for debugging, monitoring, and managing AI systems in manufacturing environments
- Work on AI that powers real factories, solving problems with immediate industrial impact
- Join a tight-knit engineering team building the backbone of trustworthy AI infrastructure for manufacturing
- Contribute to systems that manufacturers depend on daily, with full observability and reliability
- Enjoy the freedom to code, collaborate, and grow technically in a rigorous engineering environment
- At least 1 year of hands-on machine learning experience, including training and testing models, and a practical understanding of overfitting, generalization, and bias; plus a solid grasp of common model families (e.g., k-nearest neighbors, decision trees/random forests, support vector machines, linear/logistic regression, and basic neural networks)
- At least 1 year of hands-on experience with LLMs in production or applied settings, including inference, prompt engineering, and evaluation; with a working understanding of how LLMs are configured and behave (e.g., temperature, top-p, max tokens, context windows, and tool/function calling)
- Experience with agentic coding workflows or LLM-based code assistance, using tools that accelerate implementation, refactoring, and test generation while maintaining strong engineering rigor (reviews, testing, documentation, and CI discipline)
- Familiarity with server development, APIs, and containerization (Docker/Kubernetes)
- Strong problem-solving skills and comfortable writing production code - tests, docs, and all
- Excellent software engineering fundamentals: version control, testing, code review, documentation
- Ability to collaborate effectively in a team and work well under technical leadership
- Excellent spoken and written English communication skills
- Experience with manufacturing operations, MES systems, or Industry 4.0 concepts
- Familiarity with MLOps tools, model monitoring platforms, or ML infrastructure
- Basic knowledge of observability tools (Prometheus, Grafana, or similar) and data pipelines
- Proficiency in Python and experience with AI frameworks like PyTorch, TensorFlow, or LangChain
If you need accommodation during the recruitment process, please let us know - we're happy to support you.
Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.
Unlock free search