Lead AI Engineer
Indexed description
We are seeking a Lead AI Engineer to design, build and scale cutting-edge AI applications powered by large language models. In this role, you will partner with clients to deliver tailored LLM-driven solutions, architect agentic systems and drive the adoption of emerging AI technologies across enterprise environments.
Responsibilities
- Design, implement and maintain end-to-end AI applications, including chatbots, Q&A platforms, agent workflows and other LLM-driven solutions
- Collaborate directly with clients to understand their needs, identify opportunities and recommend tailored AI/LLM solutions that drive business value
- Architect and optimize robust data pipelines, prompt strategies and datasets to ensure effective, accurate and scalable AI models
- Evaluate, monitor and refine AI system performance, ensure outputs are accurate, secure, scalable and compliant with industry regulations and best practices
- Conduct research, design experiments and perform rapid prototyping to validate technical feasibility and demonstrate the business value of AI solutions
- Stay current with evolving LLM technologies, frameworks, protocols (such as MCP, A2A, ACP) and methodologies, continuously improve solution quality and client outcomes
- Design and implement agentic systems with frameworks such as LangChain, LangGraph and Semantic Kernel, integrate with vector databases and advanced memory architectures
- Develop and maintain APIs and system integrations for production-grade AI applications, including enterprise system integration (CRM, ERP, databases)
- Deploy AI solutions at scale, consider performance, cost-efficiency, maintainability, observability and security (including guardrails and prompt injection prevention)
- Implement and monitor retrieval systems (keyword search, vector search, embeddings), ranking algorithms and agent evaluation frameworks
- Use MLOps/AIOps practices for agentic systems and ensure robust observability and monitoring of deployed solutions
- Clearly communicate complex technical concepts and AI strategies to both technical and non-technical stakeholders, iterate on models based on user feedback
Requirements
- Strong proficiency in at least one modern programming language (such as Python, Java, C#, Go, etc.); experience with web frameworks like FastAPI or similar is a plus
- Deep understanding of the AI application development lifecycle, including production deployment, system integration and rapid UI prototyping (Streamlit, Gradio or similar)
- Familiarity with major LLM platforms and APIs (OpenAI, Anthropic, Amazon Bedrock, Gemini) and related frameworks (LangChain, LangGraph, LlamaIndex, Strands Agents, etc.)
- Knowledge of advanced AI integration patterns (e.g., RAG, agent orchestration, tool calling), retrieval systems (keyword/vector search, embeddings) and ranking algorithms
- Experience to deploy AI solutions at scale, with a focus on performance, cost-efficiency, maintainability, observability and security (including guardrails and prompt injection prevention)
- Proven ability to evaluate generative AI quality with retrieval/classification scores, LLM-based evaluation, agent evaluation metrics and A/B testing
- Experience with vector databases (Pinecone, Weaviate, ChromaDB, FAISS) and semantic/hybrid search
- Experience to design experiments, conduct A/B tests and iterate on models based on user feedback
- Experience with enterprise system integration (CRM, ERP, databases) and deployment to cloud AI platforms or on-premise solutions
- Experience with observability and monitoring tools/frameworks, and application of MLOps/AIOps practices for agentic systems
- Familiarity with emerging protocols (MCP, A2A, ACP) and advanced memory architectures
- Proven experience in AI engineering and delivery of ML-based solutions in production environments
- Strong problem-solving skills, attention to detail and ability to work independently and collaboratively
- Excellent communication, collaboration and interpersonal skills, with the ability to explain complex technical concepts to non-technical stakeholders
Technologies
- Proficiency in at least one modern programming language (e.g., Python, Java, C#, Go, etc.) for AI development
- Web frameworks: FastAPI, Streamlit, Gradio, Flask, Spring Boot, ASP.NET or similar
- Major LLM platforms and APIs: OpenAI, Anthropic, Amazon Bedrock, Gemini
- Agentic frameworks: LangChain, LangGraph, Semantic Kernel, LlamaIndex, Strands Agents
- Data pipeline and integration tools
- Vector databases: Qdrant, FAISS, Chroma, Pinecone, Weaviate, ChromaDB
- Retrieval and ranking systems: keyword search, vector search, embeddings, ranking algorithms
- Cloud AI platforms: Azure OpenAI, Amazon Bedrock, GCP Vertex AI
- On-premise solutions: vLLM
- Enterprise AI platforms: AWS AgentCore, Databricks AgentBricks, Google Agents Space, Azure AI Foundry
- Observability and monitoring tools/frameworks
- MLOps/AIOps practices for agentic systems
- Security and guardrail tools for AI applications
- Protocols: MCP, A2A, ACP
- Advanced memory architectures
We offer
- Learning Culture - We want you to be the best version of yourself, that is why we offer unlimited access to learning platforms, a wide range of internal courses, and all the knowledge you need to grow professionally
- Health Coverage - Health and wellness are important, that is why we have you and up to four family members in a premiere health plan. We have a couple of options, so you can choose what is best for you and your family
- Visual Benefit - Seeing your work for us would be a sight for sore eyes. We want your vision to always be at 100% which is why we offer up to $200.000 COP for any visual health expenses
- Life Insurance Plan - We have partnered with MetLife to offer a full-coverage Ife insurance plan. So, your family is covered, even if you are gone.
- Medical Leave Coverage - We are one of the few companies that cover 100% of your medical leave, for up to 90 days. Your health is the most important thing to us
- Professional Growth Opportunities - We have designed a highly competitive and complete development process, where you will have all the tools to get where you have always wanted to be, personally and professionally
- Stock Option Purchase Plan - As an EPAMer you can be more than just an employee, you will also have the opportunity to purchase stock at a reduced price and become a part owner of our organization
- Additional Income - Besides your regular salary, you will also have the chance to earn extra income by referring talent, being a technical interviewer, and many more ways
- Community Benefit - You will be part of a worldwide community of over 50,000 employees, where you can learn, challenge yourself, stand out, and share your knowledge and experience with multicultural teams!
We accept CVs in English only.
Please note that even though you are applying for this position, you may be offered other projects to join within EPAM.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search