AI/LLM Engineer — Python, LangChain, RAG, AWS Lambda
Indexed description
Must-Haves
- Strong Python engineering skills and experience delivering production systems
- Hands-on experience building with LLM APIs (OpenAI, Anthropic, etc.) and deploying AI features end-to-end
- Experience with LangChain / LangGraph — including Prompt Engineering, Token Management, and Agent creation with tool-based workflows
- Experience building agentic workflows (tool use, routing, memory, safety/guardrails), ideally with MCP
- Solid understanding of RAG concepts — retrieval, chunking, embeddings, re-ranking, and evaluation
- Practical AWS experience, especially AWS Lambda, DynamoDB, Bedrock, and serverless architecture patterns
- Backend engineering experience: RESTful API design, webhook integration, JWT, pytest, and observability tools like LangSmith, Langfuse, or Sentry
- Experience doing Git repository analysis — understanding large codebases, debugging, refactors, and code health
- Comfortable working in fast-moving product environments with high ownership
- Available to work US Eastern Time hours
- NextJS and TypeScript for frontend components and integration
- AWS CDK for infrastructure-as-code and deployment
- Broader AWS services knowledge: S3, CloudWatch, IAM, API Gateway
- Experience with Pinecone or other vector databases
- Experience integrating or serving custom models (inference, model endpoints, monitoring)
- Familiarity with evaluation frameworks (offline tests, human feedback loops, regression suites)
- Strong software engineering fundamentals: testing, CI/CD, observability, performance
- Plan and develop applications that include custom AI components — orchestration, evaluation, tooling, and integrations
- Build intelligent agents using LLMs and the MCP framework to enable context-aware automation and decision-making
- Automate internal workflows and operational processes using Python and AWS Lambda
- Develop and improve RAG-based chatbots to increase response accuracy, grounding, and consistency
- Collaborate with the technical team to productionize custom models and connect them with product systems
- Design and maintain multi-agent AI pipelines, including fallback and graceful degradation strategies for reliability
- Work closely with the CTO and cross-functional team in 1-week sprints, contributing to architecture decisions and code reviews
- 🚀 High ownership from day one — You'll be a key technical contributor on a team building production AI at scale. Your decisions matter.
- 📈 Real-world impact — Help a US B2B platform modernize its AI infrastructure and deliver measurable value to its clients.
- 💡 Challenging engineering problems — Reliability, latency, token optimization, and multi-agent orchestration at production scale.
- 🔝 Direct exposure to senior technical leadership — Work alongside an experienced CTO and a path to grow into a lead engineering role.
- 💵 Up to $5,500/month — paid in USD, bi-weekly via Deel
- 🌎 Fully Remote — work from anywhere in Latin America
- 📄 Contractor engagement with a U.S. company
- 🏖️ Paid PTO — competitive package, grows with tenure
- 🤝 Referral Program — earn a bonus for referring talent that gets hired
- 🔗 LinkedIn Profile URL (required)
- 💻 GitHub Portfolio — show us your best work (required)
- 🌐 Live Project Examples — applications or tools you've built (encouraged)
- ✉️ Cover Letter — tell us why you're excited about this role (optional but encouraged)
Our core capabilities include AI strategy execution, fractional Chief AI Officer leadership, staff augmentation, and fully managed engineering workstreams. Moving beyond traditional consulting, we deeply embed within firms to build and ship high-impact solutions. Born in New York City, we are now a remote-first organization supported by an Argentine back office and a diverse US-based client portfolio.
Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.
Unlock free search