Back to search
Talent Connect Linkedin · Posted 1mo ago

Applied ML Engineer (Speech/ASR/LLMs) - Senior (Brazil / Argentina)

São Paulo, São Paulo, Brazil

Linkedin
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Applied ML Engineer (Speech/ASR/LLMs) - Senior

📍 Remote – Argentina/Brazil

🕘 Full-time

About The Role

We’re looking for an Applied ML Engineer to lead the development and operationalization of machine learning models powering real-world AI products.

The primary focus of the role will be building and improving speech-to-text (ASR) systems for medical conversations in Brazilian Portuguese and Latin American Spanish, while also contributing to LLM fine-tuning, clinical classifiers, and production ML infrastructure.

This is a highly hands-on role focused on taking models from experimentation to production.

What you’ll do

  • Design, train, fine-tune, and deploy ML models for speech and language applications
  • Build and maintain training pipelines and evaluation frameworks
  • Work on ASR systems for domain-specific medical conversations
  • Develop and improve data pipelines and ML infrastructure
  • Collaborate on LLM fine-tuning and classifier development
  • Optimize model performance, reliability, and scalability
  • Partner with product and domain teams to translate business needs into ML solutions
  • Contribute to production deployment and monitoring of ML systems

What we’re looking for

  • Strong experience building and operationalizing ML models in production
  • Hands-on experience with:
    • ASR / speech models / audio ML
    • PyTorch or modern ML frameworks
    • ML infrastructure and distributed training
  • Experience building:
    • training pipelines
    • evaluation frameworks
    • production ML workflows
  • Strong programming skills in Python
  • Ability to work independently in ambiguous environments
  • Experience collaborating across technical and non-technical teams
Nice to have

  • Experience with:
    • LLM fine-tuning
    • orchestration frameworks
    • clinical or healthcare data
  • Experience building data flywheels or feedback loops
  • Experience with GPU orchestration and experiment tracking
  • Fluency in Portuguese and/or Spanish
Work model

  • Remote – Americas
  • Preference for candidates located near São Paulo or Medellín (not mandatory)
  • Full-time

💡 Note

This is an applied ML engineering role, not a pure research position. We are looking for builders who can take models from experimentation through deployment and production operation.

How to apply

Please apply directly following the link.

If it does not suit you, please share it with your colleagues.

Thanks!

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent