Back to search
Gramian Consulting Group Himalayas · Posted 24d ago

AI Evaluation Engineer (Data Analysis & Multi-Agent Systems)

Remote / flexible USD Full time Remote

AI Evaluation Engineer AI Analytics Engineer AI Model Evaluation Specialist Senior AI Analytics Engineer
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Gramian Consultancy is seeking an AI Evaluation Engineer to design benchmark tasks for complex data analysis workflows. The ideal candidate has 5+ years of experience in data analysis and strong proficiency in Python and SQL.

Requirements

  • 5+ years of experience in data analysis or analytics-heavy roles
  • Strong proficiency in Python (pandas, NumPy) and SQL
  • Experience working with real-world, messy datasets (CSV, JSON, logs, reports)
  • Ability to design analytical problems with clear, verifiable answers
  • Solid understanding of statistics (distributions, correlations, outliers)
  • Familiarity with AI benchmarks or evaluation environments (e.g., SWE-bench or similar)
  • Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Benefits

  • Flexible work arrangements

Originally posted on Himalayas

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent