Gramian Consulting Group Himalayas · Posted 2mo ago

AI Evaluation Engineer (Data Analysis & Multi-Agent Systems)

Remote / flexible USD Full time Remote

Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Gramian Consultancy is seeking an AI Evaluation Engineer to design benchmark tasks for complex data analysis workflows. The ideal candidate has 5+ years of experience in data analysis and strong proficiency in Python and SQL.

Requirements

5+ years of experience in data analysis or analytics-heavy roles
Strong proficiency in Python (pandas, NumPy) and SQL
Experience working with real-world, messy datasets (CSV, JSON, logs, reports)
Ability to design analytical problems with clear, verifiable answers
Solid understanding of statistics (distributions, correlations, outliers)
Familiarity with AI benchmarks or evaluation environments (e.g., SWE-bench or similar)
Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Benefits

Flexible work arrangements

Originally posted on Himalayas

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search

Want help applying to roles like this? Search Caio for free. If repetitive applications get heavy, Managed Job Search adds supervised execution for $99/month.

View Managed Job Search

Gramian Consulting Group Company profile preview

Source: Himalayas
Location: Remote / flexible
Compensation: USD
Open on Caio: 7 roles

Salary insight

USD

Caio highlights salary ranges whenever the original posting exposes them. Compare similar roles as the index fills in.

Similar role details

Full time roles Remote matches Himalayas postings

Company stats

Current index details for Gramian Consulting Group, based on roles Caio has indexed from public sources.

7open roles 1sources 0markets Posted 2mo agolatest role