Back to search
Mindrift Remotejobs · Posted 28d ago

Freelance Agent Evaluation Engineer

Remote USD 90-90 Full-time Remote

general General Remotejobs
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You'll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities.

Requirements

- Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily Python - Background in full-stack development, with experience building React-based interfaces and robust back-end systems - Experience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure tools Benefits

- Opportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hour

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent