G2i Himalayas · Posted 3mo ago

Machine Learning Evaluation Specialist

Albania, Argentina, Austria, Belgium, Bolivia, Bosnia and Herzegovina, Brazil, Bulgaria, Canada, Chile, Colombia, Czechia, Dominican Republic, Ecuador, Estonia, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Malta, Mexico, Montenegro, North Macedonia, Paraguay, Peru, Poland, Portugal, Romania, Serbia, Slovakia, Spain, Turkey, United Kingdom, United States, Uruguay USD 416000-832000 Contractor Remote

Continue to application Add your email once, then Caio opens the original posting.

Indexed description

Machine Learning Evaluation Specialist (Remote)

List of accepted countries and locations

Important for US applicants: This is a 1099 independent contractor role and is not compatible with F-1 OPT, STEM OPT, or other visa statuses that require W-2 employment, guaranteed hours, or employer sponsorship. We are unable to provide offer letters or employment verification for this role.

Help design the hardest ML problems state-of-the-art AI hasn't solved yet.

We're hiring domain experts to build evaluation tasks that challenge the frontier of AI. This is not an ML engineering role — it's a research role. You'll use deep expertise in your field to create problems that general ML knowledge can't touch.

What you'll do

Propose and frame original, research-grade ML problems rooted in your domain
Design evaluation tasks that require specialized knowledge well beyond standard pipelines
Assess AI-generated solutions for correctness, creativity, and methodological rigor — and explain exactly where and why they fall short
Document problem difficulty, required domain knowledge, and expected failure modes

What you need

Graduate-level expertise (MS or PhD preferred) in a scientific or technical domain that intersects with ML
Strong working knowledge of ML methods — model selection, feature engineering, evaluation metrics
Deep familiarity with active research problems in your field — you know where general ML knowledge runs out
Excellent written communication — you can articulate complex problems clearly and precisely. This cannot be overstated.
Self-motivated and comfortable working independently on intellectually demanding tasks

What you don't need

No prior AI training or RLHF experience required
No software engineering background needed — domain expertise and research instincts are what matter

Domains we're especially looking for

Computational Biology / Bioinformatics
Genomics / Molecular Biology
Physics / Astrophysics / Signal Processing
Climate / Environmental Modeling
Healthcare / Medical Imaging
Neuroscience / Brain-Computer Interfaces
Materials Science / Chemistry
Finance / Quantitative Modeling
Robotics / Control Systems / Reinforcement Learning
Advanced NLP (specialized domains)
Mathematics / Statistics (applied)

Logistics

Fully remote — work from anywhere
$200–$400/hr depending on domain and seniority
10–40 hrs/week, hourly contract
Assessment required — paid if approved
Independent contractor (1099) — not compatible with F-1 OPT, STEM OPT, or visa statuses requiring W-2 employment or employer sponsorship

⚠️ This is a project-based, freelance opportunity with no guaranteed hours. We recommend keeping other work options open while waiting for project assignment.

Originally posted on Himalayas

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search

Want help applying to roles like this? Search Caio for free. If CV tailoring and application tracking get heavy, Full Caio Agent adds a human specialist.

View Full Agent

G2i Company profile preview

Source: Himalayas
Location: Albania, Argentina, Austria, Belgium, Bolivia, Bosnia and Herzegovina, Brazil, Bulgaria, Canada, Chile, Colombia, Czechia, Dominican Republic, Ecuador, Estonia, France, Germany, Greece, Hungary, Ireland, Italy, Latvia, Lithuania, Malta, Mexico, Montenegro, North Macedonia, Paraguay, Peru, Poland, Portugal, Romania, Serbia, Slovakia, Spain, Turkey, United Kingdom, United States, Uruguay
Compensation: USD 416000-832000
Open on Caio: 76 roles

Salary insight

USD 416000-832000

Caio highlights salary ranges whenever the original posting exposes them. Compare similar roles as the index fills in.

Similar role details

Contractor roles Remote matches Himalayas postings

Company stats

Current index details for G2i, based on roles Caio has indexed from public sources.

76open roles 4sources 5markets Posted 3d agolatest role

Indexed description

Machine Learning Evaluation Specialist (Remote)

What you'll do

What you need

What you don't need

Domains we're especially looking for

Computational Biology / Bioinformatics

Genomics / Molecular Biology

Climate / Environmental Modeling

Healthcare / Medical Imaging

Neuroscience / Brain-Computer Interfaces

Materials Science / Chemistry

Finance / Quantitative Modeling

Advanced NLP (specialized domains)

Mathematics / Statistics (applied)

Logistics

Fully remote — work from anywhere

10–40 hrs/week, hourly contract

Assessment required — paid if approved