Back to search
CrowdStrike Builtin · Indexed 2026-06-11

Director, Model Post-Training and Agentic Research (Remote)

Remote or Hybrid, Oregon, United States 195K-290K Annually Remote

Senior level Builtin
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

CrowdStrike Director, Model Post-Training and Agentic Research (Remote) An Hour AgoSaved Remote or Hybrid USA 195K-290K Annually Senior level 195K-290K Annually Senior levelCloud • Computer Vision • Information Technology • Sales • Security • CybersecurityLead and hands-on develop the full post-training stack for security-domain AI, including SFT, RLHF/RLAIF, reward modeling, and agent-RL harnesses. Build training environments and agent scaffolds, define evaluation and benchmarks, drive research direction, publish findings, and recruit and grow a high-density research and engineering team while actively contributing to experiments and architecture.Top Skills: Agent-RlLlama-ClassLlmsPpoQwen-ClassReinforcement LearningReward ModelingRlaifRlhfRollout InfrastructureSupervised Fine-Tuning (Sft)Tool-Use Frameworks

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent