Director, Model Post-Training and Agentic Research (Remote)
CrowdStrike
Posted an hour ago
Remote or Hybrid, USA
195K-290K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity

Lead, and contribute hands-on to, development of the full post-training stack for security-domain AI, including SFT, RLHF/RLAIF, reward modeling, and agent-RL harnesses. Build training environments and agent scaffolds, define evaluations and benchmarks, drive research direction, and publish findings. Recruit and grow a high-density research and engineering team while actively contributing to experiments and architecture.

Top Skills: Agent-RL, Llama-class, LLMs, PPO, Qwen-class, Reinforcement Learning, Reward Modeling, RLAIF, RLHF, Rollout Infrastructure, Supervised Fine-Tuning (SFT), Tool-Use Frameworks