Back to search
CrowdStrike Builtin · Indexed 2026-06-11

Director, AI Alignment and Interpretability (Remote)

Remote or Hybrid, Oregon, United States 195K-290K Annually Remote

Senior level Builtin
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

CrowdStrike Director, AI Alignment and Interpretability (Remote) An Hour AgoSaved Remote or Hybrid USA 195K-290K Annually Senior level 195K-290K Annually Senior levelCloud • Computer Vision • Information Technology • Sales • Security • CybersecurityLead and conduct mechanistic interpretability and alignment research for security-specialized AI. Develop methods to read model internals, detect misuse signals, design training interventions and evaluation frameworks, publish original research, and recruit and mentor a lean research team.Top Skills: Activation PatchingAdversarial EvaluationAlignment EvaluationsCausal TracingCircuit AnalysisFeature VisualizationLarge Language ModelsMechanistic InterpretabilityProbing ClassifiersRed Teaming

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent