hyphenconnect Greenhouse · Posted 1mo ago

AI Safety Specialist (AI Engineering)

United States

Engineering Greenhouse


We are looking for an AI Safety Specialist to strengthen the security and robustness of language models. You will help ensure AI systems are deployed safely by conducting adversarial testing, implementing protective measures, and aligning AI behavior with ethical principles.

Responsibilities:

  • Conduct adversarial testing on LLMs and multimodal agents.
  • Implement guardrails and real-time filtering for autonomous tool use.
  • Develop constitutional AI principles and assist with RLHF alignment pipelines.

Qualifications:

  • Background in cybersecurity, prompt engineering, or adversarial ML.
  • Experience with jailbreak taxonomies and automated red-teaming frameworks.
  • Strong analytical mindset for identifying edge cases.