Safeguards Enforcement Analyst, Safety Evaluations

Anthropic · Remote (Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC; San Francisco, CA | New York City, NY)
Data & Analytics

WFA Digital Insight

As the AI landscape evolves, demand is growing for professionals who can ensure model safety and policy compliance. Given Anthropic's mission to create reliable and interpretable AI systems, this Safeguards Enforcement Analyst role stands out in the current remote job market. Candidates should be prepared to demonstrate attention to detail, the ability to navigate ambiguity, and skill in coordinating across teams.

Job Description

About the Role

Anthropic's Safeguards team is responsible for enforcing policies, protecting users, and ensuring the platform is not misused. As a Safeguards Enforcement Analyst focused on Safety Evaluations, you will play a central role in ensuring models meet safety and policy standards.

Responsibilities

  • Support model launch readiness by running evaluations and interpreting results
  • Partner with policy experts to identify risks and scope evaluation approaches
  • Work with stakeholders to manage evaluation outcomes and drive mitigations
  • Develop processes for creating product-specific evaluations as Anthropic's product surface area expands

Requirements

Candidates should be detail-oriented, comfortable navigating ambiguity, and capable of coordinating across teams.

How to Stand Out

  • Ensure your resume highlights experience with policy compliance and model safety evaluations.
  • Prepare to discuss your approach to identifying and mitigating risks in AI models.
  • Showcase your ability to work across teams, including policy experts and engineering teams.
  • Be ready to provide examples of how you've driven process improvements in previous roles.
  • Consider highlighting any experience with AI model development or auditing.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.