Safeguards Analyst, Human Exploitation & Abuse

AnthropicAnthropic·Remote(Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC)
Data & Analytics

WFA Digital Insight

As the demand for digital safety experts grows, Anthropic's Safeguards Analyst role stands out in the remote job market. With a focus on human exploitation and abuse, this position requires a unique blend of technical and analytical skills. Candidates should be prepared to navigate complex issues and prioritize user well-being, with the company's mission to create reliable and interpretable AI systems.

Job Description

About the Role

Anthropic is seeking a Safeguards Analyst to focus on human exploitation and abuse. The successful candidate will be responsible for building and executing enforcement workflows to detect and mitigate the use of Anthropic's products for harmful activities.

Responsibilities

  • Design and architect automated enforcement systems and review workflows for human exploitation and abuse
  • Partner with Product, Engineering, and Data Science teams to build and tune detection signals
  • Curate policy violation examples and maintain golden evaluation datasets
  • Conduct deep-dive investigations into suspected exploitation activity

How to Stand Out

  • Be prepared to discuss your experience with automated enforcement systems and review workflows
  • Highlight your ability to analyze complex data sets and identify patterns
  • Show a deep understanding of the importance of digital safety and user well-being
  • Emphasize your strong communication skills, as partnership with various teams is crucial
  • Be ready to provide examples of your work in investigating suspected exploitation activity

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.