Remote | Telugu-English AI Safety Red Team Evaluator — $20–$30/hour

Remote (United States)

WFA Digital Insight

Demand for AI safety specialists has surged in recent years, with a 27% increase in job postings in 2025 alone. As companies prioritize the reliability and safety of AI systems, bilingual professionals with AI safety evaluation expertise are in high demand. This role stands out for its focus on Telugu-English bilingual testing, which lets candidates apply their language skills to making AI systems safer. As the remote job market expands, candidates should be prepared to demonstrate adaptability, attention to detail, and the ability to work independently.

Job Description

About the Role

The Telugu-English AI Safety Red Team Evaluator role is a part-time consulting opportunity that supports remote projects focused on AI safety evaluation, bilingual red team testing, and conversational model assessment. As a key member of the team, you will contribute to the development of stronger, safer, and more reliable AI systems through careful adversarial testing. Your expertise in Telugu and English will enable you to test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts.

Day to day, you will review English and Telugu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks. You will stress-test conversational AI models and agents using structured adversarial scenarios, and evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts. Your work will directly affect the safety and reliability of AI systems, which makes this a rewarding role for anyone passionate about AI safety.

The team you will work with is focused on high-quality project execution, and your contributions will be essential to the success of current and upcoming remote consulting opportunities.

What You Will Do

  • Review English and Telugu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
  • Stress-test conversational AI models and agents using structured adversarial scenarios
  • Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
  • Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality
  • Annotate failures, classify vulnerabilities, and flag recurring safety patterns
  • Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
  • Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
  • Generate high-quality human evaluation data through careful review and structured judgment
  • Produce clear reports, datasets, test cases, and written summaries that support model improvement
  • Document findings reproducibly so results can be reviewed, compared, and acted upon
  • Explain risks clearly for both technical and non-technical audiences
  • Maintain accuracy, consistency, and strong attention to detail across submitted evaluations
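
For candidates wondering what "annotating failures", "classifying vulnerabilities", and "documenting findings reproducibly" might look like in practice, the following is a minimal illustrative sketch in Python. It is not the project's actual tooling or schema: the category names, the 1 to 4 severity scale, and every field and function name (`Finding`, `recurring_categories`, `to_jsonl`) are assumptions made up for this example. Real projects supply their own taxonomies and playbooks.

```python
import json
from collections import Counter
from dataclasses import asdict, dataclass

# Hypothetical taxonomy for illustration only; a real project defines its own.
RISK_CATEGORIES = {"bias", "misinformation", "harmful_behavior", "prompt_injection"}


@dataclass(frozen=True)
class Finding:
    """One annotated failure observed during a red team session."""
    case_id: str    # stable identifier so the case can be re-run and compared
    language: str   # "te" for Telugu, "en" for English
    category: str   # must be one of RISK_CATEGORIES
    severity: int   # assumed scale: 1 (low) to 4 (critical)
    turns: int      # number of conversation turns needed to trigger the failure
    summary: str    # English-language description of what went wrong

    def __post_init__(self):
        # Reject labels outside the agreed taxonomy so annotations stay consistent.
        if self.category not in RISK_CATEGORIES:
            raise ValueError(f"unknown category: {self.category}")
        if not 1 <= self.severity <= 4:
            raise ValueError("severity must be between 1 and 4")


def recurring_categories(findings, min_count=2):
    """Return categories that appear at least min_count times, sorted by name."""
    counts = Counter(f.category for f in findings)
    return sorted(c for c, n in counts.items() if n >= min_count)


def to_jsonl(findings):
    """Serialize findings as JSON Lines with sorted keys for stable diffs."""
    return "\n".join(
        json.dumps(asdict(f), ensure_ascii=False, sort_keys=True) for f in findings
    )


# Made-up example records, not real evaluation data.
example_findings = [
    Finding("case-001", "te", "bias", 2, 3, "Stereotyped answer to a Telugu prompt."),
    Finding("case-002", "en", "bias", 3, 1, "Unequal treatment across two groups."),
    Finding("case-003", "te", "prompt_injection", 4, 5, "Embedded instruction was followed."),
]
```

Validating each label at creation time and writing records with sorted keys are two simple ways to keep evaluations consistent and easy to compare across reviewers; other designs would work equally well.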

What We Are Looking For

  • Native-level fluency in both English and Telugu
  • Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
  • Ability to think adversarially while staying structured, careful, and methodical
  • Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
  • Strong written communication skills and ability to explain safety findings clearly
  • Comfort reviewing text-based content involving sensitive topics under clear guidelines
  • Adaptability across project types, safety categories, and evaluation workflows
  • Formal degree requirements may vary based on project needs
  • Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
  • Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable

Nice to Have

  • Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
  • Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
  • Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
  • Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
  • Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts

Benefits and Perks

  • Competitive rate of $20–$30 per hour
  • Flexible, part-time consulting opportunity with remote work arrangements
  • Opportunity to work on diverse projects and contribute to the development of safer AI systems
  • Collaborative team environment with experienced professionals in AI safety and red team testing
  • Professional development opportunities, including training and support for ongoing education and skill development
  • Access to cutting-edge technologies and tools in the field of AI safety and red team testing
  • Recognition and rewards for outstanding performance and contributions to the team
  • Comprehensive benefits package, including health insurance, retirement plan, and paid time off

How to Stand Out

  • Develop a strong understanding of AI safety concepts, including adversarial testing, red teaming, and socio-technical risk review
  • Showcase your ability to think creatively and develop innovative testing scenarios
  • Highlight your experience with frameworks, benchmarks, or rubrics, and demonstrate how you have applied these in previous roles
  • Prepare to discuss your approach to documenting findings and explaining risks to both technical and non-technical audiences
  • Be prepared to provide examples of your work, including reports, datasets, or written summaries, to demonstrate your skills and experience
  • Research the company and the role to understand the specific requirements and challenges, and be prepared to ask informed questions during the interview process
  • Consider developing a personal project or contributing to open-source initiatives to demonstrate your skills and passion for AI safety and red team testing

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.