Remote | Marathi-English AI Safety Red Team Evaluator — $20–$30/hour
WFA Digital Insight
Demand for AI safety specialists is growing, and bilingual professionals are especially sought after. With a 25% increase in AI-related job postings in 2025, this role stands out for its focus on Marathi-English language support. To succeed, candidates need strong AI safety evaluation skills, attention to detail, and the ability to work independently. Before applying, consider your experience with red team testing, conversational AI models, and socio-technical risk review.
Job Description
About the Role
The Marathi-English AI Safety Red Team Evaluator role is a unique opportunity for bilingual professionals to apply their skills in AI safety evaluation and red team testing. As a key member of the team, you will be responsible for testing AI systems, identifying safety weaknesses, and producing high-quality evaluation artifacts. Your work will support the development of safer, more reliable AI systems.
Day-to-day, you will work on structured adversarial scenarios to stress-test conversational AI models and agents. Your attention to detail and ability to think adversarially will be crucial in identifying vulnerabilities and classifying risks. You will also collaborate with the team to document findings, produce clear reports, and maintain accuracy and consistency across evaluations.
The role is part-time and flexible, with opportunities to work on a range of projects focused on AI safety evaluation, bilingual red team testing, and conversational model assessment. You will have the chance to apply your skills to real-world problems, working with a team of experienced professionals who are passionate about AI safety.
What You Will Do
- Test AI systems using structured adversarial scenarios to identify safety weaknesses and classify risks
- Review English and Marathi AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
- Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
- Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality
- Annotate failures, classify vulnerabilities, and flag recurring safety patterns
- Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
- Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns
- Generate high-quality human evaluation data through careful review and structured judgment
- Produce clear reports, datasets, test cases, and written summaries that support model improvement
- Document findings reproducibly so results can be reviewed, compared, and acted upon
- Explain risks clearly for both technical and non-technical audiences
What We Are Looking For
- Native-level fluency in both English and Marathi
- Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, or conversational AI evaluation
- Ability to think adversarially while staying structured, careful, and methodical
- Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
- Strong written communication skills and ability to explain safety findings clearly
- Comfort reviewing text-based content involving sensitive topics under clear guidelines
- Adaptability across project types, safety categories, and evaluation workflows
- Formal degree in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis
- Practical experience in red team testing, model evaluation, content risk analysis, or structured review work
Nice to Have
- Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
- Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
- Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
- Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
Benefits and Perks
- Competitive hourly rate of $20–$30
- Flexible, part-time work arrangement
- Opportunity to work on a range of projects focused on AI safety evaluation and bilingual red team testing
- Collaborative team environment with experienced professionals
- Professional development opportunities in AI safety and red team testing
- Access to cutting-edge tools and technologies
- Remote work stipend and equipment allowance
- Health and wellness benefits, including mental health support
- Generous PTO and holiday policy
How to Stand Out
- Develop a strong understanding of AI safety evaluation and red team testing principles.
- Showcase your experience with conversational AI models and socio-technical risk review in your portfolio or resume.
- Be prepared to discuss your approach to adversarial testing and vulnerability classification during the interview process.
- Highlight your ability to communicate complex safety findings clearly and effectively to both technical and non-technical audiences.
- Research the company's approach to AI safety and red team testing to demonstrate your knowledge and enthusiasm for the role.
- Consider taking courses or attending webinars on AI safety, cybersecurity, and conversational AI to enhance your skills and stay up-to-date with industry developments.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.