Remote | Telugu-English AI Safety Red Team Evaluator — $20–$30/hour

name·Remote(United States)

Other

Adjust

WFA Digital Insight

The demand for AI safety specialists has surged in recent years, with a 27% increase in job postings in 2025 alone. As companies prioritize AI systems' reliability and safety, bilingual professionals with expertise in AI safety evaluation are in high demand. This role stands out for its focus on Telugu-English bilingual testing, allowing candidates to apply their language skills to drive safer AI systems. With the remote job market expanding, candidates should be prepared to showcase their adaptability, attention to detail, and ability to work independently.

Job Description

About the Role

The Telugu-English AI Safety Red Team Evaluator role is a part-time consulting opportunity that supports remote projects focused on AI safety evaluation, bilingual red team testing, and conversational model assessment. As a key member of the team, you will contribute to the development of stronger, safer, and more reliable AI systems through careful adversarial testing. Your expertise in Telugu and English will enable you to test AI systems using structured adversarial scenarios, identify safety weaknesses, classify risks, and produce clear English-language evaluation artifacts.

Day-to-day, you will work on reviewing English and Telugu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks. You will stress-test conversational AI models and agents using structured adversarial scenarios and evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts. Your work will have a direct impact on the safety and reliability of AI systems, making this a highly rewarding role for those passionate about AI safety.

The team you will be working with is dedicated to delivering high-quality project execution, and your contributions will be essential to the success of current and upcoming remote consulting opportunities.

What You Will Do

Review English and Telugu AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
Stress-test conversational AI models and agents using structured adversarial scenarios
Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality
Annotate failures, classify vulnerabilities, and flag recurring safety patterns
Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
Generate high-quality human evaluation data through careful review and structured judgment
Produce clear reports, datasets, test cases, and written summaries that support model improvement
Document findings reproducibly so results can be reviewed, compared, and acted upon
Explain risks clearly for both technical and non-technical audiences
Maintain accuracy, consistency, and strong attention to detail across submitted evaluations

What We Are Looking For

Native-level fluency in both English and Telugu
Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
Ability to think adversarially while staying structured, careful, and methodical
Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
Strong written communication skills and ability to explain safety findings clearly
Comfort reviewing text-based content involving sensitive topics under clear guidelines
Adaptability across project types, safety categories, and evaluation workflows
Formal degree requirements may vary based on project needs
Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable

Nice to Have

Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts

Benefits and Perks

Competitive hourly rate of $20-$30
Flexible, part-time consulting opportunity with remote work arrangements
Opportunity to work on diverse projects and contribute to the development of safer AI systems
Collaborative team environment with experienced professionals in AI safety and red team testing
Professional development opportunities, including training and support for ongoing education and skill development
Access to cutting-edge technologies and tools in the field of AI safety and red team testing
Recognition and rewards for outstanding performance and contributions to the team
Comprehensive benefits package, including health insurance, retirement plan, and paid time off

How to Stand Out

Develop a strong understanding of AI safety concepts, including adversarial testing, red teaming, and socio-technical risk review
Showcase your ability to think creatively and develop innovative testing scenarios
Highlight your experience with frameworks, benchmarks, or rubrics, and demonstrate how you have applied these in previous roles
Prepare to discuss your approach to documenting findings and explaining risks to both technical and non-technical audiences
Be prepared to provide examples of your work, including reports, datasets, or written summaries, to demonstrate your skills and experience
Research the company and the role to understand the specific requirements and challenges, and be prepared to ask informed questions during the interview process
Consider developing a personal project or contributing to open-source initiatives to demonstrate your skills and passion for AI safety and red team testing

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.