Remote | Kannada-English AI Safety Red Team Evaluator — $20–$30/hour

24-MAG24-MAG·Remote(United States)
Other
Adjust

WFA Digital Insight

As demand for AI safety specialists grows, with a 27% increase in 2025, bilingual professionals are in high demand. 24-MAG's role stands out for its unique focus on Kannada-English AI safety evaluation, requiring native-level fluency and experience in red team testing. With the AI safety market expected to reach

.4 billion by 2028, this is an exciting time to join a company pushing the boundaries of AI reliability. Before applying, candidates should understand the importance of clear communication, attention to detail, and the ability to work with sensitive topics.

Job Description

About the Role

The Kannada-English AI Safety Red Team Evaluator plays a crucial role in ensuring the reliability and safety of AI systems. This part-time consulting opportunity is perfect for bilingual professionals experienced in AI safety evaluation, red team testing, and adversarial review. As a key member of the team, you will be responsible for testing AI systems, identifying safety weaknesses, and producing clear evaluation artifacts. The role is focused on supporting current and upcoming remote consulting opportunities in AI safety evaluation, bilingual red team testing, conversational model assessment, and misuse-risk review. You will work closely with the team to ensure the highest quality of project execution and contribute to the development of stronger, safer AI systems. The ideal candidate will have a strong background in AI safety, red team testing, or a related field, with native-level fluency in both English and Kannada. This is an excellent opportunity to apply your skills and experience to a dynamic and growing field.

What You Will Do

  • Test AI systems using structured adversarial scenarios to identify safety weaknesses and classify risks
  • Review English and Kannada AI outputs for safety, reliability, bias, misinformation, and harmful-behavior risks
  • Stress-test conversational AI models and agents using structured adversarial scenarios
  • Evaluate model behavior across multi-turn conversations, sensitive topics, and edge-case prompts
  • Identify vulnerabilities that require stronger safety controls, clearer refusals, or improved response quality
  • Annotate failures, classify vulnerabilities, and flag recurring safety patterns
  • Apply taxonomies, benchmarks, and project-specific playbooks to keep testing consistent
  • Assess misuse cases, bias exploitation, prompt-injection scenarios, and socio-technical risk patterns at a high level
  • Generate high-quality human evaluation data through careful review and structured judgment
  • Produce clear reports, datasets, test cases, and written summaries that support model improvement
  • Document findings reproducibly so results can be reviewed, compared, and acted upon
  • Explain risks clearly for both technical and non-technical audiences

What We Are Looking For

  • Native-level fluency in both English and Kannada
  • Prior experience in AI red teaming, adversarial testing, cybersecurity, trust and safety, socio-technical risk review, or conversational AI evaluation
  • Ability to think adversarially while staying structured, careful, and methodical
  • Experience using frameworks, benchmarks, or rubrics rather than unstructured testing alone
  • Strong written communication skills and ability to explain safety findings clearly
  • Comfort reviewing text-based content involving sensitive topics under clear guidelines
  • Adaptability across project types, safety categories, and evaluation workflows
  • Formal degree requirements may vary based on project needs
  • Backgrounds in AI safety, cybersecurity, linguistics, policy, trust and safety, social science, psychology, writing, data evaluation, or technical analysis may be highly relevant
  • Practical experience in red team testing, model evaluation, content risk analysis, or structured review work may also be valuable

Nice to Have

  • Experience with adversarial ML concepts, jailbreak datasets, prompt injection, RLHF/DPO attack patterns, or model behavior testing
  • Cybersecurity experience such as penetration testing, exploit analysis, reverse engineering, or security assessment
  • Socio-technical risk experience involving harassment, misinformation, abuse analysis, bias testing, or conversational AI safety
  • Creative probing background, including psychology, acting, writing, role-play design, or unconventional adversarial thinking
  • Experience producing reproducible reports, labeled datasets, structured risk notes, or benchmark-style evaluation artifacts

Benefits and Perks

  • Competitive hourly rate of $20-$30 per hour
  • Opportunity to work on flexible assignments and contribute to stronger, safer AI systems
  • Collaborative and dynamic work environment with a team of experienced professionals
  • Professional development opportunities in AI safety, red team testing, and related fields
  • Flexible work arrangements and remote work options
  • Access to cutting-edge tools and technologies in AI safety and evaluation
  • Recognition and rewards for outstanding performance and contributions to the team
  • Comprehensive support for ongoing education and training in AI safety and related fields

How to Stand Out

  • Develop a strong portfolio showcasing your experience in AI safety evaluation, red team testing, and adversarial review to stand out as a candidate.
  • Highlight your ability to communicate complex safety findings clearly and effectively, both in writing and verbally.
  • Be prepared to provide specific examples of your experience with structured adversarial scenarios, risk classification, and vulnerability annotation.
  • Familiarize yourself with the latest developments in AI safety, red team testing, and adversarial ML concepts to demonstrate your expertise.
  • Emphasize your ability to work with sensitive topics and adapt to different project types, safety categories, and evaluation workflows.
  • Consider obtaining certifications in AI safety, cybersecurity, or related fields to enhance your qualifications and demonstrate your commitment to the field.
  • Practice explaining technical concepts to non-technical audiences to improve your communication skills and increase your chances of success in the role.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.