Member of Technical Staff, Post-Training

Cohere · Remote (London)

WFA Digital Insight

Demand for AI and machine learning specialists continues to soar, with a growth rate of over 30% in the past year, and companies like Cohere are at the forefront of innovation. This role stands out for its blend of research and production, offering the chance to work with cutting-edge models and technologies. As the remote work landscape evolves, professionals with strong technical skills, particularly in Python and related ML frameworks, are in high demand. Before applying, candidates should be prepared to showcase their expertise in software engineering, distributed training infrastructure, and large-scale model training. Cohere's commitment to diversity and inclusion is also notable, making this an attractive opportunity for those who value a collaborative and innovative work environment.

Job Description

About the Role

As a Member of Technical Staff at Cohere, you will be part of a team that is pushing the boundaries of artificial intelligence. Your day-to-day responsibilities will involve designing and writing high-performance, scalable software for training models, consistently post-training models to reach state-of-the-art performance, and coordinating with other specialist teams to produce models with strong all-around performance. This role is crucial in advancing the state of the art for model post-training and bridging the gap between research and production.

The team at Cohere is passionate about their craft, with each member being one of the best in the world at what they do. You will have the opportunity to learn from and work with the best researchers in the field, in an environment that values diversity and inclusion. With offices in several locations but also embracing remote work, this role offers flexibility and the chance to contribute to a mission that aims to scale intelligence to serve humanity.

Cohere's mission is to make AI accessible and useful for everyone, and as a Member of Technical Staff, you will play a key role in achieving this goal. Your contributions will directly impact the development of AI systems that power magical experiences like content generation, semantic search, and agents.

What You Will Do

  • Design and write high-performance, scalable software for training models.
  • Consistently post-train models to reach state-of-the-art performance.
  • Coordinate with other specialist teams to produce models that have strong all-around performance.
  • Craft and implement techniques to improve the performance and results of our training cycles.
  • Research, implement, and experiment with ideas on our supercompute and data infrastructure.
  • Learn from and work with the best researchers in the field.
  • Participate in the development of large-scale distributed training strategies.
  • Contribute to the advancement of model training methodologies.
  • Collaborate with cross-functional teams to ensure the successful deployment of models.
  • Stay updated with the latest advancements in AI and machine learning, applying this knowledge to improve model performance.

What We Are Looking For

  • Extremely strong software engineering skills.
  • Proficiency in Python and related ML frameworks such as JAX, PyTorch, and XLA/MLIR.
  • Experience with distributed training infrastructures and associated frameworks.
  • Hands-on experience with large-scale distributed training strategies.
  • Strong understanding of machine learning principles and practices.
  • Experience with the post-training phase of model training, with a strong emphasis on performance optimization.
  • Excellent problem-solving skills and the ability to work in a fast-paced environment.
  • Strong communication skills, with the ability to collaborate effectively with a team.
  • A strong passion for AI and machine learning, with a desire to contribute to cutting-edge research and development.

Nice to Have

  • Experience publishing papers at top-tier venues in the field of AI and machine learning.
  • Knowledge of Kubernetes, Slurm, and other cluster orchestration and scheduling tools used for distributed training.
  • Experience with model deployment and maintenance.
  • Familiarity with agile development methodologies.
  • Participation in open-source projects related to AI and machine learning.

Benefits and Perks

  • The opportunity to work on cutting-edge AI research and development.
  • Collaborative and dynamic work environment with highly skilled professionals.
  • Flexible working hours and remote work options.
  • Access to the latest technologies and tools in AI and machine learning.
  • Professional development opportunities, including conferences and workshops.
  • Comprehensive health and dental benefits.
  • Weekly lunch stipend and in-office lunches and snacks.
  • Full parental leave top-up for up to 6 months.
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement.

How to Stand Out

  • Tip: Showcase your proficiency in Python and related ML frameworks by including examples of personal projects or contributions to open-source repositories.
  • Tip: Highlight your experience with distributed training infrastructures and large-scale model training, emphasizing any achievements in performance optimization.
  • Tip: Prepare to discuss your understanding of the latest advancements in AI and machine learning, and how you see yourself contributing to Cohere's mission.
  • Tip: Emphasize your problem-solving skills and ability to work in a fast-paced environment, providing examples from previous experiences.
  • Tip: Demonstrate your passion for AI and machine learning by discussing recent papers or projects you've worked on, and how they relate to Cohere's work.
  • Tip: Be ready to talk about your experience with collaboration tools and agile development methodologies, as these are valued in Cohere's work environment.
  • Tip: If you have published papers or presented at top-tier venues, be sure to mention it, as this is considered a plus.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.