Senior Member of Technical Staff, Multimodal AI

Cohere·Remote(San Francisco)

Other

WFA Digital Insight

The demand for AI and machine learning specialists has surged, with a 25% increase in job postings in the last year alone. As companies like Cohere push the boundaries of multimodal AI, skilled professionals with expertise in areas like deep learning frameworks and distributed training strategies are in high demand. With the remote job market booming, candidates with a strong passion for machine learning and a knack for innovative problem-solving can thrive in roles like this. Before applying, it's essential to understand the current landscape of multimodal AI and the skills required to excel in this field.

Job Description

About the Role

As a Senior Member of Technical Staff with a focus on Multimodal AI at Cohere, you will be at the forefront of a rapidly evolving field, working on cutting-edge systems that integrate various modalities such as text, speech, and vision. Your day-to-day responsibilities will involve designing and developing robust and scalable multimodal AI systems, conducting research and experiments on advanced compute infrastructure, and collaborating closely with world-class teams to drive innovation. You will be part of a diverse and remote-friendly team that values creativity, practical problem-solving, and a passion for machine learning.

The role matters because multimodal AI has the potential to revolutionize the way we interact with technology. By joining Cohere, you will be contributing to the development of frontier models that can power magical experiences like content generation, semantic search, and agents. Your work will have a direct impact on the company's mission to scale intelligence to serve humanity.

Cohere's engineering teams are known for pushing the boundaries of what's possible, and as a Senior Member of Technical Staff, you will be expected to contribute your expertise and innovative ideas to the table. You will be working in a fast-paced, technically challenging environment where no two days are the same.

What You Will Do

Design and develop cutting-edge multimodal AI systems that integrate various modalities such as text, speech, and vision.
Conduct research and experiments on advanced compute infrastructure to explore novel ideas in multimodal representation learning, transfer learning, and more.
Collaborate closely with world-class teams, including researchers, engineers, and designers, to drive innovation and contribute to the development of frontier models.
Work on the development of large-scale multimodal models, including distributed training strategies and evaluation methodologies.
Participate in the design and implementation of novel multimodal architectures, including but not limited to autoregressive models and generative models.
Contribute to the development of efficient GPU kernels using CUDA to optimize performance for multimodal tasks.
Stay up-to-date with the latest advancements in multimodal AI and contribute to the company's research and development efforts.
Collaborate with the product team to develop and deploy multimodal AI systems that meet customer needs and drive business value.
Work on the development of evaluation methodologies to measure the performance of multimodal models and identify areas for improvement.

What We Are Looking For

Exceptional software engineering skills, with a proven track record of building robust and scalable systems.
Strong command of Python and experience with popular deep learning frameworks like JAX, PyTorch, and TensorFlow, with an understanding of their multimodal capabilities.
Knowledge of distributed training strategies, especially for large-scale multimodal models.
Familiarity with autoregressive models, particularly their application in multimodal tasks such as image or video captioning, speech-to-text generation.
Experience with multimodal representation learning, transfer learning, and other related areas.
Strong understanding of computer vision, natural language processing, and machine learning fundamentals.
Excellent problem-solving skills, with the ability to think creatively and outside the box.

Nice to Have

Publications in top-tier venues demonstrating expertise in multimodal AI research.
Experience in writing efficient GPU kernels using CUDA to optimize performance for multimodal tasks.
Familiarity with agile development methodologies and version control systems like Git.
Experience working in a remote-friendly environment and collaborating with distributed teams.

Benefits and Perks

Competitive compensation package, including salary and equity.
Opportunity to work on cutting-edge multimodal AI systems and contribute to the development of frontier models.
Collaborative and dynamic work environment with a team of world-class researchers, engineers, and designers.
Flexible working hours and remote work options to accommodate different time zones and work styles.
Access to advanced compute infrastructure and tools to support research and development efforts.
Professional development opportunities, including conferences, workshops, and training programs.
Health insurance, retirement plans, and other benefits to support your well-being and financial security.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.