Inference Technical Lead, Sora
WFA Digital Insight
Demand for AI and machine learning specialists continues to surge, with reported growth of 30% in the past year, and technical roles like the Inference Technical Lead are increasingly sought after. As hybrid research and product teams become more common, professionals with expertise in model performance optimization and low-level systems are in especially high demand. OpenAI, a pioneer in AI research and deployment, offers the opportunity to work on cutting-edge projects that can benefit humanity. Before applying, candidates should be aware of the company's commitment to safety and human-centric AI development, as well as its hybrid work model. For candidates with the right skills and experience, this role is a chance to make a real impact in the tech industry.
Job Description
About the Role
The Inference Technical Lead role at OpenAI is a critical position that requires a deep understanding of model serving efficiency and inference performance. As a key member of the Sora team, you will drive initiatives to optimize inference performance and scalability, ensuring that the team's broader goals rest on a strong technical foundation. Sora is a hybrid research and product team focused on integrating multimodal functionalities into OpenAI's AI products, with the mission of creating reliable, user-friendly, and socially beneficial technologies.

The role involves close collaboration with research and product teams to ensure that models perform effectively at scale. You will also design, build, and improve critical serving infrastructure to support the growth and reliability needs of the Sora team. Given the company's commitment to pushing the boundaries of AI capabilities, this role offers a unique opportunity to work on high-impact projects that can drive real-world change.
As a technical leader, you will be expected to navigate ambiguity, set technical direction, and drive complex initiatives to completion. Your expertise in model performance optimization, kernel-level systems, and low-level performance tuning will be essential in optimizing inference performance and scalability. You will also be engaged in model design, working closely with researchers to develop inference-friendly models that can be deployed effectively.
What You Will Do
- Lead engineering efforts focused on improving model serving, inference performance, and system efficiency
- Drive optimizations from a kernel and data movement perspective to improve system throughput and reliability
- Partner closely with research and product teams to ensure our models perform effectively at scale
- Design, build, and improve critical serving infrastructure to support Sora's growth and reliability needs
- Collaborate with cross-functional teams to identify and prioritize technical initiatives
- Develop and maintain technical documentation to support the development and deployment of models
- Stay up-to-date with the latest advancements in AI and machine learning, applying this knowledge to optimize model performance
- Work closely with the research team to develop and deploy new models and technologies
- Participate in code reviews and contribute to the development of best practices for model development and deployment
- Troubleshoot and resolve technical issues in close partnership with the engineering team
What We Are Looking For
- Deep expertise in model performance optimization, particularly at the inference layer
- Strong background in kernel-level systems, data movement, and low-level performance tuning
- Experience with GPU inference engineering and model serving
- Strong understanding of computer architecture and operating systems
- Experience with cloud-based infrastructure and containerization
- Strong programming skills in languages such as C++, Python, or similar
- Experience with Agile development methodologies and version control systems
- Strong communication and collaboration skills, with experience working with cross-functional teams
- Experience with technical leadership and mentoring, with a strong track record of driving technical initiatives to completion
Nice to Have
- Experience with machine learning frameworks such as TensorFlow or PyTorch
- Knowledge of computer vision and natural language processing
- Experience with DevOps and continuous integration/continuous deployment (CI/CD) pipelines
- Familiarity with OpenAI's products and technologies
- Experience working in a hybrid research and product team environment
Benefits and Perks
- Competitive salary and equity package
- Comprehensive health, dental, and vision insurance
- Flexible PTO and sick leave policy
- Opportunity to work on high-impact projects that can drive real-world change
- Collaborative and dynamic work environment with a team of experienced professionals
- Access to cutting-edge technologies and tools
- Professional development opportunities, including training and conference sponsorships
- Remote work stipend and equipment allowance
- Access to OpenAI's products and services, including early access to new technologies
- Opportunities for career growth and advancement within the company
How to Stand Out
- Develop a strong portfolio that showcases your expertise in model performance optimization and low-level systems, highlighting specific projects and achievements.
- Brush up on your knowledge of cloud-based infrastructure and containerization, as well as machine learning frameworks such as TensorFlow or PyTorch.
- Prepare to talk about your experience with technical leadership and mentoring, highlighting your ability to drive complex initiatives to completion.
- Be prepared to discuss your understanding of computer architecture and operating systems, as well as your experience with GPU inference engineering and model serving.
- Don't be afraid to ask about the company culture and values, as well as the opportunities for professional development and growth within the company.
- Research the company's products and technologies, and be prepared to discuss how your skills and experience align with the company's mission and goals.
- Consider highlighting your experience with Agile development methodologies and version control systems, as well as your ability to work effectively in a hybrid research and product team environment.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.