Software Engineer, Compute - Storage
WFA Digital Insight
The demand for skilled engineers in cloud infrastructure has surged, with a 25% increase in job postings over the past year. As companies like Openai push the boundaries of AI research, the need for robust storage systems has become critical. With its commitment to AI safety and human-centered design, Openai stands out as a leader in the industry. To succeed in this role, candidates will need strong systems engineering skills, experience with distributed systems, and a passion for building scalable infrastructure. Before applying, consider the evolving landscape of cloud storage and the importance of collaboration in a fast-paced research environment.
Job Description
About the Role
As a Software Engineer on the Compute - Storage team at Openai, you will play a crucial role in developing and operating the storage foundation behind the company's most demanding workloads. This includes designing storage systems for rapidly evolving experiments and powering production at scale. You will work closely with the research team to build and manage the storage infrastructure, ensuring that it meets the needs of both research and production environments.The Storage Infrastructure team at Openai is responsible for building and operating the storage platform that underpins the company's research and production systems. This platform spans cloud and in-house object stores, dedicated storage hardware, and a federation layer that unifies these backends behind a simple interface. As a Software Engineer on this team, you will have the opportunity to work on deeply technical systems at scale and own them in production.
Openai's commitment to AI safety and human-centered design means that the company is dedicated to creating technology that benefits all of humanity. As a member of the Compute - Storage team, you will be part of a community that values collaboration, creativity, and a passion for building scalable infrastructure.
What You Will Do
- Build and operate storage services that underpin Openai's research infrastructure
- Develop object storage systems across cloud and in-house environments
- Design and implement systems for cross-region data movement, replication, and recovery
- Develop lifecycle management capabilities that keep data durable, available, and cost-effective
- Evolve the federation layer that unifies multiple backend systems behind a simple interface
- Improve performance, reliability, and operational excellence across the platform
- Collaborate closely with researchers and infrastructure teams to support rapidly evolving workloads
- Write strong production code, ideally in Rust or another systems-oriented language
- Work with Kubernetes-based systems to deploy and manage storage services
- Use tools such as Terraform, Grafana, or similar infrastructure and observability tooling to manage and monitor storage systems
What We Are Looking For
- Experience building or operating distributed systems in production
- Experience working on storage infrastructure, object stores, distributed filesystems, or other data-intensive backend systems
- Strong systems engineering skills, including experience with systems programming languages such as Rust or C++
- Experience with Kubernetes-based systems and containerization
- Experience with tools such as Terraform, Grafana, or similar infrastructure and observability tooling
- Strong understanding of cloud computing concepts, including cloud storage, networking, and security
- Experience working in a collaborative environment, with a strong focus on teamwork and communication
- Strong problem-solving skills, with the ability to debug complex systems issues
- Experience with Agile development methodologies and version control systems such as Git
Nice to Have
- Experience working with machine learning or AI workloads
- Experience with data-intensive applications, such as data analytics or scientific computing
- Experience working in a research environment, with a focus on rapid prototyping and experimentation
- Experience with security and compliance in cloud computing environments
- Experience working with cross-functional teams, including research, engineering, and product management
Benefits and Perks
- Competitive salary and benefits package
- Opportunity to work on cutting-edge AI research and development
- Collaborative and dynamic work environment, with a strong focus on teamwork and communication
- Flexible working hours and remote work options
- Professional development opportunities, including training and education programs
- Access to the latest technologies and tools, including cloud computing platforms and machine learning frameworks
- Comprehensive health insurance and wellness programs
- Retirement savings plan and stock options
- Paid time off and holiday leave
- Access to a diverse and inclusive community, with a strong focus on AI safety and human-centered design
How to Stand Out
- To stand out in your application, be sure to highlight your experience working with distributed systems, cloud storage, and containerization.
- Make sure your resume and cover letter are tailored to the specific requirements of the role, and that you have a strong understanding of the company's technology stack.
- In your interview, be prepared to discuss your experience working with Kubernetes, Terraform, and other infrastructure and observability tooling.
- Be ready to provide examples of your problem-solving skills, including times when you had to debug complex systems issues.
- Consider creating a personal project or contributing to an open-source project to demonstrate your skills and passion for building scalable infrastructure.
- Be prepared to discuss your understanding of AI safety and human-centered design, and how you think these concepts relate to your work as a software engineer.
- Don't be afraid to ask questions during the interview process, including questions about the company culture, team dynamics, and opportunities for growth and development.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.