CPU/Storage/PoP-WAN Program Manager
WFA Digital Insight
The demand for skilled infrastructure program managers has skyrocketed, with a 25% increase in job postings over the past year. As companies like Openai continue to push the boundaries of AI, the need for experts who can manage complex infrastructure deployments has become critical. With the rise of remote work, companies are looking for candidates who can lead cross-functional teams and drive results in a fast-paced environment. Openai stands out for its innovative approach to AI and its commitment to hiring top talent. Before applying, candidates should be aware that this role requires strong technical fluency and experience in managing large-scale infrastructure systems.
Job Description
About the Role
The CPU/Storage/PoP-WAN Program Manager role at Openai is a critical position that requires a high level of technical expertise and program management skills. As a key member of the Infrastructure organization, you will be responsible for leading the execution of complex infrastructure programs that enable Openai to scale its AI capabilities. This role involves managing cross-functional teams, driving infrastructure deployment, and ensuring that all components are working together seamlessly.The Infrastructure organization at Openai is responsible for building the systems that power the company's AI workloads at a global scale. As the demand for compute capacity continues to accelerate, the ability to rapidly convert infrastructure investments into usable production capacity has become mission-critical. The CPU/Storage/PoP/WAN team plays a vital role in this effort, and the Program Manager will be responsible for ensuring that all aspects of the infrastructure are working together to support the company's goals.
The successful candidate will have a strong technical background and experience in managing large-scale infrastructure systems. They will be able to communicate effectively with both technical and non-technical stakeholders, and drive results in a fast-paced environment.
What You Will Do
- Lead the execution of CPU and GPU cluster activation programs across Openai's global infrastructure footprint
- Drive the readiness to convert contracted compute capacity into schedulable production clusters
- Own the deployment programs for new Points of Presence (PoPs), backbone nodes, WAN expansion, and interconnection initiatives
- Build integrated schedules spanning procurement, logistics, installation, storage readiness, network turn-up, testing, and production handoff
- Coordinate Bill of Materials (BOM) readiness, server delivery, racks, optics, cabling, storage hardware, and vendor milestones
- Partner with engineering teams to align compute, storage, and networking dependencies before cluster activation
- Manage the deployment of storage systems supporting training and inference workloads, including readiness, validation, performance checks, and scaling plans
- Coordinate backbone capacity expansion, cross-connects, inter-region pathing, and cloud interconnect readiness with Azure and third-party providers
- Lead physical deployment execution, including rack-and-stack, hardware bring-up, L1 validation, and site acceptance criteria
- Build repeatable deployment playbooks, dashboards, governance cadences, and operating mechanisms for scale
- Identify risks early across supply chain, site readiness, technical constraints, and vendor execution, and drive mitigation plans
- Communicate milestones, escalations, and capacity forecasts to senior leadership
What We Are Looking For
- 8+ years of experience in technical program management, infrastructure deployment, network deployment, or data center operations
- Strong experience delivering programs involving compute, storage, networking, or large-scale infrastructure systems
- Working knowledge of servers, clusters, storage arrays, routers, switches, optics, and structured cabling
- Experience owning cross-functional programs across engineering, operations, supply chain, and external vendors
- Strong understanding of deployment lifecycles from planning and procurement through production handoff
- Ability to reason across physical infrastructure execution and logical systems architecture dependencies
- Proven ability to build integrated schedules and drive accountability across multiple stakeholders
- Strong executive communication skills with experience managing critical escalations and leadership updates
- Comfortable operating in fast-moving environments with changing priorities
Nice to Have
- Experience with cloud-based infrastructure and services, such as Azure or AWS
- Knowledge of containerization technologies, such as Docker or Kubernetes
- Familiarity with agile development methodologies and version control systems, such as Git
- Certification in program management, such as PMP or Scrum Master
Benefits and Perks
- Competitive salary and equity package
- Comprehensive health and wellness benefits, including medical, dental, and vision
- Flexible paid time off and holidays
- Remote work stipend and equipment allowance
- Opportunities for professional growth and development, including training and education programs
- Access to cutting-edge technologies and innovative projects
- Collaborative and dynamic work environment with a team of experienced professionals
How to Stand Out
- Develop a strong understanding of cloud-based infrastructure and services, such as Azure or AWS, to stay competitive in the job market.
- Highlight your experience with containerization technologies, such as Docker or Kubernetes, to demonstrate your ability to work with modern infrastructure systems.
- Be prepared to discuss your experience with agile development methodologies and version control systems, such as Git, to show your ability to work in a fast-paced environment.
- Emphasize your strong executive communication skills and experience managing critical escalations and leadership updates to demonstrate your ability to drive results.
- Research Openai's company culture and values to show your enthusiasm for the role and the company, and to demonstrate your ability to work in a collaborative and dynamic environment.
- Be ready to provide specific examples of your experience managing large-scale infrastructure systems and driving cross-functional programs to success.
- Consider obtaining certification in program management, such as PMP or Scrum Master, to demonstrate your expertise and commitment to the field.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.