Senior Infra Engineer: Baremetal Orchestration
WFA Digital Insight
As remote work continues to shape the digital landscape, demand for skilled infrastructure engineers has skyrocketed. With a 25% increase in cloud infrastructure spending in 2025, companies like Railway are at the forefront of innovation. This Senior Infra Engineer role stands out for its focus on baremetal orchestration and internal tooling, requiring a unique blend of technical expertise and problem-solving skills. Candidates should be prepared to showcase their experience with distributed systems and infrastructure management, as well as their ability to communicate complex ideas effectively.
Job Description
About the Role
The Senior Infra Engineer plays a critical role in developing and maintaining Railway's infrastructure, ensuring seamless deployment and management of applications. As a key member of the team, you will be responsible for designing and implementing scalable, resilient, and efficient infrastructure solutions. Your work will have a direct impact on the company's culture, trajectory, and outcome, making this a high-impact and high-agency role.The role is more internal-facing, focusing on building the platform that Railway engineers run on. You will collaborate closely with the engineering team to ensure that the infrastructure meets their needs and is aligned with the company's goals. With a strong emphasis on innovation and experimentation, you will have the opportunity to explore new technologies and approaches to infrastructure management.
Railway is committed to fostering a culture of innovation, collaboration, and continuous learning. As a distributed team, we value diversity, inclusivity, and open communication. Our team is passionate about what we do, and we're looking for like-minded individuals who share our enthusiasm for infrastructure engineering and digital innovation.
What You Will Do
- Build and maintain Railway's host provisioning stack, including PXE boot, Ansible, and burn-in agents
- Develop and evolve the homegrown orchestration engine to manage clusters, containers, and VMs
- Optimize the efficiency of the bin packing algorithm to maximize utilization and minimize costs
- Design and maintain internal tooling for engineers to interact with the fleet
- Develop internal observability and alerting systems to catch fleet problems before they affect customers
- Create CI pipelines to ship infrastructure code safely
- Define infrastructure that can be torn down, failed over, and reconstituted from scratch using Terraform and Ansible
- Build Golang/Rust GRPC services from scratch to support millions of users
- Write Engineering Requirement Documents to take ideas from concept to implementation
What We Are Looking For
- Strong understanding of distributed systems and infrastructure management
- Hands-on experience with bare metal provisioning, configuration management, and hardware production-ready
- Comfort building and operating internal tools
- Solid intuition about the longevity of solutions
- Tact to implement solutions, create monitors, and document requirements
- Great sense of direction and prioritization in an early-stage startup environment
- Sense of grit to dive into problems, implement solutions, and replace them when needed
- Excellent communication skills
Nice to Have
- Experience with Terraform and Ansible
- Knowledge of Golang and Rust
- Familiarity with GRPC services
Benefits and Perks
- Competitive compensation package
- Opportunities for professional growth and development
- Collaborative and dynamic work environment
- Flexible working hours and remote work options
- Access to cutting-edge technologies and tools
- Health and wellness programs
- Paid time off and holidays
About Railway
Railway is a dynamic and innovative company that values diversity, inclusivity, and open communication. We're committed to creating a work environment that is supportive, collaborative, and stimulating. Our team is passionate about digital innovation, and we're looking for talented individuals who share our enthusiasm for infrastructure engineering and digital innovation.How to Stand Out
- Showcase your experience with distributed systems and infrastructure management by providing specific examples of successful projects you've led or contributed to.
- Develop a strong understanding of Railway's technology stack and be prepared to discuss how you can contribute to its growth and development.
- Highlight your ability to communicate complex technical ideas effectively, both in writing and through presentations.
- Prepare to discuss your approach to problem-solving and how you handle complex infrastructure issues.
- Be ready to showcase your skills in areas such as Terraform, Ansible, Golang, and Rust, and explain how you stay up-to-date with the latest developments in these technologies.
- Demonstrate your passion for infrastructure engineering and digital innovation, and explain how you see yourself contributing to Railway's mission and values.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.