Staff Software Engineer, Infrastructure
WFA Digital Insight
As the demand for cloud infrastructure specialists grows, with a 27% increase in job postings in the last year, Docker is at the forefront of this trend. With over 20 million monthly users, the company is investing heavily in its platform, and this role is crucial to its success. To thrive in this position, candidates need to have strong technical skills, particularly in digital infrastructure and cloud computing. The ideal candidate will have experience in designing and implementing scalable systems, with a focus on security, reliability, and performance. As the remote job market continues to evolve, companies like Docker are looking for professionals who can work independently and collaboratively in a distributed team environment.
Job Description
About the Role
Docker is seeking a highly experienced Staff Software Engineer to join its Infrastructure team. As a key member of this team, you will be responsible for designing, building, and maintaining the company's internal platform, which supports hundreds of engineers across multiple development teams. The platform carries high-scale production traffic and data transfer every day, and it's essential that it's reliable, secure, and scalable. The top priority for this role is to move the platform from expert-driven support to self-service systems with clear ownership, safe defaults, and strong guardrails. This will enable development teams to focus on their own products instead of the platform, and it will require building a multi-region, cross-account network architecture and a testing and continuous-deployment flow that teams can trust. You will be joining a team of four, growing to seven this year, and you will be expected to set technical direction and lead the team through real production adoption.What You Will Do
- Take ambiguous infrastructure problems and turn them into proposals that the organization can rally around
- Design self-service capabilities and platform APIs for onboarding, provisioning, deployment, observability defaults, and day-2 operations
- Set delivery standards using Terraform, GitOps with Argo CD, progressive rollout, and good testing
- Evolve the multi-tenant EKS foundations toward better reliability, security, scale, and cost
- Improve SLOs, alerting, and incident follow-up on Grafana Cloud to make production safer and less dependent on heroics
- Collaborate with development teams to ensure the platform meets their needs and is adopted widely
- Develop and maintain documentation and training materials for the platform
- Participate in on-call rotations and provide technical support for the platform
- Stay up-to-date with industry trends and emerging technologies in cloud infrastructure and apply that knowledge to improve the platform
What We Are Looking For
- 8+ years of experience in software engineering, with a focus on cloud infrastructure and platform development
- Strong proficiency in Go, Terraform, and GitOps
- Experience with Kubernetes, EKS, and containerization
- Strong understanding of network architecture, security, and reliability
- Experience with continuous integration and continuous deployment (CI/CD) pipelines
- Strong collaboration and communication skills, with the ability to work with remote teams
- Experience with agile development methodologies and version control systems such as Git
- Strong problem-solving skills, with the ability to analyze complex problems and develop creative solutions
Nice to Have
- Experience with AI-assisted and agentic workflows
- Knowledge of Envoy Gateway ingress, traffic routing, and multi-region, cross-account connectivity
- Experience with Grafana Cloud and monitoring systems
- Familiarity with Docker products, such as Docker Desktop, Docker Hub, and Docker Scout
Benefits and Perks
- Competitive salary and equity package
- Comprehensive health, dental, and vision benefits
- Flexible working hours and remote work arrangements
- Generous paid time off and holidays
- Professional development opportunities, including training and conference attendance
- Access to the latest technologies and tools
- Collaborative and dynamic work environment with a team of experienced professionals
How to Stand Out
- To stand out in this role, make sure you have a strong understanding of cloud infrastructure and platform development, as well as experience with Go, Terraform, and GitOps.
- Emphasize your ability to collaborate with remote teams and communicate complex technical concepts to non-technical stakeholders.
- Be prepared to provide examples of your experience with continuous integration and continuous deployment (CI/CD) pipelines, as well as your knowledge of network architecture, security, and reliability.
- Show your passion for staying up-to-date with industry trends and emerging technologies in cloud infrastructure, and be prepared to discuss how you apply that knowledge to improve the platform.
- Consider creating a portfolio that showcases your experience with cloud infrastructure and platform development, and be prepared to walk the interviewer through your design decisions and technical choices.
- Don't be afraid to ask questions about the company culture, the team you'll be working with, and the opportunities for professional development and growth.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.