Site Reliability Engineer, Environment Automation

GitlabGitlab·Remote(Remote, India)
Software Development
Adjust

WFA Digital Insight

The demand for skilled site reliability engineers is on the rise, with a 25% increase in job postings over the past year. As companies like Gitlab continue to grow and expand their remote teams, the need for experts who can ensure the reliability and scalability of their systems has never been more pressing. With the right combination of technical skills and experience, candidates can stand out in a crowded job market and take their careers to the next level. Before applying, candidates should be aware of the unique challenges and opportunities that come with working in a remote environment and be prepared to demonstrate their ability to work independently and collaboratively as part of a distributed team.

Job Description

About the Role

The Site Reliability Engineer, Environment Automation role at Gitlab is a unique opportunity to join a dedicated team of experts who are passionate about developing automation for GitLab environments. As a Site Reliability Engineer, you will be responsible for designing, deploying, and maintaining GitLab environments across cloud providers using infrastructure as code, deployment packages, and Kubernetes. You will work closely with senior SREs to solve the unique challenges of managing many tenant environments in parallel, each with its own constraints and integration points.

Day-to-day, you will be focused on contributing to the design and evolution of infrastructure automation using Terraform, Ansible, and Kubernetes to provision, upgrade, and operate many GitLab environments with minimal manual effort. You will also help debug and resolve production issues across Kubernetes clusters, GitLab components, and cloud services, and assist in building automation and safeguards that prevent similar issues from recurring.

What You Will Do

  • Contribute to the design and evolution of infrastructure automation using Terraform, Ansible, and Kubernetes to provision, upgrade, and operate many GitLab environments with minimal manual effort
  • Help debug and resolve production issues across Kubernetes clusters, GitLab components, and cloud services, then assist in building automation and safeguards that prevent similar issues from recurring
  • Assist in creating and maintaining deployment and orchestration tools, such as Helm Charts, omnibus-gitlab configurations, and multi-tenant workflows, that make it easy for teams to manage GitLab environments at scale
  • Collaborate with senior SREs to solve the unique challenges of managing many tenant environments in parallel, each with its own constraints and integration points
  • Help define, deploy, and maintain GitLab environments across cloud providers using infrastructure as code, deployment packages, and Kubernetes
  • Contribute to automation that reduces manual work, assist in building tooling that orchestrates upgrades and configuration changes safely at scale, and support an observability stack that lets us understand and improve the health of every environment
  • Work closely with the Dedicated team to ensure that GitLab environments are always production ready and meet the needs of our customers
  • Participate in on-call rotations and collaborate with other teams to resolve issues and improve the overall reliability and scalability of our systems
  • Stay up-to-date with the latest developments in infrastructure automation, Kubernetes, and cloud computing, and apply this knowledge to continuously improve our systems and processes

What We Are Looking For

  • 3+ years of experience in a site reliability engineering or similar role, with a focus on automation and infrastructure as code
  • Strong knowledge of Terraform, Ansible, and Kubernetes, with experience deploying and managing these technologies in a cloud-based environment
  • Experience with GitLab and its components, including GitLab Runner, GitLab CI/CD, and GitLab Pages
  • Strong understanding of cloud computing concepts, including infrastructure as code, deployment packages, and containerization
  • Experience with Linux and containerization technologies, such as Docker
  • Strong problem-solving skills, with the ability to debug and resolve complex issues in a fast-paced environment
  • Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams
  • Experience with continuous integration and continuous deployment (CI/CD) pipelines and tools, such as Jenkins or GitLab CI/CD

Nice to Have

  • Experience with other infrastructure automation tools, such as AWS CloudFormation or Google Cloud Deployment Manager
  • Knowledge of programming languages, such as Python or Ruby
  • Experience with agile development methodologies and version control systems, such as Git
  • Certification in a relevant field, such as AWS Certified DevOps Engineer or Google Cloud Certified - Professional Cloud Developer

Benefits and Perks

  • Competitive salary and benefits package
  • Opportunity to work with a talented and dedicated team of professionals who are passionate about developing automation for GitLab environments
  • Collaborative and dynamic work environment with a strong focus on innovation and continuous improvement
  • Flexible working hours and remote work options
  • Professional development opportunities, including training and certification programs
  • Access to the latest technologies and tools, including GitLab, Terraform, Ansible, and Kubernetes

How to Stand Out

  • Make sure you have a strong understanding of infrastructure automation, including Terraform, Ansible, and Kubernetes, and be prepared to provide examples of your experience with these technologies.
  • Highlight your problem-solving skills and ability to debug and resolve complex issues in a fast-paced environment.
  • Emphasize your experience with GitLab and its components, including GitLab Runner, GitLab CI/CD, and GitLab Pages.
  • Be prepared to discuss your experience with continuous integration and continuous deployment (CI/CD) pipelines and tools, such as Jenkins or GitLab CI/CD.
  • Show your passion for automation and infrastructure as code, and be prepared to provide examples of how you have applied these principles in previous roles.
  • Don't be afraid to ask questions during the interview process, and be prepared to provide feedback on how you would approach a given problem or scenario.
  • Make sure you have a strong understanding of cloud computing concepts, including infrastructure as code, deployment packages, and containerization.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.