Senior Site Reliability Engineer, Tenant Services: Geo
WFA Digital Insight
The demand for skilled site reliability engineers has grown significantly, with a 25% increase in job postings in the last year. As companies like GitLab continue to expand their remote teams, professionals with expertise in disaster recovery and DevSecOps are in high demand. With over 50 million registered users, GitLab is a leader in the industry, and this role offers a chance to work with a talented team. Before applying, candidates should be aware of the importance of AI-driven workflows and the company's values-driven culture. With the right skills and experience, this could be a career-defining opportunity.
Job Description
About the Role
As a Senior Site Reliability Engineer at GitLab, you will be part of the Tenant Services, Geo team, responsible for supporting GitLab Dedicated customer migrations and Geo-related escalations. Your day-to-day work will involve executing Dedicated Geo migrations and cutovers, handling Geo-related escalations, and improving the Geo operational surface for Dedicated. You will work closely with the core Geo team, Dedicated migrations, and Support to ensure seamless operations. The Geo team ensures the smooth replication of data from a GitLab instance to a warm standby, which is used for data migrations and disaster recovery. As a Senior Site Reliability Engineer, you will play a key role in maintaining the reliability and efficiency of these systems. GitLab is committed to fostering a culture of innovation and collaboration, and as a member of the team, you will be expected to embody these values.
What You Will Do
- Execute Dedicated Geo migrations and cutovers end-to-end, including planning, pre-cutover validation, execution, and post-cutover verification and cleanup
- Join the team's shift and weekend coverage rotation for Dedicated cutovers across EMEA and US hours
- Participate in the SaaS Site Reliability Engineering (SRE) on-call rotation to respond to incidents that impact GitLab.com availability
- Operate and improve the Geo operational surface for Dedicated, including environment preparation and data hygiene checks prior to migrations
- Execute replication, validation, and cutover procedures
- Handle Geo-related escalations from Support and internal partners
- Design, build, and maintain automation, tooling, and runbooks that make migrations, cutovers, and Geo escalations as efficient and repeatable as possible
- Collaborate with the core Geo team to improve the overall Geo operational surface
- Develop and maintain documentation for Geo-related processes and procedures
What We Are Looking For
- 5+ years of experience in a similar role, preferably in a cloud-based environment
- Strong understanding of disaster recovery technologies and DevSecOps principles
- Experience with GitLab or similar version control systems
- Familiarity with automation tools and scripting languages
- Excellent problem-solving skills and attention to detail
- Ability to work in a fast-paced environment and respond to incidents effectively
- Strong communication and collaboration skills
- Experience with AI-driven workflows and automation
Nice to Have
- Experience with Geo or Gitaly
- Knowledge of cloud-based infrastructure and migration techniques
- Familiarity with Agile development methodologies
- Certification in a related field, such as DevSecOps or cloud computing
Benefits and Perks
- Competitive salary and benefits package
- Opportunity to work with a talented team of professionals
- Collaborative and dynamic work environment
- Professional development and growth opportunities
- Flexible working hours and remote work options
- Access to the latest technologies and tools
- Recognition and reward for outstanding performance
- Comprehensive health and wellness programs
- Generous paid time off and holiday policy
How to Stand Out
- Make sure to highlight your experience with disaster recovery technologies and DevSecOps principles in your application
- Familiarize yourself with GitLab's values and culture to demonstrate your understanding of the company's vision and mission
- Develop a strong understanding of automation tools and scripting languages to improve your chances of success in this role
- Be prepared to provide examples of your problem-solving skills and attention to detail in your previous experience
- Research the company's use of AI-driven workflows and automation to show your interest and enthusiasm for the role
- Prepare to discuss your experience with collaborative tools and version control systems
- Show your willingness to learn and adapt to new technologies and processes
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.