Senior Site Reliability Engineer - Payward Services
WFA Digital Insight
The demand for skilled site reliability engineers has surged 25% in the past year, driven by the need for seamless digital experiences. With companies like Kraken leading the charge in digital innovation, professionals with expertise in DevOps, cloud infrastructure, and CI/CD pipelines are in high demand. As a remote site reliability engineer, you'll play a critical role in ensuring the stability and efficiency of Kraken's systems. Before applying, consider highlighting your experience with hybrid-cloud environments, scripting skills, and ability to work independently in fast-paced environments. Kraken stands out for its commitment to operational excellence and offering a world-class team environment.
Job Description
About the Role
As a Senior Site Reliability Engineer at Kraken, you will be at the forefront of managing infrastructure and improving CI/CD pipelines. This role is pivotal in ensuring the operational excellence of Kraken's Payward Services business unit. Day-to-day, you will collaborate closely with the team to identify areas for improvement, implement efficient solutions, and support the development of best practices. The team's success is deeply intertwined with your ability to debug complex distributed systems, networks, and Linux operating systems issues, ensuring that Kraken's services run smoothly and efficiently.Kraken is a leader in digital innovation, and its commitment to excellence is reflected in its search for a talented Senior Site Reliability Engineer. This position is not just about technical expertise but also about being a self-starter who can thrive in a fast-paced, remote environment. The team at Kraken values collaboration, innovation, and the pursuit of operational excellence, making this role an exciting opportunity for the right candidate.
In this role, you will report to a seasoned leader who understands the importance of site reliability engineering in driving business success. Your insights and expertise will contribute significantly to the development of Kraken's infrastructure strategy, ensuring that the company remains at the forefront of digital innovation.
What You Will Do
- Manage and optimize hybrid-cloud infrastructure environments to ensure high availability and scalability.
- Develop and improve CI/CD pipelines to enhance the efficiency and reliability of software deployments.
- Collaborate with cross-functional teams to identify and resolve complex technical issues.
- Design, implement, and maintain monitoring and alerting systems, with a focus on Prometheus and Grafana.
- Debug and resolve issues in distributed systems, networks, and Linux operating systems.
- Implement containerization and orchestration solutions using Docker, Nomad, or Kubernetes.
- Develop and maintain scripts for automation and troubleshooting using Bash, Python, or Go.
- Participate in on-call rotations to provide 24/7 support for critical systems.
- Analyze system performance, identify bottlenecks, and propose optimizations.
- Develop and maintain technical documentation for infrastructure and systems.
What We Are Looking For
- 5+ years of experience in a DevOps or Site Reliability Engineering role.
- Proficiency in hybrid-cloud infrastructure environments, including AWS, Azure, or Google Cloud.
- Experience with Git source version control and CI/CD configuration.
- Deep understanding of monitoring and alerting systems, preferably Prometheus and Grafana.
- Ability to debug complex distributed systems, networks, and Linux operating systems issues.
- Experience with containerization and orchestration tools such as Docker, Nomad, or Kubernetes.
- Strong scripting skills in Bash, Python, or Go.
- Self-motivated individual with the ability to work independently in a remote environment.
- Excellent problem-solving skills and attention to detail.
Nice to Have
- Experience with agile development methodologies and version control systems.
- Knowledge of security best practices and compliance frameworks.
- Familiarity with project management tools such as Jira or Asana.
- Certification in cloud computing, DevOps, or a related field.
Benefits and Perks
- Competitive salary and equity package.
- 401(k) matching program to support your retirement goals.
- Generous paid time off to ensure a healthy work-life balance.
- Opportunities for professional development and continuous learning.
- Access to a world-class team with a culture of innovation and collaboration.
- Flexible, remote work environment to suit your lifestyle.
- Comprehensive health insurance package.
- Annual stipend for professional growth and development.
How to Stand Out
- Highlight your experience with hybrid-cloud infrastructure and how you've managed or improved CI/CD pipelines in previous roles.
- Prepare to back your claims with examples of complex distributed systems you've debugged or optimized.
- Familiarize yourself with Kraken's technology stack and be ready to discuss how your skills align with their needs.
- Develop a strong understanding of monitoring and alerting systems, particularly Prometheus and Grafana, to stand out as a candidate.
- Showcase your scripting skills by sharing examples of scripts you've developed for automation or troubleshooting.
- Be prepared to discuss your approach to problem-solving and how you handle working in a fast-paced, remote environment.
- Research Kraken's company culture and be ready to discuss how your values and work style align with theirs.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.