Staff Site Reliability Engineer

GE AerospaceGE Aerospace·Remote(United States)
Software Development
ProgrammaticExcel

WFA Digital Insight

As remote work reshapes the tech landscape, demand for skilled site reliability engineers has surged. With a 25% increase in cloud infrastructure adoption in 2025, professionals with expertise in programmatic solutions and Excel are highly sought after. GE Aerospace stands out for its commitment to innovation and customer experience, making this role an exciting opportunity for those passionate about operational excellence. Candidates should be prepared to showcase their technical prowess and ability to drive architecture enhancements. Before applying, it's essential to understand the role's emphasis on automation, analytics, and collaboration.

Job Description

About the Role

The Staff Site Reliability Engineer plays a critical role in ensuring the performance and availability of compute and network infrastructure across all business segments. This position is part of a highly skilled team focused on achieving operational excellence through relentless technical innovation and automation. The ideal candidate will be passionate about delivering exceptional customer experiences and driving the adoption of cutting-edge technologies.

As a key member of the Site Reliability team, you will work closely with cross-functional teams to establish performance baselines, capacity thresholds, and monitoring criteria. Your expertise in programmatic solutions and data analysis will be essential in developing automated solutions to address potential problems before they impact service availability.

GE Aerospace is committed to fostering a culture of innovation and collaboration. As a Staff Site Reliability Engineer, you will have the opportunity to work with a talented team of professionals who are dedicated to delivering exceptional results.

What You Will Do

  • Establish performance baselines and capacity thresholds for compute and network infrastructure
  • Develop automated solutions to address potential problems before they result in service interruptions
  • Provide impact assessments and mitigation plans for changes going into the production environment
  • Investigate root causes of severe and systemic outages and apply corrective actions across the enterprise
  • Develop availability measures that align with customer experience to accurately assess the usability of crucial services
  • Build capacity models to baseline transactional load compared to resource performance
  • Identify thresholds for all critical links in the data path to quickly isolate potential imbalances
  • Analyze failure points in services to model risk levels and resolution steps
  • Assist in driving architecture enhancements into the system to mitigate potential failure points
  • Programmatically monitor for and remediate configuration drift of critical devices
  • Develop response plans to potential failure points and evaluate effectiveness during planned tests

What We Are Looking For

  • Bachelor's degree from an accredited university or college with a minimum of 4 years of professional experience
  • Excellent knowledge of AWS/Azure cloud services
  • Strong oral and written communication skills
  • Demonstrated experience scripting or developing software and services for the cloud (Python, Go, Java, Node.js, .NET, etc.)
  • Extensive knowledge of network protocols (TCP/IP, SNMP, FTP, syslog, TFTP, etc.)
  • Experience managing version control systems such as Git
  • Experience deploying and managing infrastructure on public clouds such as AWS or Azure

Nice to Have

  • Experience using an automated configuration management system (Terraform, Chef, Puppet, Ansible, Salt, etc.)
  • Knowledge of Network Management (SNMP, MIB)
  • Experience with configuring, customizing, and extending monitoring tools (Datadog, Sensu, Grafana, Splunk, etc.)
  • Programming experience with open-source scripting and data analysis packages like Python, R

Benefits and Perks

  • Competitive compensation package
  • Opportunity to work with a talented team of professionals
  • Professional development and growth opportunities
  • Comprehensive health benefits
  • Remote work stipend
  • Generous PTO policy

How to Stand Out

  • Ensure your resume and cover letter highlight your experience with programmatic solutions and data analysis. For example, describe a project where you developed an automated script to resolve a recurring issue, detailing the programming language used and the benefits achieved.
  • Be prepared to discuss your experience with cloud infrastructure, including AWS/Azure services, and how you've applied them in previous roles. Consider preparing examples of how you've optimized resource utilization or improved service availability.
  • Familiarize yourself with GE Aerospace's technology stack and be ready to ask informed questions during the interview. Research the company's recent projects and initiatives to demonstrate your interest and enthusiasm.
  • Showcase your problem-solving skills by walking the interviewer through a complex technical issue you've resolved in the past. Use the STAR method to structure your response, covering the situation, task, action, and result.
  • Prepare to back up your claims of automation expertise with real-world examples or code snippets. Consider sharing a personal project or contribution to an open-source repository that demonstrates your skills in automation and scripting.
  • Don't underestimate the importance of soft skills; highlight your ability to work collaboratively in a remote environment and communicate effectively with cross-functional teams. Emphasize your experience with collaboration tools, such as Slack or Microsoft Teams, and describe how you've successfully managed remote relationships.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.