Senior Software Engineer - Orchestration & Job Execution
WFA Digital Insight
As demand for skilled software engineers in cloud platform engineering continues to surge, with a 25% growth in the US market, companies like Alteryx are at the forefront of innovation. With the rising importance of automation, AI, and data-driven decision-making, professionals equipped with expertise in distributed systems, asynchronous processing, and cloud-native technologies are in high demand. This role stands out for its focus on building and operating backend services that coordinate work movement through Alteryx Cloud, requiring a unique blend of technical prowess and operational acumen. Candidates should be prepared to showcase their ability to design, build, and operate complex production systems end-to-end, understanding the intricacies of distributed systems, observability, and operational tradeoffs.
Job Description
## About the Role The Senior Software Engineer - Orchestration & Job Execution role at Alteryx presents a unique opportunity for a seasoned engineer to join the cloud platform engineering team. In this capacity, you will be instrumental in designing, building, operating, and evolving backend services that are pivotal to the coordination of work within Alteryx Cloud. The role involves a deep focus on platform capabilities related to orchestration, job execution, workflow runtime services, and ensuring reliable service-to-service coordination. This is a senior engineering position that requires comfort and expertise in owning complex production systems from end to end, including reasoning about distributed systems, asynchronous processing, reliability, observability, failure modes, and making informed operational tradeoffs.
The work will be backend-heavy, with responsibilities including building APIs, services, workers, libraries, and runtime infrastructure. You will have the opportunity to work with modern cloud-native systems and technologies, such as TypeScript/Node.js, relational databases, messaging or queueing systems, event-driven architecture, containers, Kubernetes, and observability tooling. Collaboration with various teams, including platform, product, security, QE, and application teams, will be key to ensuring that cloud platform capabilities are reliable, understandable, and safe to operate at scale.
## What You Will Do - Design and deliver backend services, APIs, workers, and shared libraries that power cloud platform capabilities.
- Build and improve systems for orchestrating work across services, including job submission, execution tracking, status propagation, retries, cancellation, results, and operational visibility.
- Develop reliable asynchronous and event-driven systems using queues, messaging, background workers, and durable state.
- Work on distributed execution flows across platform services, including service-to-service communication, routing, acknowledgements, and failure recovery.
- Build and maintain cloud-native runtime infrastructure using containers, Kubernetes, deployment automation, and related platform tooling.
- Integrate platform services with persistence layers, event streams, REST APIs, and internal service contracts.
- Improve production reliability through metrics, tracing, structured logging, health checks, dashboards, alerting, runbooks, and incident follow-up.
- Lead technical design for ambiguous or cross-service work, review code with a systems mindset, and mentor engineers on distributed-system and production-engineering practices.
- Collaborate with partner teams to turn product requirements into incremental, testable, and operable platform capabilities.
- Utilize AI and modern development tools to enhance engineering productivity, code quality, and delivery speed.
- Strong experience building backend services using TypeScript/Node.js, or core systems languages like Go, Java, or Rust, with a willingness to work primarily in Node.js.
- Experience designing and operating asynchronous, queue-driven, or event-driven systems, including patterns such as retries, cancellation, idempotency, concurrency, ordering, timeouts, and failure handling.
- Familiarity with durable persistence, service contracts, RESTful APIs, and integrations across multiple services or platform components.
- Knowledge of cloud-native systems, including containers, Kubernetes, and observability tooling.
- Experience with modern development practices, including version control (e.g., Git), continuous integration, and continuous deployment.
- Strong problem-solving skills, with the ability to reason about complex systems and operational issues.
- Excellent communication skills, with the ability to collaborate effectively with cross-functional teams.
- Familiarity with AI and machine learning concepts, and their application in software development.
- Knowledge of cybersecurity principles and practices, particularly in cloud environments.
- Participation in open-source projects or personal coding projects that demonstrate expertise in relevant areas.
- Opportunities for professional growth and career advancement within a rapidly expanding company.
- Flexible, remote work arrangements, with the possibility of working from anywhere in the United States.
- Access to cutting-edge technologies and tools, with a focus on cloud-native systems and modern software development practices.
- Collaborative, dynamic work environment with a team of experienced engineers and professionals.
- Comprehensive health insurance, including medical, dental, and vision coverage.
- Generous paid time off (PTO) policy, including vacation days, sick leave, and holidays.
How to Stand Out
- Leverage your Excel skills: Even though the role is heavily focused on backend development, showcasing proficiency in data analysis and visualization using Excel can be a valuable asset.
- Highlight distributed system expertise: Emphasize your understanding and experience with distributed systems, asynchronous processing, and cloud-native technologies in your resume and during interviews.
- Prepare to discuss operational tradeoffs: Be ready to dive into the intricacies of operational decisions, including tradeoffs related to scalability, reliability, and performance in distributed systems.
- Showcase your problem-solving skills: Prepare examples that demonstrate your ability to reason about complex systems, identify issues, and propose effective solutions.
- Demonstrate your ability to work remotely: Highlight your experience with remote collaboration tools and your self-motivation to work effectively in a remote setup.
- Be prepared to talk about your experience with cloud-native technologies: Have specific examples ready that showcase your hands-on experience with containers, Kubernetes, and observability tooling.
- Emphasize your ability to learn and adapt: Show enthusiasm for learning new technologies and adapting to the evolving landscape of cloud platform engineering.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.