Platform Engineering Manager

Tango·Remote(United States)
Software Development
Excel

WFA Digital Insight

The demand for skilled platform engineers has skyrocketed as companies invest heavily in cloud infrastructure and AI technologies. With a 25% growth in cloud computing jobs in the past year, professionals with expertise in multi-cloud strategy, observability, and AI/ML tooling are in high demand. Tango stands out as a pioneer in AI-native Internal Developer Platforms, and this role offers a unique chance to build and operate a foundational layer powering engineering velocity. Before applying, candidates should be well-versed in AWS and Azure Well-Architected Frameworks and have a solid understanding of cloud modernization principles.

Job Description

About the Role

As a Platform Engineering Manager at Tango, you will be at the forefront of building and operating the company's AI-native Internal Developer Platform (IDP), the foundational layer that powers engineering velocity across the organization. This is a critical role that requires a deep understanding of cloud infrastructure, AI/ML technologies, and the ability to drive cloud modernization efforts. You will be responsible for defining the platform roadmap, leading the migration of teams onto the platform, and ensuring that the platform is scalable, secure, and efficient.

The IDP is a key component of Tango's technology stack, and as the Platform Engineering Manager, you will be working closely with cross-functional teams to ensure that the platform meets the needs of the business. This includes collaborating with peer engineering leaders to migrate teams onto the platform, positioning the platform as the organization's AI-first engineering foundation, and ensuring that the platform is aligned with the company's overall technology strategy.

Tango is committed to creating a culture of innovation and collaboration, and as a Platform Engineering Manager, you will be expected to embody these values. You will be working in a dynamic and fast-paced environment, and you will need to be adaptable, flexible, and able to thrive in a rapidly changing landscape.

What You Will Do

  • Own and execute the platform roadmap, including compute, networking, identity, observability, shared services, and AI/ML tooling across AWS and Azure
  • Lead cloud modernization efforts against the AWS and Azure Well-Architected Frameworks, ensuring that the platform is aligned with the company's overall technology strategy
  • Define golden paths for standardized self-service workflows, including service scaffolding, DB provisioning, environment spin-up, and AI workload deployment
  • Drive IaC & CI/CD automation, using tools such as OpenTofu/Ansible, GitHub Actions, and ArgoCD to maximize deployment frequency and reduce lead time
  • Own org-wide observability, including metrics, logs, traces, and alerting, and ensure that the platform has full-stack coverage across infrastructure, Kubernetes, APM, distributed tracing, AI pipelines, and cost anomaly detection
  • Build and operate a self-service shared services catalog, including secrets management, API gateways, model registries, and LLM gateways
  • Collaborate with peer engineering leaders to plan and execute structured workload migrations onto the platform
  • Partner with data science and ML engineering to translate agentic workflow requirements into reusable platform primitives
  • Establish governance, cost controls, prompt injection guardrails, and model access policies for AI API usage and inference spend

What We Are Looking For

  • 5+ years of experience in platform engineering, with a focus on cloud infrastructure and AI/ML technologies
  • Strong understanding of cloud modernization principles and the ability to lead cloud modernization efforts
  • Experience with AWS and Azure Well-Architected Frameworks, including operational excellence, security, reliability, performance efficiency, and cost optimization
  • Strong programming skills in languages such as Python, Java, or C++
  • Experience with IaC tools such as OpenTofu/Ansible, Terraform, or CloudFormation
  • Strong understanding of CI/CD pipelines and the ability to drive automation efforts
  • Experience with observability tools such as Datadog, Signoz, OpenTelemetry, Grafana, Prometheus, or Loki
  • Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams

Nice to Have

  • Experience with AI/ML technologies, including machine learning frameworks such as TensorFlow or PyTorch
  • Experience with containerization technologies such as Docker or Kubernetes
  • Experience with agile development methodologies and the ability to work in a fast-paced environment
  • Strong understanding of security principles and the ability to ensure that the platform is secure and compliant

Benefits and Perks

  • Competitive salary and benefits package
  • Opportunity to work with a cutting-edge technology stack and contribute to the development of a pioneering AI-native Internal Developer Platform
  • Collaborative and dynamic work environment with a team of experienced engineers and technologists
  • Flexible working hours and remote work options
  • Professional development opportunities, including training and education programs
  • Access to the latest tools and technologies, including cloud infrastructure and AI/ML platforms
  • Recognition and reward programs, including bonuses and stock options
  • Comprehensive health and wellness programs, including medical, dental, and vision coverage
  • Generous PTO and holiday schedule, including paid time off for vacations and holidays

How to Stand Out

  • Be prepared to discuss your experience with cloud modernization and AI/ML technologies, and how you have applied these skills in previous roles.
  • Make sure to highlight your understanding of cloud infrastructure and your ability to drive automation efforts using tools such as OpenTofu/Ansible and GitHub Actions.
  • Showcase your experience with observability tools such as Datadog and Signoz, and your ability to ensure that the platform has full-stack coverage across infrastructure, Kubernetes, APM, distributed tracing, AI pipelines, and cost anomaly detection.
  • Emphasize your strong communication and collaboration skills, and your ability to work effectively with cross-functional teams.
  • Be prepared to discuss your experience with agile development methodologies and your ability to work in a fast-paced environment.
  • Research the company and the role, and be prepared to ask informed questions during the interview process.
  • Make sure to highlight your passion for innovation and your desire to contribute to the development of a pioneering AI-native Internal Developer Platform.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.