Machine Learning Systems Engineer, Ads ML Platform

RedditReddit·Remote(Remote - United Kingdom)
Software Development
Excel

WFA Digital Insight

The demand for skilled machine learning engineers has surged in recent years, with a particular emphasis on those who can build and scale data infrastructure. As remote work becomes the norm, companies like Reddit are looking for talent who can drive innovation in the digital space. With the rise of AI and ML, the ability to manage and optimize data pipelines is crucial. According to recent statistics, the need for data engineers with ML expertise has grown by over 25% in the last two years. Reddit's flexible work culture and commitment to community-driven conversations make it an attractive choice for those looking to make a meaningful impact. Before applying, candidates should be aware of the importance of collaboration and open communication in a remote setup.

Job Description

About the Role

The Machine Learning Systems Engineer position at Reddit is a unique opportunity to join a dynamic team focused on building scalable feature platforms for Ads ML. As a key member of this team, you will be responsible for designing and implementing data infrastructure that supports large-scale feature and training set computation, transformation, and storage. Your expertise in data engineering and machine learning workflows will be crucial in driving the success of Reddit's advertising efforts.

Reddit is a community-driven platform with over 100,000 active communities and approximately 126 million daily active unique visitors. The company prides itself on fostering open and authentic conversations on the internet. As a remote worker, you will have the flexibility to work from anywhere in the UK, collaborating with a talented team of engineers to evolve and scale Reddit's feature management systems.

What You Will Do

  • Design and build data infrastructure that supports large-scale feature and training set computation, transformation, and storage.
  • Develop frameworks for batch and real-time features with a focus on reliability, scalability, and ease of use.
  • Build platform capabilities for feature governance, including lineage tracking, validation, drift detection, anomaly monitoring, reproducibility, and versioning.
  • Partner with ML engineers to ensure smooth integration of feature engineering workflows into ML production systems.
  • Build systems that support agentic ML workflows, including automated feature discovery, feature quality evaluation, and feature lifecycle management.
  • Contribute to operational excellence through observability, performance tuning, reliability engineering, and cost optimization initiatives.
  • Work closely with cross-functional teams to identify and prioritize project requirements.
  • Collaborate on the development of best practices for data infrastructure and ML workflows.
  • Stay up-to-date with industry trends and emerging technologies to continuously improve Reddit's Ads ML platform.

What We Are Looking For

  • 3+ years of experience in data infrastructure/platform engineering or ML infrastructure platforms.
  • Hands-on experience building production services, data pipelines, APIs, workflow systems, or developer tools.
  • Experience with at least one distributed data or compute system such as Spark, PySpark, Flink, Kafka, Ray, Airflow, Kubernetes, BigQuery, or similar technologies.
  • Familiarity with ML data workflows such as feature generation, training dataset creation, batch processing, real-time data processing, model training, experimentation, or online serving.
  • Strong coding skills and the ability to write clean, maintainable, well-tested code.
  • Experience with Excel and other data analysis tools.
  • Excellent communication and collaboration skills.
  • Ability to work in a fast-paced environment and adapt to changing priorities.

Nice to Have

  • Experience building intelligent automation or agentic workflows for ML systems.
  • Experience with ML infrastructure and MLOps workflows spanning feature engineering, training pipelines, experimentation, model deployment, and online serving.
  • Knowledge of cloud-based technologies and their applications in ML workflows.
  • Certification in data engineering or a related field.

Benefits and Perks

  • Flexible remote work arrangement from anywhere in the UK.
  • Competitive salary and benefits package.
  • Opportunity to work with a talented team of engineers and contribute to the development of Reddit's Ads ML platform.
  • Professional development opportunities, including training and conference sponsorships.
  • Access to cutting-edge technologies and tools.
  • Health, dental, and vision insurance.
  • Generous PTO and holidays.
  • Parental leave and family support.
  • Employee assistance programs and mental health support.

How to Stand Out

  • Showcase your experience with distributed data systems and ML workflows in your resume and cover letter.
  • Prepare to discuss specific projects you've worked on, highlighting your problem-solving skills and ability to collaborate with cross-functional teams.
  • Familiarize yourself with Reddit's community-driven culture and be ready to explain how your skills align with the company's mission.
  • Develop a portfolio that demonstrates your expertise in data engineering and ML infrastructure, including any personal projects or contributions to open-source initiatives.
  • Be prepared to discuss your approach to operational excellence, including observability, performance tuning, and cost optimization.
  • Highlight any experience you have with agentic ML workflows and automated feature discovery.
  • Practice explaining complex technical concepts in simple terms to demonstrate your ability to communicate effectively with non-technical stakeholders.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.