Staff Machine Learning Systems Engineer, Embeddings Platform
WFA Digital Insight
As the demand for AI-driven systems continues to rise, with a predicted 25% increase in machine learning investments by 2027, Reddit is at the forefront of this shift. With over 100,000 active communities, the need for advanced personalization and recommendation systems has never been more critical. The role of a Staff Machine Learning Systems Engineer is not just about technical prowess but also about influencing key AI-driven systems and mentoring a team of engineers. Candidates should be aware that this role requires a deep understanding of modern deep learning architectures and the ability to design, implement, and optimize scalable ML systems. With the remote job market booming, this is an opportunity to join a leader in the tech industry and contribute to its growth.
Job Description
About the Role
The Staff Machine Learning Systems Engineer position at Reddit is a pivotal role that involves leading the development of next-generation, large-scale machine learning techniques. This role is part of the Embeddings Platform team, which is responsible for building highly expressive machine learning models that power Reddit's recommendation systems. The successful candidate will be at the forefront of driving innovation in scalable model design and training approaches, ensuring efficient and reliable deployment of ML models in production.As part of this role, the engineer will partner with leadership to define ML roadmaps and collaborate with cross-functional teams to integrate ML models into Reddit's key AI-driven systems. The role requires a deep understanding of complex multi-entity relationships in machine learning applications and how they are modeled in large-scale systems. The engineer will also be responsible for mentoring and guiding senior and mid-level ML engineers, fostering a culture of excellence, innovation, and knowledge sharing.
The Embeddings Platform team is committed to leveraging modern deep learning approaches and scalable model designs to enhance personalization across Reddit's ecosystem. The team's work impacts content discovery, user engagement, and platform growth at a massive scale. As such, the Staff Machine Learning Systems Engineer will play a critical role in driving the technical direction for large-scale machine learning models and guiding the development of advanced deep learning architectures.
What You Will Do
- Architect and lead the development of next-generation, large-scale machine learning techniques
- Define and execute the ML strategy, identifying opportunities to enhance personalization and recommendation quality across Reddit
- Lead research initiatives on scalable machine learning systems and real-time model adaptation, bringing cutting-edge advancements into production
- Partner with ML infrastructure teams to build high-performance, distributed training systems that efficiently scale across multiple GPUs and cloud environments
- Establish and optimize real-time serving architectures for large-scale embeddings, ensuring low-latency inference and high throughput
- Collaborate cross-functionally with teams in Feed Ranking, Ads, Content Understanding, and Core ML to integrate ML models into Reddit's key AI-driven systems
- Mentor and guide senior and mid-level ML engineers, fostering a culture of excellence, innovation, and knowledge sharing
- Stay at the forefront of AI research, evaluating and introducing new modeling paradigms to keep Reddit's ML ecosystem cutting-edge
- Drive technical discussions, present findings to leadership, and contribute to long-term ML planning and decision-making
What We Are Looking For
- 8+ years of experience in machine learning engineering, with a strong focus on large-scale ML systems and recommendation or personalization systems
- Expertise in modern deep learning architectures, including sequence models and foundational models
- Deep understanding of complex multi-entity relationships in machine learning applications and how they are modeled in large-scale systems
- Proven ability to design, implement, and optimize scalable ML architectures
- Experience with large-scale distributed training systems and cloud environments
- Strong understanding of software engineering principles and experience with Agile development methodologies
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams
- Strong problem-solving skills, with the ability to analyze complex problems and develop creative solutions
Nice to Have
- Experience with natural language processing and computer vision applications
- Knowledge of reinforcement learning and deep learning techniques
- Familiarity with TensorFlow, PyTorch, or other deep learning frameworks
- Experience with cloud-based ML platforms, such as AWS SageMaker or Google Cloud AI Platform
- Certification in machine learning or a related field
Benefits and Perks
- Competitive salary and bonus structure
- Equity in a leading tech company
- Comprehensive health, dental, and vision insurance
- 401(k) matching program
- Flexible PTO policy and paid holidays
- Remote work stipend and home office setup support
- Professional development opportunities, including conference attendance and training programs
- Access to cutting-edge technologies and tools
- Collaboration with a talented team of engineers and researchers
How to Stand Out
- Ensure your resume highlights your experience with large-scale machine learning systems and recommendation or personalization systems.
- Prepare to discuss your expertise in modern deep learning architectures and complex multi-entity relationships in machine learning applications.
- Be ready to provide examples of your experience with large-scale distributed training systems and cloud environments.
- Show a deep understanding of software engineering principles and experience with Agile development methodologies.
- Highlight your ability to communicate complex technical concepts to non-technical stakeholders.
- Demonstrate your problem-solving skills by walking the interviewer through your approach to analyzing complex problems and developing creative solutions.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.