Senior Staff Software Engineer, Indexing & Retrieval Platform

RedditReddit·Remote(Remote - United States)
Software Development
Excel

WFA Digital Insight

As the demand for skilled software engineers in machine learning continues to rise, with over 50% of companies investing in AI, Reddit's Senior Staff Software Engineer role stands out. With the growth of online communities, the need for efficient and scalable indexing and retrieval systems has never been more pressing. This role offers a unique chance to lead the technical strategy and implementation of Reddit's next-generation ML Indexing & Retrieval engine, making it an attractive opportunity for those looking to make a significant impact in the tech industry. Candidates should be prepared to showcase their expertise in designing and maintaining large-scale distributed systems, as well as their ability to collaborate with cross-functional teams.

Job Description

About the Role

The Senior Staff Software Engineer will lead the development of Reddit's ML Indexing & Retrieval Platform, focusing on building and scaling the core infrastructure that powers machine learning-driven recommendations. This role is part of a high-impact, cross-functional team responsible for designing and maintaining systems for ML data ingestion, low-latency retrieval services, and end-to-end lifecycle management of data. With a focus on performance, reliability, and scalability, the team enables real-time access to high-quality data that supports a wide range of applications, including Content Understanding, Semantic, Lexical retrieval, and GenAI applications.

The ML Indexing & Retrieval Platform team at Reddit is committed to delivering high-quality experiences for users, and the Senior Staff Software Engineer will play a key role in achieving this goal. By leading the technical strategy, architecture, and implementation of Reddit's next-generation ML Indexing & Retrieval engine, the successful candidate will have a significant impact on the company's ability to provide efficient and scalable systems.

Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet's largest sources of information.

What You Will Do

  • Lead the technical strategy, architecture, and implementation of Reddit's next-generation ML Indexing & Retrieval engine, integrating capabilities across lexical and vector indexing, low-latency retrieval, and emerging GenAI applications.
  • Partner closely with product engineers across Content Understanding, Search, Feeds, Ads, Growth, and Safety to deliver high-quality experiences.
  • Define best practices for observability, reliability, and operational excellence in large-scale distributed systems.
  • Mentor and guide engineers in designing scalable infrastructure and adopting robust DevOps and SRE principles.
  • Collaborate with infrastructure, and ML teams to ensure the platform evolves to meet the needs of Reddit's growing user base and diverse content ecosystem.
  • Develop and maintain large-scale distributed systems, with a focus on performance, reliability, and scalability.
  • Design and implement data ingestion, processing, and storage systems that support real-time data access.
  • Work closely with cross-functional teams to identify and prioritize project requirements.
  • Participate in code reviews and ensure that the codebase is maintainable, efficient, and well-documented.
  • Troubleshoot and resolve complex technical issues, and develop strategies for preventing similar issues in the future.

What We Are Looking For

  • 8+ years of experience in software development, with a focus on building and scaling large-scale distributed systems.
  • Expertise in one or more programming languages, such as Go, Java, Python, or any object-oriented programming language.
  • Experience with frameworks such as Flink, Airflow, or Spark for large-scale batch and stream processing.
  • Familiarity with databases such as Vector, Lexical, and Key-Value Databases.
  • Knowledge of tools such as Kubernetes, Docker, AWS, and GCP.
  • Experience with designing and implementing scalable infrastructure and adopting robust DevOps and SRE principles.
  • Strong understanding of machine learning concepts and experience with ML-driven recommendations.
  • Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.

Nice to Have

  • Experience with GenAI applications and emerging technologies.
  • Familiarity with Reddit's technology stack and architecture.
  • Participation in open-source projects or contributions to the software development community.
  • Experience with agile development methodologies and version control systems such as Git.
  • Knowledge of cloud-based infrastructure and experience with cloud providers such as AWS or GCP.

Benefits and Perks

  • Competitive salary and equity package.
  • Comprehensive health, dental, and vision insurance.
  • Flexible PTO and holiday schedule.
  • Remote work stipend and equipment budget.
  • Professional development opportunities and conference sponsorship.
  • Access to Reddit's internal communities and networks.
  • Opportunity to work on high-impact projects and contribute to the company's mission.

How to Stand Out

  • Showcase your expertise in designing and maintaining large-scale distributed systems, and be prepared to provide specific examples of your experience.
  • Highlight your ability to collaborate with cross-functional teams, and emphasize your excellent communication and interpersonal skills.
  • Familiarize yourself with Reddit's technology stack and architecture, and be prepared to discuss your experience with similar technologies.
  • Be prepared to discuss your experience with machine learning concepts and ML-driven recommendations, and highlight your understanding of the company's goals and objectives.
  • Research the company culture and values, and be prepared to discuss how your skills and experience align with Reddit's mission and vision.
  • Prepare to discuss your experience with agile development methodologies and version control systems, and be ready to provide examples of your experience with cloud-based infrastructure.
  • Be prepared to negotiate your salary and benefits package, and do your research on the market rate for similar positions.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.