Senior Search Applications Performance Engineer
WFA Digital Insight
The demand for skilled search applications performance engineers is on the rise, with the global AI market expected to reach
Job Description
About the Role
As a Senior Search Applications Performance Engineer at Cohere, you will play a critical role in optimizing and scaling the company's search applications and infrastructure. This will involve working at the intersection of search technology and performance engineering to deliver fast, reliable, and intelligent search experiences. You will be part of a team that is passionate about pushing the boundaries of what is possible with AI-powered search.The search applications performance engineer role is a key part of Cohere's mission to scale intelligence and serve humanity. By joining the Compass team, you will have the opportunity to work on complex technical challenges and contribute to the development of new tool surfaces designed for agentic users. Your work will directly impact the user experience, driving document understanding capabilities and supporting the development of new AI-powered search applications.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do, and the company believes that a diverse range of perspectives is essential for building great products. As a Senior Search Applications Performance Engineer, you will be working closely with cross-functional teams, including modeling teams, product teams, and engineering teams.
What You Will Do
- Implement performance monitoring and optimization strategies for Compass search services and the integration with North
- Develop and maintain benchmarking frameworks to evaluate search model performance and infrastructure efficiency
- Collaborate with modeling teams to optimize search models for faster response times and reduced resource consumption
- Work on scaling the search services while maintaining high availability and low latency
- Partner with product teams to translate performance requirements into technical implementations
- Develop and optimize new tool surfaces for agentic search experiences
- Profile applications hosted on a mix of CPU, GPU, and autoscaled compute nodes to identify performance bottlenecks
- Communicate technical performance metrics effectively to cross-functional teams
- Participate in the development of new search applications and features
- Collaborate with the engineering team to ensure seamless integration of search applications with other systems
What We Are Looking For
- Proficiency in Python and backend search technologies (OpenSearch, ElasticSearch, Weaviate, FastAPI)
- Experience with data or evaluation pipelines and performance benchmarking
- Experience profiling applications hosted on a mix of CPU, GPU, and autoscaled compute nodes
- Ability to communicate technical performance metrics effectively to cross-functional teams
- 4+ years of experience contributing to production environments
- Strong understanding of search technology and performance engineering principles
- Experience with cloud-based infrastructure and containerization (e.g., Docker, Kubernetes)
- Familiarity with agile development methodologies and version control systems (e.g., Git)
Nice to Have
- Experience with Kubernetes, Helm, and infrastructure deployment
- Familiarity with GPU-based model inference optimization (ONNX, Triton, vLLM)
- History working in the search and discovery domain
- Experience with machine learning frameworks and libraries (e.g., TensorFlow, PyTorch)
Benefits and Perks
- Competitive salary and benefits package
- Opportunity to work on cutting-edge AI technology and contribute to the development of new search applications
- Collaborative and dynamic work environment with a team of experienced professionals
- Flexible working hours and remote work options
- Access to the latest tools and technologies
- Professional development opportunities, including training and conference attendance
- Comprehensive health and dental benefits
- Generous parental leave policy
- Weekly lunch stipend and in-office snacks
- Co-working stipend for remote workers
- Six weeks of vacation (30 working days)
How to Stand Out
- When applying for this role, be sure to highlight your experience with search technologies and performance engineering principles.
- Showcase your ability to communicate complex technical ideas effectively to cross-functional teams.
- Be prepared to provide examples of your experience with benchmarking frameworks and performance optimization strategies.
- Demonstrate your understanding of cloud-based infrastructure and containerization.
- If you have experience with machine learning frameworks and libraries, be sure to highlight this in your application.
- Consider creating a portfolio of your work to demonstrate your skills and experience.
- Research the company culture and values to ensure you are a good fit for the role and the organization.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere. Browse more remote jobs across all categories.