Database Reliability Engineer - Core Team
WFA Digital Insight
As demand for real-time analytics and data warehousing continues to surge, with the global cloud analytics market projected to reach $40 billion by 2026, professionals with expertise in database reliability engineering are highly sought after. ClickHouse, a leader in real-time analytics, is looking for a skilled Database Reliability Engineer to join its core team. With over 3,000 customers and a growth rate of 250 percent year over year, this role offers a chance to work with a fast-growing company. Candidates should have a strong understanding of distributed database internals, SQL, and cloud computing platforms, as well as excellent problem-solving skills. Before applying, candidates should be prepared to showcase their experience in reliability engineering, QA, or customer-facing engineering, and demonstrate their ability to thrive in a fast-paced, global team environment.
Job Description
About the Role
The Database Reliability Engineer will be part of ClickHouse's core team, responsible for ensuring the reliability, availability, scalability, and performance of ClickHouse core. This role is crucial in maintaining the high standards of service that ClickHouse provides to its customers. The successful candidate will collaborate with various teams, including Control Plane, Dataplane, Security, Support, and Operations, to implement ClickHouse in the best way possible for customers.
About ClickHouse
ClickHouse is a fast-growing private cloud company that has been recognized on the 2025 Forbes Cloud 100 list. With over 3,000 customers and a growth rate of 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads. The company has recently validated its momentum with a $400M Series D financing round and has seen significant adoption from customers such as Capital One, Lovable, Decagon, Polymarket, and Airwallex.
What You Will Do
- Continuously improve the reliability and performance of ClickHouse core
- Improve and create metrics and alerts for ClickHouse to identify and prevent problems in production
- Investigate the root causes of problems, submit bug fixes and issue reports, and suggest improvements
- Enhance and refine incident response processes and post-mortem analysis for outages related to ClickHouse core
- Plan, enable, and drive chaos engineering initiatives across Engineering teams
- Manage on-call processes to respond to performance and reliability issues
- Collaborate with different teams to implement ClickHouse in the best way for customers
- Own engineering escalation management and response
- Conduct blameless postmortems and drive continuous improvement of how ClickHouse is run and optimized in the cloud
What We Are Looking For
- Bachelor’s or Master’s degree in Computer Science or a related field
- At least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering
- Previous experience operating ClickHouse or other SQL databases in production
- Excellent understanding of distributed database internals and SQL
- Scripting experience with Shell or Python, and ability to read and understand C++ code
- Knowledge of cloud computing platforms such as AWS, Azure, or Google Cloud Platform
- Strong problem-solving and production debugging skills
- Ability to thrive in a fast-paced environment as part of a global team
Nice to Have
- Experience with Adjust and Excel
- Familiarity with ClickHouse or other real-time analytics platforms
- Certification in cloud computing or related fields
Benefits and Perks
- Opportunity to work with a fast-growing company in the cloud analytics market
- Collaborative and dynamic work environment
- Flexible remote work arrangements
- Professional development opportunities
- Access to cutting-edge technologies and tools
- Competitive compensation package
How to Stand Out
- Ensure your resume and cover letter highlight your experience in reliability engineering, QA, or customer-facing engineering, and demonstrate your understanding of distributed database internals and SQL.
- Familiarize yourself with ClickHouse and its real-time analytics platform to stand out in the application process.
- Be prepared to provide specific examples of your problem-solving skills and production debugging experience.
- Showcase your ability to work in a fast-paced, global team environment and thrive in a remote work setup.
- Research the company's culture and values to demonstrate your enthusiasm for the role and the company.
This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.