Job Title : ClickHouse Support Engineer (L1 / L2 / L3)
Role Overview :
We are seeking a skilled ClickHouse Support Engineer to serve as the first line of defense for all ClickHouse support tickets (L1, L2, and L3). You will be responsible for diagnosing and resolving incidents within defined SLAs, enhancing system reliability, and continuously improving our support playbooks. This role requires a mix of database expertise, problem-solving skills, and a strong customer-centric approach.
Key Responsibilities :
- Incident Management : Triage, investigate, and resolve L1L3 support tickets for ClickHouse clusters.
- Performance Optimization : Diagnose and address query performance issues, configuration errors, and hardware-related problems.
- Root Cause Analysis : Conduct in-depth investigations to identify underlying issues and implement permanent solutions.
- Escalation : Escalate complex or high-impact incidents to senior administrators or architects when necessary.
- Documentation : Maintain and update detailed support documentation, runbooks, and troubleshooting guides.
- SLA Compliance : Monitor incident timelines and provide timely, clear updates to all stakeholders.
Required Skills & Experience :
Minimum 7 years of experience in a database or infrastructure support role.Hands-on ClickHouse expertise, including SQL query tuning and query profiling.Strong Linux troubleshooting skills (logs, systemd, kernel parameters).Experience working with ticketing systems such as Jira or ServiceNow.Excellent written and verbal communication skills with a strong customer-service mindset.Preferred Skills :
Scripting skills for diagnostic automation (Bash, Python).Experience with monitoring tools such as Prometheus and Grafana.Previous involvement in escalation handling and process improvements.(ref : hirist.tech)