Talent.com
Lead Platform Engineer - Site Reliability

Lead Platform Engineer - Site Reliability

Prometheus consultingHyderabad
24 days ago
Job description

Description : What You Will Own :

  • Build, manage, and mentor a high-performing Platform Engineering team, fostering a culture of collaboration, accountability, and continuous development.
  • Ensure timely and efficient delivery of the teams project and reactive work to support the needs of the business in alignment with Product and Technology Roadmaps
  • Provide coaching, training, and career development opportunities to strengthen the team's skills and performance
  • Ensure effective planning and allocation of team capacity through project management, planning / estimation, risk management and predictable delivery while creating visibility of all work
  • Provide technical leadership and mentoring across teams through knowledge sharing sessions, pair programming, code reviews and solution design
  • Translate business problems and complex requirements into simple, tractable technical solutions
  • Contribute to the preparation and execution of Product & Technology roadmaps
  • Implement and maintain monitoring / alerting / logging systems to identify and respond to incidents
  • Conduct / participate in Root Cause Analyses (RCAs) and blameless post-mortems
  • Participate in on-call rotations to ensure system reliability and rapid incident response.
  • Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
  • Collaborate with product engineering teams to design / build fit-for-purpose and observable software
  • Lead innovation within the platform team, and ensure that successful ideas are evangelized and propagated across the company

Required Qualifications :

  • Bachelor's degree in Computer Science, Information Technology, or similar
  • Recent and proficient experience development in C# / .NET
  • Proven experience (3+ years) Leading Platform Engineering or Site Reliability Engineering teams
  • Recent and proficient experience (8+ years) in C# / .NET development
  • Experience (2+ years) working on a SaaS platform, preferably in a leadership capacity
  • Proven track record of designing, building and operating highly-available and performant production environments, preferably using Kubernetes
  • Proficiency with one or more public cloud providers such as Azure, AWS or GCP
  • Proficiency using Infrastructure as Code (IaC) tools such as Terraform
  • Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.
  • Strong business acumen and strategic thinking capabilities
  • Exceptional leadership, communication, and interpersonal skills, with the ability to engage at all levels with non-technical as well as technical audiences
  • Ability to work in a fast-paced, dynamic environment and manage multiple priorities with a positive attitude
  • Preferred Qualifications :

  • Relevant certifications in cloud platforms (e.g., Microsoft Certified : Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus
  • Experience in database management / performance tuning, particularly MSSQL.
  • Experience working with frontend technologies (React)
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad