Talent.com
This job offer is not available in your country.
Associate D&A Site Reliability Engineer (SRE)

Associate D&A Site Reliability Engineer (SRE)

ConfidentialMumbai
6 days ago
Job description

How you will contribute

You will :

  • Execute the business analytics agenda in conjunction with analytics team leaders
  • Work with best-in-class external partners who leverage analytics tools and processes
  • Use models / algorithms to uncover signals / patterns and trends to drive long-term business performance
  • Execute the business analytics agenda using a methodical approach that conveys to stakeholders what business analytics will deliver

What you will bring

A desire to drive your future and accelerate your career and the following experience and knowledge :

  • Using data analysis to make recommendations to analytic leaders
  • Understanding in best-in-class analytics practices
  • Knowledge of Indicators (KPIs) and scorecards
  • Knowledge of BI tools like Tableau, Excel, Alteryx, R, Python, etc. is a plus
  • Job Summary

  • Experience :   6 + years
  • Purpose :   Lead the full lifecycle of a cloud-native platform on GCP, ensuring security, reliability, scalability, and cost-efficiency through SRE practices, Terraform automation, observability, and security / compliance enforcement.
  • Key Contributions :

  • Product Ownership  / mindset  :   Drive continuous improvement in platform performance, reliability, security, and usability.
  • Security & Vulnerability Management :   Proactively identify , remediate, and prevent security vulnerabilities. Automate compliance checks and vulnerability scans.
  • Reliability Engineering :   Own SLIs / SLOs, build self-healing systems with clear incident response.
  • Infrastructure Automation :   Architect and govern reusable, secure Terraform-based GCP infrastructure.
  • Cost Governance :   Integrate FinOps principles to optimize D&A workload and resource consumption cost, performance, and utilization .
  • Observability & Insights :   Implement comprehensive monitoring to identify trends, prevent issues, and improve reliability.
  • Security Automation :   Enforce security policies as code (shift left security) and support security audits.
  • Collaboration :   Partner with Dev, FinOps, CloudOps , and Security teams to ensure alignment.
  • Required Skills & Experience :

  • 6+ years in SRE, DevOps, or Cloud Platform Engineering with end-to-end platform ownership experience.
  • Expert in Terraform (secure modules, policy-as-code, Terraform Cloud / Enterprise).
  • Strong GCP knowledge (GKE, Compute Engine, IAM, VPC, Cloud Storage, Cloud Armor, Identity Aware Proxy).
  • Deep Kubernetes (GKE) experience (autoscaling, network policies, RBAC, PSPs, Kubernetes security).
  • Proven skills in managing SLIs / SLOs and automated incident response.
  • Strong background in cloud security and vulnerability management. Use of tools like Sonarqube , Wiz, Tenable , GitHub actions and Dependabot .
  • Experience with observability stacks (Prometheus, Grafana, Stackdriver , Datadog) and root cause analysis.
  • Hands-on CI / CD experience ( Git hub CI / CD, ArgoCD , Jenkins) integrated with Terraform.
  • Proficient in Python, Bash, or Go for automation.
  • Familiar with FinOps best practices and compliance frameworks (ISO, SOC2, etc.).
  • Bonus Points :

  • GCP certifications (Cloud Architect, DevOps Engineer).
  • Multi-cloud (AWS / GCP) Terraform experience.
  • Terraform Cloud / Enterprise and policy-as-code (Sentinel, OPA) experience.
  • AI-driven monitoring or SRE tooling development background.
  • Workload identity federation and GKE security hardening experience.
  • Soft Skills :

  • Strategic Thinking
  • Leadership & Mentorship
  • Effective Communication
  • Analytical & Detail-Oriented
  • Continuous Learner
  • Skills Required

    Reliability Engineering, Devops, Data Analysis, Bi Tools, SRE

    Create a job alert for this search

    Site Reliability Engineer • Mumbai