We are looking for a Site Reliability Engineer (SRE) / Platform Engineer to design, build, and maintain scalable, resilient, and efficient infrastructure for our Data Platform. This role focuses on developing platform solutions, improving system reliability, automating infrastructure, and enhancing developer productivity. You will work closely with Software Engineers, Architects, Data Engineers, DevOps, and Security teams to create a highly available and performant platform.

Key tasks & accountabilities

Platform Engineering & Automation: Design and implement scalable, automated, and self-healing infrastructure solutions.
Infrastructure as Code (IaC): Develop and maintain infrastructure using Terraform.
Observability & Monitoring: Implement and maintain monitoring, logging, and alerting systems using Datadog and Grafana.
Unity Catalog: Implement and maintain Unity Catalog, metadata, Delta Sharing, and identity and access management.
Databases: Implement and maintain relational databases, data warehouses, and NoSQL databases.
Power BI: Manage the entire Power BI tenant within ABI.
CI / CD & DevOps Practices: Optimize CI / CD pipelines using ADO, GitHub Actions, or ArgoCD to enable seamless deployments.
Cloud: Architect and manage cloud-native platforms using Azure, AWS, or Google Cloud Platform (GCP).
Networking: Manage and secure the data platform network by enforcing network security policies, integrating on-premises networks with cloud environments, and configuring VNETs, subnets, and routing policies.
Disaster Recovery: Develop and maintain the Disaster Recovery environment and conduct periodic Disaster Recovery drills.
Resilience & Incident Management: Improve system reliability by implementing fault-tolerant designs and participating in L4-level resolution.
Security & Compliance: Ensure platform security by implementing best practices for cloud-based data platforms, access controls, and zone-specific compliance requirements.
Developer Enablement: Build internal tools and frameworks to enhance the developer experience and enable self-service capabilities.

Qualifications, Experience, Skills

Level of educational attainment required: Bachelor's or Master's degree in Computer Science, Information Technology, or a related field, with a minimum of 3 years of experience.

Certifications (any one of the following):
Azure Developer Associate
Azure DevOps Engineer Expert
Azure Solutions Architect Expert
Azure Data Engineer Associate
Google Professional SRE Certification
AWS Certified DevOps Engineer Professional

Technical Expertise

Programming Languages: Proficient in languages and tools such as Bash, PowerShell, Terraform, Python, Java, etc.
Cloud Platforms: Expertise in Azure, AWS, or GCP cloud services.
Infrastructure as Code (IaC): Experience with Terraform.
Unity Catalog: Deep understanding of Databricks architecture, schema and table structure, metadata, Delta Sharing, and identity and access management.
Databases: Deep understanding of database concepts and experience with relational databases, data warehouses, and NoSQL databases.
Kubernetes & Containers: Hands-on experience with Kubernetes, Helm, and Docker in production environments.
Power BI: Deep understanding of Power BI administration, workspace management, dashboard development, performance optimization, and integration.
Monitoring & Logging: Experience with observability tools like Datadog and Grafana.
CI / CD & DevOps: Experience with GitHub, Azure DevOps, GitHub Actions, or ArgoCD.
Networking & Security: Experience with cloud networking, firewalls, VPNs, DNS, policy deployment, and vulnerability remediation.
Disaster Recovery: Deep understanding of cloud DR concepts and high availability requirements.

And above all of this, an undying love for beer!

We dream big to create a future with more cheers.

Skills Required
GitHub, Azure DevOps, Schema, Metadata