Description
About the Role
We're looking for a skilled Site Reliability Engineer (SRE) to join our platform engineering team supporting a critical enterprise grade .NET application hosted on Azure, with upcoming cloud transformation initiatives targeting Google Cloud Platform (GCP).
This role demands strong operational discipline, infrastructure-as-code expertise, and a solid understanding of both application and database ecosystems. You will work closely with development and infrastructure teams to improve system reliability, performance, scalability, and automation.
You will create a bridge between development and operations by applying a software engineering mindset to system administration topics. Your time will split between operations / on-call duties and developing systems and software to continuously improve system reliability and performance.
Key Responsibilities
- Manage, monitor, and scale .NET-based applications hosted on Azure
- Write and manage infrastructure as code using Terraform
- Troubleshoot and optimize Microsoft SQL Server (MSSQL) databases
- Ensure high availability and reliability of application services across environments
- Automate build / deploy / monitoring pipelines (CI / CD, alerting, healing)
- Participate in on-call rotation and own incident management / resolution workflows
- Contribute to GCP migration planning and foundational infrastructure setup
- Identify and implement SLOs, SLIs, and error budgets
- Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
- Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don't get paged when it doesn't).
- Use practices from DevOps and GitOps to improve automation and processes to make self service possible.
- Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
Requirements
Must-Have Skills :
4–6 years in Site Reliability Engineering, DevOps, or Infrastructure Engineering rolesExperience with Azure cloud services and resource managementHands-on with Terraform (HCL) for infrastructure provisioningProficient in .NET application support and performance tuningStrong working knowledge of Microsoft SQL Server (MSSQL) or MySQLSolid understanding of monitoring tools, logging frameworks, and observability practicesExperience with CI / CD pipelines (Azure DevOps preferred)Nice-to-Have Skills :
Exposure to Google Cloud Platform (GCP) and multi-cloud architectureExperience working on Windows-based server environmentsFamiliarity with containerization (Docker, Kubernetes)Prior participation in cloud migration or modernization projectsMinimum Qualifications
BS in Computer Science, Information Technology, Business / Management Information Systems or related field or equivalent experience.Typically minimum of 4 years relevant experienceShow more
Show less
Skills Required
Terraform, Mysql, .NET, Monitoring Tools, Microsoft Sql Server, Azure, Azure Devops