Maintain and monitor the availability of cloud infrastructure; troubleshoot, identify, and resolve production-level infrastructure issues.
Using Infrastructure as Code (IaC) tools, develop and maintain automation for provisioning, configuration management, and deployment.
Establish and maintain monitoring and alerting systems for incident detection and response.
Demonstrate strong customer focus.
Collaborate with internal teams and customers during incidents: explain the issue, recommend immediate mitigations, and provide long-term solutions.
Investigate customer escalations and work closely with the engineering, support, and sales teams to implement a solution.
Perform a postmortem analysis of system failures and implement corrective measures as necessary.
Participate in the rotational on-call schedule and be available during emergencies.
A demonstrated track record of optimising cloud infrastructure costs. Monitor and control the use of cloud resources, implement cost-saving measures, and provide recommendations for optimising cloud costs.
Experience implementing security best practices and compliance measures in production environments.
Experience with security audits, vulnerability assessments, and the implementation of security controls to protect sensitive data and ensure regulatory compliance.
Profile :
3+ years of experience with a focus on cloud infrastructure automation, configuration management, and deployment automation. A significant portion of this experience should be with AWS on mid- to large-size deployments.
Experience designing, architecting, and running large-scale cloud infrastructure.
Experience working with reverse proxy, web server, load balancing, and CDN services.
Familiarity with security best practices and compliance frameworks such as PCI DSS
Strong interpersonal and communication skills (including oral, written, and listening skills)
Experience with stress testing and tuning production systems using tools such as k6 and Locust
Experience in using AWS Cost Explorer, AWS Budgets, and AWS Cost and Usage Reports, and in optimising costs to ensure efficient resource use.
Skills :
Experience with AWS in designing, deploying, and managing cloud infrastructure.
Experience with scripting languages such as Python and Bash
Experience managing reverse proxies / web servers at large-scale production level.
Experience with Infrastructure as Code tools
Experience working with Kafka, Elasticsearch, and RabbitMQ
Experience with observability tools such as Prometheus, Grafana, and Loki
(ref : hirist.tech)