Talent.com
This job offer is not available in your country.
Airtel - Lead Site Reliability Engineer - DevOps

Airtel - Lead Site Reliability Engineer - DevOps

AIRTEL INTERNATIONAL LLPGurugram
30+ days ago
Job description

Women Candidates preferred

Position Overview :

SRE- Lead will be responsible for managing a team of engineers focused on software deployments and site reliability engineering practices. The role will involve overseeing the deployment process of software applications and services, implementing automation, monitoring, and alerting tools, and ensuring the reliability, availability, and performance of critical systems and services. The Deployments and SRE Manager will collaborate closely with development, operations, and other stakeholders to drive a culture of DevOps and SRE, aiming to improve system stability, scalability, and Responsibilities :

  • Leadership : Lead and mentor a team of engineers responsible for software deployments and SRE practices. Set clear expectations, provide coaching and feedback, and foster a collaborative and innovative team environment.
  • Deployment Management : Implement and manage the deployment process for software applications and services, including Monthly release management of AADL products, change management, and rollback procedures. Drive continuous improvement in deployment processes and tools to increase efficiency and minimize risk.
  • Site Reliability Engineering : Implement best practices in site reliability engineering, including system monitoring, alerting, capacity planning, performance optimization, and incident management. Collaborate with development teams to ensure application architectures are resilient and scalable, and drive the adoption of DevOps and SRE principles and practices.
  • Automation and Tooling : Evaluate, implement, and maintain relevant automation and tooling to streamline operational tasks, reduce manual effort, and improve system reliability. This may include configuration management, containerization, and orchestration technologies, well versed with Blue Green and Canary Deployment Model.
  • Incident Management : Lead incident management efforts, including incident response, root cause analysis, and post-incident reviews. Collaborate with cross-functional teams to minimize impact and restore services as quickly as possible. Implement preventive measures to avoid future incidents and drive continuous improvement in incident management processes.
  • Monitoring and Alerting : Implement and maintain effective system monitoring and alerting tools to proactively detect and resolve issues. Define and track key performance indicators (KPIs) and service level objectives (SLOs) to measure system reliability, performance, and Collaboration : Collaborate closely with development, operations, security, network and other stakeholders to ensure smooth operations and timely resolution of issues. Foster strong relationships and effective communication channels to promote collaboration and Documentation : Maintain comprehensive documentation of deployment processes, system configurations, procedures, and incident reports. Ensure documentation is up-to-date, accurate, and accessible to relevant :
  • Bachelor's degree in Computer Science, Information Technology, or related field.
  • Minimum of 7 years of experience in software engineering, DevOps, deployments, or site reliability engineering.
  • Strong technical skills in deployment processes and tools, such as release management, change management, and rollback procedures.
  • Proficient in scripting and automation using tools like Python, Bash, or PowerShell.
  • Solid understanding of DevOps principles, Agile methodologies, and ITIL practices.
  • Strong technical skills in CI / CD tools and practices, such as Jenkins, Git, Docker, Kubernetes, and related technologies.
  • Strong leadership skills with experience in managing and mentoring technical teams.
  • Excellent problem-solving, analytical, and communication skills.
  • Ability to work independently, prioritize tasks, and manage time effectively.
  • Experience with incident management tools and processes, such as ITIL Incident Management, and familiarity with ITSM frameworks.
  • In-depth knowledge of relational database management systems (RDBMS) such as Oracle, Microsoft SQL Server, MySQL, or PostgreSQL.
  • Knowledge of cloud computing platforms, preferably AWS is a plus.
  • Relevant certifications, such as AWS Certified DevOps Engineer, Kubernetes Certified Administrator, or Site Reliability Engineering (SRE) certifications, Grafana expertise are desirable.

ref : hirist.tech)

Create a job alert for this search

Site Reliability Engineer • Gurugram

Related jobs
Lead Site Reliability Engineer

Lead Site Reliability Engineer

cvent india pvt ltdINDIA
Cvent is a global meeting, event, travel, and hospitality technology leader, with more than 4000 employees worldwide.As a leading cloud-based technology company, we have over 28,000 customers, in...Show moreLast updated: 30+ days ago
Lead Site Reliability Engineer

Lead Site Reliability Engineer

ZenotiINDIA
Zenoti provides an all-in-one, cloud-based software solution for the beauty and wellness industry.Our solution allows users to seamlessly manage every aspect of the business in a comprehensive mobi...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

trellixINDIA
Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work.Our comprehensive, GenAI-powered platform helps organizations confronted by todays most advanced threats g...Show moreLast updated: 30+ days ago
  • Promoted
Xebia - Senior / Lead / Principal Site Reliability Engineer

Xebia - Senior / Lead / Principal Site Reliability Engineer

Xebia IT Architects India Pvt LtdGurgaon
Role : Site Reliability Engineer Experience Range : 7 - 12 Years Location : Pune & Chennai, Bangalore , Gurgaon Mode of Work : Hyb...Show moreLast updated: 3 days ago
  • Promoted
Site Reliability Engineer - DevOps

Site Reliability Engineer - DevOps

Vikash TechnologiesGurugram
Key Responsibilities : - Monitor, maintain, and improve system reliability, availability, and performance <...Show moreLast updated: 4 days ago
  • Promoted
Gemini Solutions - DevOps / Site Reliability Engineer

Gemini Solutions - DevOps / Site Reliability Engineer

Gemini SolutionsGurgaon
POSITION SUMMARY : In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing ...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

PhonepeINDIA
PhonePe is Indias leading digital payments company with 50 crore (500 Million) registered users and 3.Million) merchants covering over 99 PERCENT of the postal codes across India.On the back of it...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

NatWest GroupINDIA
Join us as a Site Reliability Engineer.In this key role, youll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change manage...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

Citadel SecuritiesGurugram
Candidates who have less than 3 years of experience should possess : .Good knowledge of UNIX / Linux command line.Good understanding of the usage of TCP / IP and UDP networking in applications.Basic unde...Show moreLast updated: 30+ days ago
Senior Site Reliability Engineer

Senior Site Reliability Engineer

everbridgeINDIA
Everbridge (NASDAQ : EVBG) empowers enterprises and government organizations to anticipate, mitigate, respond to, and recover stronger from critical events. In todays unpredictable world, resilient o...Show moreLast updated: 30+ days ago
  • Promoted
Airtel - Lead Site Reliability Engineer - DevOps

Airtel - Lead Site Reliability Engineer - DevOps

AIRTEL INTERNATIONAL LLPGurgaon
Women Candidates preferred Position Overview : SRE- Lead will be responsible for managing a team of e...Show moreLast updated: 30+ days ago
Lead Site Reliability Engineer

Lead Site Reliability Engineer

UnitedHealth GroupGurgaon, Haryana, IN
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connect...Show moreLast updated: 5 days ago
Site Reliability Engineer

Site Reliability Engineer

tcg digital solutions pvt ltdINDIA
Bachelors or masters degree in Computer Science, Engineering, or related field.Essential Skills (Two top skills).AWS Ecosystem EKS, EC2, DynamoDB, Lambda, etc. The SRE team should include some memb...Show moreLast updated: 30+ days ago
Senior Site Reliability Engineer

Senior Site Reliability Engineer

autodesk india pvt ltdINDIA
Do you want the opportunity to be part of a startup environment working on a new product seeking to become a world-leading integration platform? Are you looking to be at the forefront of innovative...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

Newnovation SolutionsGurugram, Haryana, India
We are seeking a proactive and technically strong Site Reliability Engineer (SRE) to ensure the stability performance and scalability of our Data Engineering Platform. You will work on cutting-edge ...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer - Docker

Site Reliability Engineer - Docker

questhiringGurgaon
Responsibilities : - You will architect and build for high performance and scale.You will work on continuously improving...Show moreLast updated: 11 days ago
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Nvidia Graphics Pvt LtdINDIA
NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years.Its a unique legacy of innovation thats motivated by outstanding technology and amazing peo...Show moreLast updated: 30+ days ago
  • Promoted
Lead Site Reliability Engineer

Lead Site Reliability Engineer

CventGurugram, Haryana, India
Site Reliability is about combining development and operations knowledge and skills to help make the organization better. Whether you have a development background and are interested in learning mor...Show moreLast updated: 30+ days ago
Site Reliability Engineer

Site Reliability Engineer

Qure.aiINDIA
AI is one of the fastest-growing startups in India, which develops Artificial intelligence-enabled products and platforms for healthcare diagnostics. We create cutting-edge solutions that positively...Show moreLast updated: 30+ days ago
  • Promoted
Airtel - Lead - Land Acquisition

Airtel - Lead - Land Acquisition

Bharti AirtelGurugram, India
DGM - Land are seeking a highly motivated and experienced professional (10 - 12 years) to join our team in managing captive land acquisitions for Data Centers through private parties as well as St...Show moreLast updated: 30+ days ago