This job offer is not available in your country.

Angel One - Site Reliability Engineer - Monitoring Tools

ANGEL ONE LIMITEDBangalore

30+ days ago

Job description

Job Title : SRE2

Location : Bengaluru, Karnataka

What you will do :

Design, write and build tools to improve the reliability, latency, availability and scalability.
Engender reliability and availability starting with metrics and measurements
Enable scaling by providing tools, developing training and / or augmenting processes
Build tools / automate to prevent re-occurrence of problems in mission critical products / services.
Engages with the development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes.
Dynamically manage workload of the SRE team, drive and deliver on multiple priorities simultaneously
Provide thought leadership in architecture, design, product features and provide feedback on products built on a variety of platforms
Design, code, test, and deliver software to automate manual operational work
Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
Identify application patterns and analytics in support of better service level objectives
Design self-healing and resiliency patterns
Design automated software and product upgrades, change management, and release management solutions
Coach or manage teams as applicable
Participate in the 24x7 support coverage as needed
Should be self-motivated and willing to work under minimum surveillance

Who you are :

Bachelor's degree or equivalent experience in an software engineering discipline

5 to 7 years of experience.

Experience in Software development in one or more of the following programming language is must : Python / go,

Expertise in at least one technology stack designing, coding, testing, and delivering software

Experience in Distributed computing.

Strong experience in designing and building highly available high-volume messaging infrastructure with Apache Kafka on AWS and On-prem (e.g. stretch cluster, active / active or active / passive) using Mirror Maker or other replication tools.

Good experience with Schema Registry, Kafka connectors (source and sink) and KSQL, have worked with Kafka brokers, Zookeeper, Topics, connectors for Setup and administration.

Strong experience in Enterprise Redis, cluster setup, administration, reliability and observability.

Strong experience in setting up monitoring and management with tools.

Working knowledge of monitoring, management tools and data growth management.

Devops Tools experience in Jenkins / Ansible / Git workflows / CICD

Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm

Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)

Excellent debugging and troubleshooting skills.

Experience with infrastructure provisioning tools like Terraform or Ansible.

Hands-on experience deploying and operating applications using IaaS and PaaS Amazon AWS.

(ref : hirist.tech)

Create a job alert for this search

Site Reliability Engineer • Bangalore

Related jobs

Promoted

Site Reliability Engineer - Observability Services

TeamWare SolutionsBangalore

Role Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on observability.The ideal candidate will have 5-8 years of experie...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

Vbeyond corporationBangalore

SRE (Site Reliability Engineer 2) We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tool...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

ConcordBengaluru, IN

Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 17 days ago

Promoted

Site Reliability Engineer

ViewSonicBengaluru, Karnataka, India

Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions in...Show moreLast updated: 16 days ago

Promoted

Senior Site Reliability Engineer- ELK Expert

iVedha Inc.hosur, tamil nadu, in

Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago

Promoted

Monitoring Engineer - Site Reliability

Insight Global, LLCBangalore

Job Title : LLM System Monitor - Site Reliability Engineer (SRE).Location : Bangalore, India (Hybrid - Onsite 3 Days / Week). Type : Full-Time (Insight Global at Cisco).Required Skills & Experience : ...Show moreLast updated: 9 days ago

Promoted

Site Reliability Engineer

Core Minds Tech SOlutionsHosur

Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago

Promoted

Siemens - L1 Site Reliability Engineer - Monitoring Tools

Siemens LimitedBangalore

Role : SRE Exp : 4 to 6 years Location : Bangalore We know that the only way a...Show moreLast updated: 10 days ago

Promoted

System Engineer

Netsmore Technologieshosur, tamil nadu, in

Systems Engineer – Level 3 (Internal).Mandatory skills : AWS cloud infrastructure + OKTA administration.The L3 Systems Engineer role is more engineering-focused than traditional system admin roles.I...Show moreLast updated: 4 days ago

Promoted

Site Reliability Engineer

XebiaBengaluru, IN

AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 26 days ago

Promoted

Site Reliability Engineer

TavantBengaluru, Karnataka, India

With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 25 days ago

Promoted

Senior Site Reliability Engineer

WSO2Bengaluru, Karnataka, India

Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 30+ days ago

Promoted

Observability - Engineer Site Reliability [T500-20244]

Albertsons Companies IndiaBengaluru, Karnataka, India

About Albertsons Companies Inc.As a leading food and drug retailer in the United States, Albertsons Companies, Inc.Our well-known banners across the United States, including Albertsons, Safeway, Vo...Show moreLast updated: 7 days ago

Promoted

Principal / Chief Site Reliability Engineer - Observability Services

CollaberaBangalore

Job Description : As a Principal / Chief Site Reliability Engineer, you will play a critical role in designing, developing, and maintaining scalable and highly reliabl...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

WhiteLotus Talent PartnersBengaluru, Karnataka, India

L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

Uplershosur, tamil nadu, in

Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 23 days ago

Promoted

Site Reliability Engineer

Amicon Hub ServicesBengaluru, Karnataka, India

Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 6 days ago

Promoted

Site Reliability Engineer - Chaos Management

Xebiahosur, tamil nadu, in