Job Role : SRE Lead
Exp : 5+ years
Location : Kochi
Job Description
A Site Reliability Engineer is a professional who acts as a warrior to monitor, protect customer applications, taking charge on operational tasks to ensure the efficient functioning of a system.
They are responsible for monitoring, automating, and improving the reliability, performance, and availability of any applications.
Mandatory to have working experience as SRE Lead or Techno function role as Site Reliability Engineer (SRE) at customer work location in e-com domain.
- Be an organization face at customer site collaborating with organization leadership.
- Must have knowledge of Production Application Support.
- Working experience in interacting with Team / Onsite / customers who provide 24x7 coverage, help & guide during India night coverage.
- Should know how to gather SRE requirement from Tech and non-tech aspect from customer.
- Must have excellent knowledge on ensuring reliability and scalability of applications
- Should have excellent automation skills to automate repetitive tasks, reduce false alarms using python and or any other languages.
- Working experience on how to gather requirements on health of applications, services to monitor, setting service levels.
- Must have Level 1, Level 2 and Level 3 support experience in eCommerce platforms.
- Hands on experience in Monitoring, Logging, Alerting, Dashboarding, and report generation in any monitoring tools such as AppDynamics / Splunk / Dynatrace / Datadog / CloudWatch / ELK / Prome / New Relic).
- Must have knowledge in ITIL framework specifically on Alerts, Incident, change management, CAB, Production deployments, Risk and mitigation plan, SLA, SLI
- Should be able to lead P1 calls, brief about the P1 to customer, proactive in gathering leads / customers into the P1 calls till RCA. Experience working with postman.
- Should have knowledge on building and executing SOP, runbooks, handling any ITSM platforms (JIRA / ServiceNow / BMC Remedy)
- Must know how to work with the Dev team, cross functional teams across time zones.
- Should be able to generate WSR / MSR by extracting the tickets from ITSM platforms.
If interested, please share your resume with vandana.tripathi@amplesuccesshr.com. Also, don't forget to follow our company page "@Ample Success Hr Solutions" for regular updates on job openings.