Immediate Start! Principal Site Reliability Engineer
Rakuten IndiaIndia
9 hours ago
Job description
Responsibilities :
Design, develop SLA, SLO, SLI of services within the Business Unit.
Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend operation, ensuring high availability, regular application release, troubleshooting, middleware performance tuning and collaborating with functional, technical team members to provide high quality services.
Involve in automation of routine manual production / non-production operation using technologies like Ansible, Chief etc. Will be the key person to propose, implement automation to increase productivity, quality.
Always improve the system performance, reliability
Should have service ownership mind & proactively able to react to the production issues.
Propose new technologies, tools etc. to improve the whole process of development, testing and production operations. Strong self-learning ability, motivation to work on new Technologies.
Work closely with developers, product manager, project manager, team lead, security, and QA team members in different location (Singapore, Japan, India etc.)
Exp : 8 Years - 14 Years
Qualifications :
Must-have
Over 8 years of experience on SRE, handling high traffic production system independently, troubleshooting (middleware, infra), automation, regular operation etc.
Implement Site Reliability Engineering principles regarding performance, reliability, monitoring, alerting in Production environment
Experience in management of large-scale service.
Experience in design and construction of public cloud (Ex. GCP, Azure), preferably GCP.
Good knowledge in CI / CD / CT pipeline using tools such as Jenkins / Bamboo and VCS such as GIT / SVN
Strong knowledge in LINUX based system operation and extensive skills in Linux commands.
Hands-on experience in Unix / Linux / Shell / Python scripting
Experience in developing and operating one or more of following systems : Kubernetes, Nginx, ELK stack, Hadoop, etc.
Identify process gaps and recommend on best practices based on industry standards.
Provide technical expertise on complex automation and functional issues.
Flexible emergency support timing based on the business requirement. Must adapt to business needs in terms of working hours.
Big Data technologies such as Hadoop, NoSQL - Couchbase, Cassandra
Create a job alert for this search
Site Reliability Engineer • India
Related jobs
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Nagpur, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer
WSO2nagpur, maharashtra, in
Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 25 days ago
Promoted
Site Reliability Engineer [3 Days Left]
SynechronIndia
We have immediate opportunity for Senior Site Reliability Engineer.Job Role : Senior Site Reliability Engineer.Job Location : Synechron ( Bengaluru).
At Synechron, we believe in the power of digital t...Show moreLast updated: 30+ days ago
Promoted
New!
Apply in 3 Minutes! Site Reliability Engineer
WhiteLotus Talent PartnersIndia
We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure power...Show moreLast updated: 1 hour ago
Promoted
Site Reliability Engineer
Sonata SoftwareNagpur, IN
We're Hiring : Senior Site Reliability Engineer.Onsite (Office : Hyderabad – Mandatory from Day 1).Senior Site Reliability Engineer (SRE).
This is a high-impact role where you’ll design scalable archi...Show moreLast updated: 3 days ago
Promoted
AWS Site Reliability Engineer
HTC Global ServicesIndia
HTC – A brief profile Established in 1990, HTC Inc.Troy, Michigan, is a leading global Information Technology solution and BPO provider.
HTC assists clients across multiple industry verticals, offer...Show moreLast updated: 13 days ago
Promoted
Principal Site Reliability Engineer
Rakuten IndiaIndia
Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show moreLast updated: 30+ days ago
Promoted
New!
▷ Immediate Start : Sr. Site Reliability Engineer [T500-20179]
Delta Air LinesIndia
Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 1 hour ago
Promoted
New!
▷ (3 Days Left) Site Reliability Engineer
TalentiserIndia
Reliability, Automation, and Observability As a hybrid Site Reliability Engineer / DevOps Engineer, you'll be a key driver in ensuring the stability, performance, and scalability of our mission-criti...Show moreLast updated: 1 hour ago
Promoted
Software Engineer, Site Reliability Engineering (Ecoh Core)
EcohNagpur, IN
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.Strong problem-solving and analytical skills.
Ability to debug, optimize code, and automate routine tasks.E...Show moreLast updated: 3 days ago
Promoted
Site Reliability Engineer
ACL DigitalIndia
Service Management : Maintain application uptime / performance, manage system enhancements and defects, oversee daily operational activities, and ensure continuous improvement and adherence to ITIL be...Show moreLast updated: 30+ days ago
Promoted
Site Reliability Engineer
Batch Systems IncIndia
Batch is a brand-first technology platform designed to amplify customer engagement, enable frictionless transactions, defend product authenticity, elevate customer loyalty, and ignite customer grow...Show moreLast updated: 1 day ago
Promoted
Site Reliability Engineer
o9 Solutions, Inc.nagpur, maharashtra, in
Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises.
With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 2 days ago
Promoted
Site Reliability Engineer
CodeKarmanagpur, maharashtra, in
Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 2 days ago
Promoted
New!
[Apply in 3 Minutes] Senior Site Reliability Engineer
ViewSonicIndia
At ViewSonic Technologies, we’re passionate about building software that solves problems.We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availabilit...Show moreLast updated: 1 hour ago
Site Reliability Engineer- Platform Engineering
Weekday AIIN
Remote
Quick Apply
This role is for one of Weekday’s clients.We are looking for an experienced and motivated.Site Reliability Engineer (SRE) – Platform Engineering.
In this role, you will be responsible for designing,...Show moreLast updated: 18 days ago
Promoted
New!
▷ (15h Left) Site Reliability Engineer
Amicon Hub ServicesIndia
Manage and scale production systems hosted on Google Cloud Platform (GCP) - Implement SRE best practices : monitoring, alerting, SLAs, SLOs, and error budgets - Automate operational tasks using Infr...Show moreLast updated: 1 hour ago
Promoted
Site Reliability Engineer
Amicon Hub Servicesnagpur, maharashtra, in
Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation.
Collaborate with development teams to en...Show moreLast updated: 23 days ago
Promoted
New!
▷ Immediate Start : Site Reliability Engineer
TechVeritoIndia
As a SRE Engineer, you will have a strong background in cloud infrastructure management, migration and deployment, with expertise in Google Cloud Platform (GCP), DevOps tools, and Kubernetes ecosys...Show moreLast updated: 1 hour ago
Promoted
New!
▷ Only 24h Left : Principal Engineer, Site Reliability [T500-20295]
TMUS Global SolutionsIndia
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 1 hour ago