Talent.com
This job offer is not available in your country.
DET-TT-Resilience and Reliability Engineer

DET-TT-Resilience and Reliability Engineer

EYCoimbatore, Tamil Nadu, India
10 hours ago
Job description

Senior Reliability Engineer :

Senior Reliability Engineer (Senior level)

Description

  • Reliability Engineering (SRE) is a modern way of delivering IT Solutions by imbibing Software engineering principles in Service Delivery to reduce IT Risk to business, improve business resilience, attain predictability & reliability, optimize cost of IT Infra and Ops
  • A Reliability Engineer typically has deep software engineering experience encompassing design, build, deploy and manage / maintain an IT solution ensuring resilience, reliability, and performance.
  • A Reliability Engineer is a bridge between development and operations by applying a software engineering mindset to the development, deployment, and maintenance of applications to maximize system reliability & automation, while improving efficiencies by optimizing resources

Responsibilities

  • Defining SLA / SLO / SLI for a product / service
  • Engineering in resilient design and implementation practices into solutions as they go through the product life cycle
  • Engineering out manual effort (Toil) through the development of automated processes and services (, Automated Management of Systems, CI / CD improvements)
  • Developing Observability Solutions to track, report, and measure SLA adherence
  • Help Optimize Cost of IT Infra & Operations - FinOps
  • Critical Situation management
  • SOP / Runbook automation, Toil reduction
  • Data Analytics & System trend analysis
  • Typical Skills and Background

  • 7+ years of experience in software product engineering principles, processes and systems
  • Hands-on experience in Java / J2EE, one of web server (Apache Tomcat or IBM HTTP Server), one of the application servers (Tomcat / WebSphere), and any major RDBMS like Oracle
  • Hands-on experience in at least one CI-CD (Azure DevOps, GitLab CI / CD, Jenkins) and IaC tools (Terraform, AWS CloudFormation, Ansible etc.)
  • Experience in at least one cloud technology (AWS / Azure / GCP etc. and Docker, Pivotal, Kubernetes, OpenShift etc.) and its reliability tools (Azure AppInsight, CloudWatch, Azure Monitor etc.)
  • Experience in Linux (RHEL) operating system performance monitoring parameters and their interpretation, commands used for monitoring
  • Experience in Observability - APM tools (Dynatrace, AppDynamics etc.), metrics / log consolidation (Splunk) and ELK Stack
  • Defining NFRs and SLA / SLO / SLI agreement for a product / platform / services
  • Knowledge on queuing models used, thread pools, request servicing processes etc.
  • Knowledge in Web Services, SOA, ESB (DataPower), RESTFul
  • Knowledge of application design patterns, J2EE application architectures, Microservices, Spring boot & Cloud native architectures
  • Proficiency in Java runtimes, Core Java, Garbage collection, JVM parameters tuning
  • Experience in performance tuning on Application Servers (Tomcat / WAS)
  • Experience in trouble shooting Performance / Scalability / Availability issues
  • Experience in Thread dump, heap dump generation & analysis
  • Knowledge on Query tuning and database designs & models
  • Knowledge at least one automation scripting language like Python
  • Mastery in collaborative software development using Git, Jira, Confluence etc.
  • AI / ML & Data Analytics knowledge and experience is a desirable
  • Create a job alert for this search

    Reliability Engineer • Coimbatore, Tamil Nadu, India

    Related jobs
    • Promoted
    LPG engineer Kigali Rwanda

    LPG engineer Kigali Rwanda

    Noah Gas LtdPalakkad, IN
    Job Opportunity : LPG Technician – Filling Station Supervisor & Stock Manager.Employment Type : Full-time | Permanent Position. International applicants welcome (Work visa support provided).We are a f...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    IN-Sr Technical Consultant - Backup and Disaster recovery, Monthly maintenance, Patch Management

    IN-Sr Technical Consultant - Backup and Disaster recovery, Monthly maintenance, Patch Management

    Blue YonderCoimbatore, Tamil Nadu, India
    We are seeking an experienced Backup Disaster Recovery and Monthly maintenance and Patch management Engineer to manage, optimize, and safeguard our organization’s data backup and recovery infrastru...Show moreLast updated: 10 hours ago
    • Promoted
    • New!
    Senior MLOps Engineer

    Senior MLOps Engineer

    Versatile PeopleGobichettipalayam, Tamil Nadu, India
    Full-time, 4-5 hours (PST overlap).A leading company specializing in delivering cutting-edge AI and machine learning solutions is seeking a seasoned ML Platform Engineer. This role involves designin...Show moreLast updated: 10 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaPalakkad, IN
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 29 days ago
    • Promoted
    Health Safety Environment Engineer

    Health Safety Environment Engineer

    Target Engineering Construction Co LLCPalakkad, IN
    We are seeking a proactive and experienced.The ideal candidate will have a strong background in health, safety, and environmental management, with a passion for promoting a safety-first culture acr...Show moreLast updated: 28 days ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    AvocaCoimbatore, IN
    Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform. Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    mindcurvCoimbatore, Tamil Nadu, India
    About Mindcurv We help our customers rethink their digital business, experiences, and technology to navigate the new digital reality. We do this by designing sustainable and accountable solutions fo...Show moreLast updated: 10 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftTiruppur, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordPalakkad, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 21 days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    Mitchell Martin Inc.Tiruppur, IN
    Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 23 days ago
    • Promoted
    L3 O365 Engineer

    L3 O365 Engineer

    Nextbridge IT SolutionsTiruppur, IN
    We are seeking a highly skilled .This senior role is a critical escalation point for complex issues, driving the resolution of major incidents and ensuring the seamless operation, security, and pro...Show moreLast updated: 11 days ago
    • Promoted
    L4 UC Engineer

    L4 UC Engineer

    Servion Global SolutionsCoimbatore, IN
    UC Architecture & Design : Deep understanding of Unified Communications Products like CUCM, CUC, IM & Presence, and Expressways. Deep knowledge of designing and troubleshooting clusters, inter-cluste...Show moreLast updated: 21 days ago
    • Promoted
    Resource Deployment Manager

    Resource Deployment Manager

    PTR GlobalTiruppur, IN
    Pinnacle Group is a nationally recognized leader in workforce solutions, known for delivering high-impact staffing, talent management, and contingent workforce programs. We support some of the most ...Show moreLast updated: 11 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne Solutionscoimbatore, tamil nadu, in
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 3 days ago
    • Promoted
    Lead Sustenance Engineer - Storage

    Lead Sustenance Engineer - Storage

    DDNPalakkad, IN
    This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a globa...Show moreLast updated: 11 days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.Palakkad, IN
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersPalakkad, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 27 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Coimbatore, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago