Talent.com
Systems Reliability Specialist

Systems Reliability Specialist

HRhelpdeskIndore, Republic Of India, IN
5 days ago
Job description

About the company : Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.

Job Summary : As a Site Reliability Engineer (SRE), you will be responsible for building and maintaining the infrastructure, tools, and pipelines that keep our systems running smoothly. You will collaborate closely with DevOps, engineering, and product teams to design and deploy reliable, scalable, and automated systems. You will also improve the application code for user-facing bugs, ensuring enhanced performance and resilience.

RESPONSIBILITIES : Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)

1. CI / CD Pipeline Management :

  • Design, implement, and maintain robust CI / CD pipelines for automated software deployment.
  • Collaborate with DevOps and engineering teams to integrate testing, monitoring, and security checks into pipelines.
  • Continuously improve deployment processes to ensure smooth and error-free production releases.

2. Monitoring and Observability :

  • Create and manage comprehensive logging dashboards in Datadog to monitor system health, performance, and logs.
  • Set up alerting mechanisms to proactively identify and respond to system issues.
  • Analyze and visualize key performance metrics to drive improvements.
  • 3. Collaborate on Architectural Solutions :

  • Work closely with DevOps and engineering teams to design scalable, resilient, and secure infrastructure.
  • Ensure solutions adhere to best practices for performance, security, and maintainability.
  • 4. Code Optimization and Bug Fixing :

  • Improve application code to resolve user-facing bugs and enhance system resilience.
  • Troubleshoot and fix issues that impact the performance or availability of production systems.
  • Contribute to the continuous improvement of the codebase, focusing on optimizing performance and reliability.
  • 5. Automation and Continuous Improvement :

  • Automate repetitive tasks related to infrastructure management, monitoring, and troubleshooting.
  • Identify and propose innovative solutions to improve system efficiency and performance. 6. Custom Node.Js CLI Tool Development :
  • Develop and automate custom Node.Js CLI tools to enhance operational workflows and streamline repetitive tasks.
  • Implement automated solutions to optimize system processe
  • Requirements

    MUST HAVES :

  • Experience Level : 6-8 years
  • Comfortable with work shift aligned with U.S. time zone (7 pm to 3 am IST)
  • Prior experience working in cross-functional teams
  • Systems architecture and design skills
  • Proficiency in scripting languages such as Bash, Python, or PowerShell.
  • Experience with CI / CD tools such as Github Actions or similar platforms.
  • Build and deployment automation experience especially in a containerized world
  • Proficiency with common ops tools (ECS, Logstash, Datadog + Kibana, EKS etc)
  • Experience with AWS or Azure
  • Comfort maintaining live production systems
  • Strong communication and collaboration skills, with the ability to work effectively in a fast-paced team environment.
  • Create a job alert for this search

    System Specialist • Indore, Republic Of India, IN

    Related jobs
    • Promoted
    Systems Reliability Specialist

    Systems Reliability Specialist

    Grid DynamicsRepublic Of India, IN
    Location-Bangalore / Chennai / Hyderabad.Core Skills (Some combination of : ).These might include (Tomcat, Apache, Springboot, SQS, JBoss, IBM MQ, IBM DataPower, Hazelcast, Flink, Connect Direct, SSL).Un...Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Specialist

    Site Reliability Specialist

    Mindstix Software LabsPune, Republic Of India, IN
    Mindstix accelerates digital transformation for the world's leading brands.We are a team of passionate innovators specialized in. Cloud Engineering, Enterprise Mobility, Digital Experiences, and Dat...Show moreLast updated: 30+ days ago
    • Promoted
    System Reliability Expert

    System Reliability Expert

    SynechronPune, Republic Of India, IN
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5-8 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialist...Show moreLast updated: 8 days ago
    • Promoted
    Remote Senior DevOps Data Reliability Specialist

    Remote Senior DevOps Data Reliability Specialist

    Hyly.AIRepublic Of India, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show moreLast updated: 6 days ago
    • Promoted
    GCP Reliability Specialist

    GCP Reliability Specialist

    NR ConsultingPune, Republic Of India, IN
    We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infrastructure initiatives.The ideal candidat...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineering Specialist

    Site Reliability Engineering Specialist

    iSoftStoneRepublic Of India, IN
    Greetings from ISoftStone Inc!.This is Rajlaxmi from the HR department of ISoftStone Inc.We are looking for a SRE / Devops. Location- Bangalore / Hybrid (2-3 days WFO).Bachelors degree in computer scie...Show moreLast updated: 13 days ago
    • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AIIndia, India
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Specialist

    Site Reliability Specialist

    HTC Global ServicesChennai, Republic Of India, IN
    Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show moreLast updated: 15 days ago
    • Promoted
    Systems Reliability Specialist

    Systems Reliability Specialist

    Grootan TechnologiesChennai, Republic Of India, IN
    Site Reliability Engineer (SRE).In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications.You will leverage your e...Show moreLast updated: 5 days ago
    • Promoted
    Senior Systems Reliability Specialist

    Senior Systems Reliability Specialist

    SynechronRepublic Of India, IN
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5+ years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology specialists...Show moreLast updated: 23 hours ago
    • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    Tata Consultancy ServicesChennai, Republic Of India, IN
    Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional), Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment experience,.Show moreLast updated: 5 days ago
    • Promoted
    TCS Walkin Drive For Site Reliability Engineering (SRE)

    TCS Walkin Drive For Site Reliability Engineering (SRE)

    Tata Consultancy ServicesIndia
    Site Reliability Engineering (SRE)Ops.TCS has been a great pioneer in feeding the fire of young Techies like you.We are a global leader in the technology arena and there’s nothing that can stop us ...Show moreLast updated: 1 day ago
    • Promoted
    Production Systems Reliability Engineer

    Production Systems Reliability Engineer

    RecRootsRepublic Of India, IN
    Key Job Responsibilities and Duties : .The core premise for the SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned addressing...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Systems Reliability Engineer

    Cloud Systems Reliability Engineer

    DeloitteChennai, Republic Of India, IN
    We’re hiring Cloud & Linux Operations Engineers (SMEs)!.Looking for experienced professionals to manage and support enterprise-scale Linux systems, cloud platforms (AWS, Azure, Kubernetes), and dat...Show moreLast updated: 15 days ago
    • Promoted
    Systems Reliability Specialist

    Systems Reliability Specialist

    SynechronPune, Republic Of India, IN
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5 to 9 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology special...Show moreLast updated: 8 days ago
    • Promoted
    Systems Reliability Specialist

    Systems Reliability Specialist

    Tata Consultancy ServicesChennai, Republic Of India, IN
    Role : Site Reliability Engineer.Location : Chennai / Bangalore / Hyderabad.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc. Gremlin or Chaos Monkey or Simian Army or Litmus expertise.Ex...Show moreLast updated: 5 days ago
    • Promoted
    Systems Reliability Engineer

    Systems Reliability Engineer

    PhonePePune, Republic Of India, IN
    Troubleshoot issues across the entire stack - hardware, software, application, and network.Work to improve the reliability and performance of the next generation of distributed systems.Work to impr...Show moreLast updated: 15 days ago
    • Promoted
    Systems Reliability Engineer

    Systems Reliability Engineer

    iSoftStoneRepublic Of India, IN
    Greetings from ISoftStone Inc!.This is Rajlaxmi from the HR department of ISoftStone Inc.We are looking for a SRE / Devops. Location- Bangalore / Hybrid (2-3 days WFO).Bachelors degree in computer scie...Show moreLast updated: 13 days ago