Talent.com
SRE Devops Lead / SSE - Hybrid Mode
SRE Devops Lead / SSE - Hybrid ModeInfinite Computer Solutions • Rajkot, IN
No longer accepting applications
SRE Devops Lead / SSE - Hybrid Mode

SRE Devops Lead / SSE - Hybrid Mode

Infinite Computer Solutions • Rajkot, IN
2 days ago
Job description

We are looking for Site Reliability / Cloud Engineer Devops Lead / SSE

Experience - 6 years - 12 years

Can join immediate - 30 days

Shift timing : Regular

Location : Bangalore / Hyderabad / Chennai / Noida / Pune / Gurgaon / Visakhapatnam

Interested candidates, Please share your profiles and below details to

Email ID : Shanmukh.Varma@infinite.com

Total experience :

Relevant Experience :

Current CTC :

Expected CTC :

Notice Period :

If Serving Notice Period, Last working day :

Email ID : Shanmukh.Varma@infinite.com

Job Title : Site Reliability / Cloud Engineer

Job Type : Full-time

Department : Engineering

Job Summary

We're seeking a motivated, and passionate Site Reliability Engineering (SRE) leader with strong expertise in programming, distributed systems, and Kubernetes. In this role, you'll help evolve our SRE team's Kubernetes and microservices architecture, while also supporting the integration of Agentic AI workloads both within Kubernetes and via managed services.

The SRE function plays a critical role in maintaining system visibility, ensuring platform scalability, and enhancing operational efficiency. As part of this, you'll help drive AIOps initiatives, leveraging AI tools and automation to proactively detect, diagnose, and remediate issues, enhancing the reliability and performance of Zyter’s global platform. As a cloud practictioner, you’ll have the opportunity to apply your technical strengths, shape platform reliability strategies, and collaborate closely with engineering teams across the organization. You’ll work as part of a globally distributed, inclusive team focused on AWS-based cloud infrastructure.

Key Responsibilities

Core SRE :

  • Collaborate with development teams, product owners, and stakeholders to define, enforce, and track SLOs and manage error budgets.
  • Improve system reliability by designing for failure, testing edge cases, and monitoring key metrics.
  • Boost performance by identifying bottlenecks, optimizing resource usage, and reducing latency across services.
  • Build scalable systems that handle growth in traffic or data without compromising performance.
  • Stay directly involved in technical work, contributing to the codebase and leading by example in solving complex infrastructure challenges

AI Ops :

  • Design and implement scalable deployment strategies optimized for large language models like, Llama, Claude, Cohere and others.
  • Set up continuous monitoring for model performance, ensuring robust alerting systems are in place to catch anomalies or degradation.
  • Stay current with advancements in MLOps and Generative AI, proactively introducing innovative practices to strengthen AI infrastructure and delivery.
  • Monitoring and Alerting :

  • Set up monitoring and observability using Prometheus, Grafana, CloudWatch, and logging with OpenSearch / ELK
  • Proactively identify and resolve issues by leveraging monitoring systems to catch early signals before they impact operations.
  • Design and maintain alerting mechanisms that are clear, actionable, and tuned to avoid unnecessary noise or alert fatigue.
  • Continuously improve system observability to enhance visibility, reduce false positives, and support faster incident response.
  • Apply best practices for alert thresholds and monitoring configurations to ensure reliability and maintain system health.
  • Cost Management :

  • Monitor infrastructure usage to identify waste and reduce unnecessary spending.
  • Optimize resource allocation by using right-sized instances, auto-scaling, and spot instances where appropriate.
  • Implement cost-aware design practices during architecture and deployment planning.
  • Track and analyze monthly cloud costs to ensure alignment with budget and forecast.
  • Collaborate with teams to increase cost visibility and promote ownership of cloud spend.
  • Required Skills & Experience :

  • Strong experience as SRE with a proven track record of managing large-scale, highly available systems.
  • Knowledge of core operating system principles, networking fundamentals, and systems management.
  • Strong understanding of cloud deployment and management practices
  • Hands-on experience with Terraform / OpenTofu, Helm, Docker, Kubernetes, Prometheus and Istio
  • Hands-on experience with tools and techniques to diagnose and uncover container performance
  • Skilled with AWS services both from technology and cost perspectives
  • Skilled in DevOps / SRE practices and build / release pipelines
  • Experience working with mature development practices and tools for source control, security, and deployment
  • Hands on experience with Python / Golang / Groovy / Java
  • Excellent communication skills, written and verbal
  • Strong analytical and problem-solving skills
  • Preferred Qualifications

  • Experience scaling Kubernetes clusters and managing ingress traffic.
  • Familiarity with multi-environment deployments and automated workflows.
  • Knowledge of AWS service quotas, cost optimization, and networking nuances.
  • Strong troubleshooting skills and effective communication across teams.
  • Prior experience in regulated environments (HIPAA, SOC2, ISO27001) is a plus
  • Create a job alert for this search

    Lead Sre • Rajkot, IN

    Related jobs
    Azure Devops Lead / Specialist

    Azure Devops Lead / Specialist

    Aventra Group • Rājkot, Republic Of India, IN
    Aventra Group is a fast-growing company dedicated to empowering and transforming enterprises through Data and Application Engineering services. We offer integrated solutions in Data and Analytics, E...Show more
    Last updated: 10 days ago • Promoted
    Site Reliability Engineer (Sre) With Azure & Ai

    Site Reliability Engineer (Sre) With Azure & Ai

    Datum Technologies Group • Rājkot, Republic Of India, IN
    Job Title : Site Reliability Engineer (SRE) With Azure & AI.Duration : Contract Position (On the Payroll of Datum Technology Group). Location : Chennai || Mumbai || Gurugram.Interview Process : Virtual ...Show more
    Last updated: 3 hours ago • Promoted • New!
    Senior DevOps Engineer (SRE)

    Senior DevOps Engineer (SRE)

    MightyBot • Rajkot, IN
    Title : Senior DevOps Engineer (SRE).Join our team as a Senior DevOps Engineer, where we're focused on graduating AI from interesting demos to indispensable products. You will build and maintain the ...Show more
    Last updated: 1 day ago • Promoted
    Sr. DevOps Engineer - 2+ years Experience

    Sr. DevOps Engineer - 2+ years Experience

    Trovex.ai • Rajkot, IN
    At Trovex, we’re transforming sales training with an AI-powered role-play simulator.Born from a need to fix outdated methods, Trovex uses generative AI to help reps sharpen their skills through rea...Show more
    Last updated: 1 day ago • Promoted
    SRE Devops Lead / SSE - Hybrid Mode

    SRE Devops Lead / SSE - Hybrid Mode

    Infinite Computer Solutions • rajkot, India
    We are looking for Site Reliability / Cloud Engineer Devops Lead / SSE.Experience - 6 years - 12 years.Location : Bangalore / Hyderabad / Chennai / Noida / Pune / Gurgaon / Visakhapatnam.Interested ca...Show more
    Last updated: 7 hours ago • Promoted • New!
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Global • rajkot, gujarat, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show more
    Last updated: 10 days ago • Promoted
    Senior Sap Devops

    Senior Sap Devops

    Smart Moves Consultants • Rājkot, Republic Of India, IN
    Job title : Senior Developer – SAP DevOps.Reporting to : Director, SAP DevOps.Partner with application product teams & vendor developers to define, design and implement new functionality and services...Show more
    Last updated: 25 days ago • Promoted
    Senior / Lead Engineer - Devops

    Senior / Lead Engineer - Devops

    QBurst • Rājkot, Republic Of India, IN
    We are seeking an experienced and versatile DevOps Engineer.The ideal candidate will have hands-on experience with CI / CD pipelines, Kubernetes, Linux systems, monitoring / logging tools, and Infrastr...Show more
    Last updated: 24 days ago • Promoted
    Senior Devops Engineer

    Senior Devops Engineer

    Algofficient • Rājkot, Republic Of India, IN
    Algofficient is a software development company which focuses on providing innovative and efficient solutions to all customers while using the latest AI infrastructure which is enabling faster deliv...Show more
    Last updated: 1 day ago • Promoted
    Senior Devops Engineer

    Senior Devops Engineer

    American Inference • rajkot, gujarat, in
    Please apply directly through LinkedIn for this position.We kindly request that candidates refrain from contacting company officials via email or messages regarding this role.We are an AI and Data ...Show more
    Last updated: 8 days ago • Promoted
    Senior SAP Devops

    Senior SAP Devops

    Smart Moves Consultants • Rajkot, IN
    Job title : Senior Developer – SAP DevOps.Reporting to : Director, SAP DevOps.Partner with application product teams & vendor developers to define, design and implement new functionality and services...Show more
    Last updated: 30+ days ago • Promoted
    Devops Engineer / Sre

    Devops Engineer / Sre

    SuprSend • Rājkot, Republic Of India, IN
    SuprSend is reinventing notification infrastructure for global businesses.Powering seamless, reliable distribution of millions of events across channels. Join us as we scale further and raise the ba...Show more
    Last updated: 2 days ago • Promoted
    Senior Devops Engineer

    Senior Devops Engineer

    SiteRecon • rajkot, India
    Following selection criteria will be followed -.Atleast 3 years of experience in the DevOps / Platform Engineering Role.SiteRecon is a B2B SaaS platform transforming property measurements for landsca...Show more
    Last updated: 20 days ago • Promoted
    DevOps Engineer / SRE

    DevOps Engineer / SRE

    SuprSend • rajkot, gujarat, in
    SuprSend is reinventing notification infrastructure for global businesses.Powering seamless, reliable distribution of millions of events across channels. Join us as we scale further and raise the ba...Show more
    Last updated: 30+ days ago • Promoted
    Azure DevOps Lead / Specialist

    Azure DevOps Lead / Specialist

    Aventra Group • rajkot, gujarat, in
    Aventra Group is a fast-growing company dedicated to empowering and transforming enterprises through Data and Application Engineering services. We offer integrated solutions in Data and Analytics, E...Show more
    Last updated: 10 days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Rajkot, Gujarat, India
    About InstaService InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ ...Show more
    Last updated: 8 days ago • Promoted
    Sr. Devops Engineer - 2+ Years Experience

    Sr. Devops Engineer - 2+ Years Experience

    Trovex.ai • Rājkot, Republic Of India, IN
    At Trovex, we’re transforming sales training with an AI-powered role-play simulator.Born from a need to fix outdated methods, Trovex uses generative AI to help reps sharpen their skills through rea...Show more
    Last updated: 1 day ago • Promoted
    Devsecops Lead

    Devsecops Lead

    Ekfrazo Technologies Private Limited • Rājkot, Republic Of India, IN
    Notice Period : Immediate to 15 Days.Excellent Communication Skills and Work Stability.We are seeking a highly experienced DevSecOps Lead to drive secure, scalable, and. The ideal candidate will poss...Show more
    Last updated: 1 hour ago • Promoted • New!