Talent.com
SRE Associate - Cloud Services
SRE Associate - Cloud ServicesWhiteLotus Talent Partners • Bengaluru, Republic Of India, IN
SRE Associate - Cloud Services

SRE Associate - Cloud Services

WhiteLotus Talent Partners • Bengaluru, Republic Of India, IN
2 days ago
Job description

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high system availability, reliability, and performance. You will be responsible for identifying and addressing simple issues, as well as escalating more complex problems to senior SREs when needed.

The ideal candidate should have a basic understanding of cloud infrastructure (especially OpenStack and Kubernetes ), containerized environments , and system monitoring. This position offers an excellent opportunity for someone looking to grow into a more advanced SRE or DevOps role.

Key Responsibilities :

For L0 Support (Level 0) :

  • Incident Monitoring & Triage :
  • Respond to system alerts, monitor infrastructure health using tools like Prometheus , Grafana , and Observability for both OpenStack and Kubernetes.
  • Identify low-level issues and follow runbooks or predefined scripts to perform first-level triage.
  • Document and escalate unresolved incidents to L1 or L2 based on established escalation protocols.
  • System Health Checks :
  • Perform daily health checks for Kubernetes pods, nodes, and OpenStack instances.
  • Verify basic functionality of VMs , containers , and network services within the environment.
  • Basic Troubleshooting :
  • Resolve simple issues such as VM reboots, pod failures, and network connectivity issues within OpenStack or Kubernetes environments.
  • Follow the predefined steps for basic troubleshooting tasks like restarting services or clearing logs.
  • Ticket Management :
  • Log incidents and issues into a ticketing system (e.G., JIRA , ServiceNow ) for tracking and escalation.
  • Update incident tickets and provide relevant information for ongoing resolution efforts.

=========================================================================================================

For L1 Support (Level 1) :

  • Incident Resolution :
  • Investigate and resolve more complex issues compared to L0, such as Kubernetes pod crashes, network misconfigurations in OpenStack, and minor service disruptions.
  • Work with tools like kubectl to troubleshoot Kubernetes pods and nodes, and OpenStack CLI to diagnose problems with VMs, storage, and networks.
  • Automation & Scripting :
  • Automate routine tasks, such as VM provisioning, pod deployments, or status checks, using basic scripting languages ( Python , Bash ).
  • Improve automation workflows based on feedback and frequently encountered issues.
  • Log Aggregation & Monitoring :
  • Review logs and metrics collected from ELK Stack , Prometheus , Grafana , or other logging tools to detect trends and potential issues.
  • Analyze logs and metrics from OpenStack and Kubernetes clusters to pinpoint underlying problems (e.G., high CPU usage, memory leaks).
  • Basic Network & Storage Management :
  • Investigate networking issues related to Neutron (for OpenStack) and CNI configurations (for Kubernetes).
  • Manage storage resources within OpenStack and Kubernetes (e.G., creating persistent volumes, debugging storage access issues).
  • Collaboration & Escalation :
  • Work closely with L2 and L3 engineers for complex troubleshooting or advanced system issues that require in-depth knowledge.
  • Share knowledge with the team and assist in creating new documentation or updating existing troubleshooting guides.
  • User and Permissions Management :
  • Perform basic user management tasks within OpenStack (e.G., creating and managing tenants, security groups).
  • Review and modify Kubernetes RBAC (Role-Based Access Control) settings based on user access needs.
  • Skills & Qualifications :

    Required Skills :

  • Basic Cloud & Kubernetes Knowledge :
  • Familiarity with OpenStack architecture (e.G., Nova , Neutron , Cinder ).
  • Basic understanding of Kubernetes components, including pods , services , deployments , and namespaces .
  • Systems & Networking :
  • Knowledge of Linux / Unix-based operating systems (e.G., Ubuntu , CentOS , Red Hat ).
  • Understanding of networking concepts like DNS , IP routing , and VLANs in cloud environments.
  • Monitoring & Alerting Tools :
  • Familiarity with monitoring tools like Prometheus , Grafana , Zabbix , or CloudWatch for alert management and system health monitoring.
  • Troubleshooting & Incident Response :
  • Experience in using log aggregation tools ( ELK stack , Splunk ) and interpreting logs for incident detection.
  • Ability to perform basic troubleshooting steps (e.G., restarting services, running basic shell commands) to resolve issues.
  • Communication Skills :
  • Strong communication skills to collaborate effectively with senior SREs, developers, and other teams.
  • Ability to document incidents, solutions, and troubleshooting steps clearly.
  • Preferred Skills :

  • Basic Scripting & Automation :
  • Exposure to scripting languages such as Bash , Python , or Go to automate basic administrative tasks.
  • Cloud Platform Experience :
  • Familiarity with other cloud technologies such as AWS , Azure , or Google Cloud Platform .
  • Certifications :
  • Basic certifications such as CompTIA Linux+ , AWS Certified Solutions Architect , Kubernetes Fundamentals (CKA), or OpenStack COA are a plus.
  • Create a job alert for this search

    Sre • Bengaluru, Republic Of India, IN

    Related jobs
    Cloud Infra - Senior Associate, SAL 1

    Cloud Infra - Senior Associate, SAL 1

    Confidential • Bengaluru / Bangalore
    Simplify3x Software PVT LTD is looking for a Cloud & DevOps Engineer to join our team of bright thinkers and enablers.You will use your problem-solving skills, craft & creativity to design and deve...Show more
    Last updated: 17 days ago • Promoted
    SRE / DevOps

    SRE / DevOps

    Confidential • Bengaluru / Bangalore
    Demonstrated ability in designing, building, refactoring and releasing software written in Python.ML frameworks such as PyTorch, TensorFlow, Triton. Ability to handle framework-related issues, versi...Show more
    Last updated: 19 days ago • Promoted
    Software Engineer - Cloud SRE

    Software Engineer - Cloud SRE

    Confidential • Bengaluru / Bangalore, India
    At eBay, we're more than a global ecommerce leader — we're changing the way the world shops and sells.Our platform empowers millions of buyers and sellers in more than 190 markets around the world....Show more
    Last updated: 13 days ago • Promoted
    SRE Cloud Engineer

    SRE Cloud Engineer

    Confidential • Bengaluru / Bangalore
    If you are go-getter who can work with minimal guidance and supervision in emerging technologies, then you have the right opportunity here!. Looking for a talented Site Reliability Engineer (SRE) to...Show more
    Last updated: 27 days ago • Promoted
    SRE DevOps

    SRE DevOps

    Confidential • Bengaluru / Bangalore
    Apply server-side software engineering skills, including scripting with.Optional, but a plus) Contribute to the process of. Puppet, Ansible, Chef, or Terraform to automate infrastructure provisionin...Show more
    Last updated: 30+ days ago • Promoted
    Associate Engineer- Devops

    Associate Engineer- Devops

    Confidential • Bengaluru / Bangalore, India
    Introduction : A Career at HARMAN Automotive.We're a global, multi-disciplinary team that's putting the innovative power of technology to work and transforming tomorrow. At HARMAN Automotive, we give...Show more
    Last updated: 13 days ago • Promoted
    Solutions Associate, Data Cloud Applications

    Solutions Associate, Data Cloud Applications

    Confidential • Bengaluru / Bangalore, India
    Zeta Global is seeking a Solutions Associate for our Data Cloud Applications team to drive operational excellence, client support, and solution innovation. This role provides critical leverage to th...Show more
    Last updated: 19 days ago • Promoted
    Cloud Platform Specialist

    Cloud Platform Specialist

    Tata Consultancy Services • Bengaluru, Republic Of India, IN
    TCS has been a great pioneer in feeding the fire of Techies like you.We are a global leader in the technology arena and there’s nothing that can stop us from growing together.Your role is of key im...Show more
    Last updated: 30+ days ago • Promoted
    SRE (Devops)

    SRE (Devops)

    Cozzera • bangalore district, karnataka, in
    Manage and optimize cloud infrastructure with strong hands-on expertise in.Automate deployment pipelines and ensure high availability and scalability of services. Troubleshoot production issues and ...Show more
    Last updated: 15 hours ago • Promoted • New!
    Cloud SRE

    Cloud SRE

    Virtusa • Bengaluru, Karnataka, India
    P2-C3-STSJDThe Value You DeliverLeading the initiative to craft and deploy our applications to the cloudPromoting a DevOps mentality providing mentorship and establishing development standard metho...Show more
    Last updated: 13 days ago • Promoted
    Senior SRE Cloud (Site Reliability Engineering)

    Senior SRE Cloud (Site Reliability Engineering)

    Confidential • Bengaluru / Bangalore
    A Cloud Site Reliability Engineering Engineer closely works with app developers to tide the cloud infrastructure to the application behavior or deployment like a software engineer.The close collabo...Show more
    Last updated: 7 days ago • Promoted
    Cloud SRE

    Cloud SRE

    Confidential • Bengaluru / Bangalore, India
    AWS in a production environment.Experience building and deploying Docker images including Docker Compose.Production experience running Kubernetes workloads ideally on AWS EKS.Experience managing an...Show more
    Last updated: 9 days ago • Promoted
    Cloud Infra - Senior Associate, SAL 2

    Cloud Infra - Senior Associate, SAL 2

    Confidential • Bengaluru / Bangalore
    Your Impact OR Responsibilities : .Combine your technical expertise and problem-solving passion to work closely with clients, turning complex ideas into end-to-end solutions that transform our client...Show more
    Last updated: 17 days ago • Promoted
    Sr Manager Cloud Engineer

    Sr Manager Cloud Engineer

    Standard Chartered Bank • Bengaluru, Karnataka, India
    This job is with Standard Chartered Bank, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly....Show more
    Last updated: 6 days ago • Promoted
    SRE

    SRE

    Confidential • Bengaluru / Bangalore
    We are seeking a skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of critical systems and applications. The ideal candidate will have strong expertise ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Cloud SRE Engineer (Azure)

    Senior Cloud SRE Engineer (Azure)

    London Stock Exchange Group • Bangalore, India
    In this role, you will be joining our Cloud SRE team within.Cloud & Productivity Engineering.This team focuses on applying software Engineering practices to IT operations tasks to maintain and impr...Show more
    Last updated: 30+ days ago • Promoted
    Sr Cloud Engineer (ML operations)

    Sr Cloud Engineer (ML operations)

    Moody's • Bangalore, India
    At Moody's, we unite the brightest minds to turn today's risks into tomorrow's opportunities.We do this by striving to create an inclusive environment where everyone feels welcome to be who they ar...Show more
    Last updated: 13 hours ago • Promoted • New!
    SRE Cloud Infrastructure Specialist

    SRE Cloud Infrastructure Specialist

    o9 Solutions, Inc. • Bengaluru, Republic Of India, IN
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more
    Last updated: 6 days ago • Promoted