Talent.com
Openstack Engineer

Openstack Engineer

WhiteLotus Talent PartnersBengaluru, Karnataka, India
17 days ago
Job description

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high system availability, reliability, and performance. You will be responsible for identifying and addressing simple issues, as well as escalating more complex problems to senior SREs when needed.

The ideal candidate should have a basic understanding of cloud infrastructure (especially OpenStack and Kubernetes ), containerized environments , and system monitoring. This position offers an excellent opportunity for someone looking to grow into a more advanced SRE or DevOps role.

Key Responsibilities :

For L0 Support (Level 0) :

  • Incident Monitoring & Triage :
  • Respond to system alerts, monitor infrastructure health using tools like Prometheus , Grafana , and Observability for both OpenStack and Kubernetes.
  • Identify low-level issues and follow runbooks or predefined scripts to perform first-level triage.
  • Document and escalate unresolved incidents to L1 or L2 based on established escalation protocols.
  • System Health Checks :
  • Perform daily health checks for Kubernetes pods, nodes, and OpenStack instances.
  • Verify basic functionality of VMs , containers , and network services within the environment.
  • Basic Troubleshooting :
  • Resolve simple issues such as VM reboots, pod failures, and network connectivity issues within OpenStack or Kubernetes environments.
  • Follow the predefined steps for basic troubleshooting tasks like restarting services or clearing logs.
  • Ticket Management :
  • Log incidents and issues into a ticketing system (e.g., JIRA , ServiceNow ) for tracking and escalation.
  • Update incident tickets and provide relevant information for ongoing resolution efforts.

=========================================================================================================

For L1 Support (Level 1) :

  • Incident Resolution :
  • Investigate and resolve more complex issues compared to L0, such as Kubernetes pod crashes, network misconfigurations in OpenStack, and minor service disruptions.
  • Work with tools like kubectl to troubleshoot Kubernetes pods and nodes, and OpenStack CLI to diagnose problems with VMs, storage, and networks.
  • Automation & Scripting :
  • Automate routine tasks, such as VM provisioning, pod deployments, or status checks, using basic scripting languages ( Python , Bash ).
  • Improve automation workflows based on feedback and frequently encountered issues.
  • Log Aggregation & Monitoring :
  • Review logs and metrics collected from ELK Stack , Prometheus , Grafana , or other logging tools to detect trends and potential issues.
  • Analyze logs and metrics from OpenStack and Kubernetes clusters to pinpoint underlying problems (e.g., high CPU usage, memory leaks).
  • Basic Network & Storage Management :
  • Investigate networking issues related to Neutron (for OpenStack) and CNI configurations (for Kubernetes).
  • Manage storage resources within OpenStack and Kubernetes (e.g., creating persistent volumes, debugging storage access issues).
  • Collaboration & Escalation :
  • Work closely with L2 and L3 engineers for complex troubleshooting or advanced system issues that require in-depth knowledge.
  • Share knowledge with the team and assist in creating new documentation or updating existing troubleshooting guides.
  • User and Permissions Management :
  • Perform basic user management tasks within OpenStack (e.g., creating and managing tenants, security groups).
  • Review and modify Kubernetes RBAC (Role-Based Access Control) settings based on user access needs.
  • Skills & Qualifications :

    Required Skills :

  • Basic Cloud & Kubernetes Knowledge :
  • Familiarity with OpenStack architecture (e.g., Nova , Neutron , Cinder ).
  • Basic understanding of Kubernetes components, including pods , services , deployments , and namespaces .
  • Systems & Networking :
  • Knowledge of Linux / Unix-based operating systems (e.g., Ubuntu , CentOS , Red Hat ).
  • Understanding of networking concepts like DNS , IP routing , and VLANs in cloud environments.
  • Monitoring & Alerting Tools :
  • Familiarity with monitoring tools like Prometheus , Grafana , Zabbix , or CloudWatch for alert management and system health monitoring.
  • Troubleshooting & Incident Response :
  • Experience in using log aggregation tools ( ELK stack , Splunk ) and interpreting logs for incident detection.
  • Ability to perform basic troubleshooting steps (e.g., restarting services, running basic shell commands) to resolve issues.
  • Communication Skills :
  • Strong communication skills to collaborate effectively with senior SREs, developers, and other teams.
  • Ability to document incidents, solutions, and troubleshooting steps clearly.
  • Preferred Skills :

  • Basic Scripting & Automation :
  • Exposure to scripting languages such as Bash , Python , or Go to automate basic administrative tasks.
  • Cloud Platform Experience :
  • Familiarity with other cloud technologies such as AWS , Azure , or Google Cloud Platform .
  • Certifications :
  • Basic certifications such as CompTIA Linux+ , AWS Certified Solutions Architect , Kubernetes Fundamentals (CKA), or OpenStack COA are a plus.
  • Create a job alert for this search

    Engineer • Bengaluru, Karnataka, India

    Related jobs
    • Promoted
    Lead Engineer - Fullstack [T500-18808]

    Lead Engineer - Fullstack [T500-18808]

    Neighborly®Bengaluru, Karnataka, India
    Neighborly is a local network of home service brands that will connect you to very specific vetted local experts.Our family of service professionals work with rigorous quality standards to repair, ...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer

    Software Engineer

    capeironBengaluru, IN
    We’re Hiring : Software Engineer [Remote].Month Contract Role at Capeiron Technology.Fixed Compensation for contract : INR 5,00,000. As a Software Engineer, you’ll be responsible for developing and ma...Show moreLast updated: 30+ days ago
    • Promoted
    Senior DevOps Engineer

    Senior DevOps Engineer

    ElestioBangalore, IN
    Elestio is growing, and we’re looking for a DevOps Expert to join our team!.To support our fast growth, we’re looking for someone passionate about DevOps, open-source technologies, and customer suc...Show moreLast updated: 30+ days ago
    • Promoted
    Megthink - Auth0 Developer - Okta / CIAM

    Megthink - Auth0 Developer - Okta / CIAM

    MegThink Solutions Private LimitedBangalore
    Position Overview : We are seeking an experienced Auth0 Developer with strong expertise in Oktas Auth0 CIAM solution.The ideal candidate will be responsible for confi...Show moreLast updated: 30+ days ago
    • Promoted
    Implementation Engineer

    Implementation Engineer

    HexnodeBengaluru, Karnataka, India
    Hexnode, the enterprise software division of Mitsogo Inc.With a robust presence in over 100 countries, Hexnode UEM (Unified Endpoint Management) empowers organizations across myriad sectors to achi...Show moreLast updated: 30+ days ago
    • Promoted
    AspenTech - Senior DevOps Engineer - System Infrastructure

    AspenTech - Senior DevOps Engineer - System Infrastructure

    Aspen TechnologyBangalore
    The Role : As a Senior DevOps Engineer, you will play a key role in designing, deploying, and supporting DGMs control systems infrastructure.Youll...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer - Fullstack [T500-20529]

    Engineer - Fullstack [T500-20529]

    ANSRBengaluru, IN
    ANSR is hiring for one of its clients.ArcelorMittal was formed in 2006 from the strategic merger of European company Arcelor and Indian-owned Mittal Steel. Over a journey of two decades, we have eme...Show moreLast updated: 30+ days ago
    • Promoted
    Implementation Engineer

    Implementation Engineer

    Tuebora IncBengaluru, Karnataka, India
    Implementation Engineer Job Description Position Overview : The Implementation engineer role is customer facing and the prospective hire will be working with the customers in understanding their IAM...Show moreLast updated: 14 days ago
    • Promoted
    Software Development Engineer - II (iOS)

    Software Development Engineer - II (iOS)

    Capillary TechnologiesGreater Bengaluru Area, India
    Design, build, and maintain advanced iOS applications using Swift and Objective-C.Translate product requirements into scalable technical solutions. Collaborate with cross-functional teams including ...Show moreLast updated: 16 days ago
    • Promoted
    Solutions Engineer – Onboarding & Implementation

    Solutions Engineer – Onboarding & Implementation

    OnArrivalBengaluru, India
    OnArrival is the AWS of travel—powering flights, hotels, insurance, and more via modular APIs and SDKs.We enable fintechs, banks, and large ecosystems to launch embedded travel experiences in under...Show moreLast updated: 30+ days ago
    • Promoted
    Cloud Application Development Engineer

    Cloud Application Development Engineer

    Albertsons Companies IndiaBengaluru, Republic Of India, IN
    As a leading food and drug retailer in the United States, Albertsons Companies, Inc.ACI) operates over 2,200 stores across 34 states and the District of Columbia. Our well-known global banners, incl...Show moreLast updated: 16 days ago
    • Promoted
    TechOps Engineer

    TechOps Engineer

    Aquanowvijayapura, India
    Aquanow is a trading and technology company powering the next generation of financial services.We’re at the forefront of the rapidly evolving digital asset space, empowering businesses to navigate ...Show moreLast updated: 10 days ago
    • Promoted
    DevOps Engineer

    DevOps Engineer

    Alp Consulting Ltd.Greater Bengaluru Area, India
    Good knowledge of AWS technologies including EC2, ECS / EKS (Docker containers), RDS, S3, Lambda, CloudHSM.Cloud stack deployment & upgrade using CloudFormation / Terraform.REST end point development...Show moreLast updated: 14 days ago
    • Promoted
    OpenStack Associate Engineer

    OpenStack Associate Engineer

    Anicalls (Pty) Ltdbangalore, India
    Must have an account of storage platforms; Ceph Storage experience specifically preferred.Show moreLast updated: 30+ days ago
    • Promoted
    EUC Engineer (IMAC Implementation)

    EUC Engineer (IMAC Implementation)

    TECEZEBengaluru, Karnataka, India
    EUC Engineer (IMAC Implementation).The EUC Engineer will execute and support.IMAC (Install, Move, Add, Change).IT assets, while adhering to IT standards and SLAs. Execute IMAC activities for desktop...Show moreLast updated: 16 days ago
    • Promoted
    Big Oh Tech - DevOps Engineer - IAC Terraform

    Big Oh Tech - DevOps Engineer - IAC Terraform

    Big Oh TechBangalore
    Responsibilities : - Design, implement, and manage CI / CD pipelines using GitLab.Work with Azure cloud services for application deployment and moni...Show moreLast updated: 30+ days ago
    • Promoted
    DevOps Engineer- Oracle Exadata

    DevOps Engineer- Oracle Exadata

    Cognizantvijayapura, India
    DevOps Engineer- Oracle Exadata - Hybrid Working.Looking for candidates who are willing to relocate to Sweden.What makes Cognizant a unique place to work? The combination of rapid growth and an int...Show moreLast updated: 10 days ago
    • Promoted
    iOS Developer

    iOS Developer

    ArcanaGreater Bengaluru Area, India
    We're looking for a passionate and experienced.We're building a next-gen ultra-fast, secure portfolio intelligence platform that blends speed, delight, and reliability, and now, we're ready to brin...Show moreLast updated: 30+ days ago