Talent.com
This job offer is not available in your country.
Site Reliability Engineer - Kubernetes

Site Reliability Engineer - Kubernetes

HireloBangalore
16 days ago
Job description

Job Description :

The candidate will be required to have skills across the following :

Database Administration (DBA) Skills :

  • Relational Databases : MySQL, PostgreSQL, Oracle, MS SQL Server.
  • Database Backup and Recovery : Tools and strategies for database backups and disaster recovery.
  • Performance Tuning : Query optimization, indexing strategies, and database performance troubleshooting.
  • Database Security : User management, roles, access control, and auditing.

Infrastructure as a Service Knowledge :

  • Infrastructure as Code (IaC) : Terraform, CloudFormation, Kubernetes.
  • Kubernetes and Containers : Good Knowledge and Understanding of Kubernetes and the usage of Containers.
  • Observability Tools : ELK stack (Elasticsearch, Logstash, Kibana).
  • Database Migration : Migrating databases across different platforms or cloud environments.
  • Infrastructure Scaling : Vertical and horizontal scaling techniques in cloud environments.
  • SRE Principles and knowledge (Site Reliability Engineering) :

  • Strong hands-on experience in AWS and Azure cloud, and a fair understanding of Google Cloud would be required.
  • Experience in handling APIs, troubleshooting API calls, and ensuring seamless integration and performance.
  • Incident Management : Handling database outages, incident response, and on-call rotations.
  • Monitoring and Alerting : Tools like Prometheus, Grafana, Datadog, CloudWatch, suggest proactive monitoring for the application stack.
  • Understanding of core SRE principles : SLA, SLI, SLO, Error budgets, etc.
  • Disaster Recovery Planning : Ensuring high availability (HA) and disaster recovery (DR) solutions.
  • Performance Optimisation : Track latency, slow performance, high utilisation issues, and recommend optimisation as required.
  • Scripting and Automation :

  • Scripting Languages : Python, Shell scripting, Bash, PowerShell.
  • Automation Tools : Ansible, Puppet, Chef.
  • Infrastructure Automation : Automating database deployment, patching, and scaling.
  • Networking and Infrastructure :

  • Networking Basics : TCP / IP, DNS, Firewall, Load Balancers.
  • Database Connectivity : Connection pooling, failover strategies, and multi-region deployment.
  • Storage and Disk Management : Understanding IOPS, latency, and throughput.
  • Infrastructure : Familiarity with AWS services like EC2 S3 VPC, Security Groups, Private and
  • Public subnets, IAM, CloudWatch, Cloudtrail, etc., and Azure services like Virtual Machines, Azure Functions, Virtual Network, Resource Manager, Skills :

  • Expertise in Linux OS ( RHEL, Ubuntu, CentOS).
  • Understanding of file systems (ext4 XFS, etc. ), permissions, and ownerships.
  • Knowledge of process monitoring, management, and troubleshooting.
  • Proficiency with tools like top, htop, vmstat, iostat, sar, and dstat to monitor CPU, memory, disk I / O, and network usage.
  • Ability to analyze system logs ( / var / log / , journalctl, dmesg) for troubleshooting.
  • Understanding of resource limits (CPU, memory, disk, network) and how they impact database performance.
  • Knowledge of partitioning tools (fdisk, parted) and file system management (mkfs, mount, umount).
  • Understanding of RAID configurations and Logical Volume Management (LVM) for storage scalability.
  • Troubleshooting and Debugging :

  • Log Analysis : Reading and analysing database and system logs.
  • Root Cause Analysis (RCA) : Performing in-depth analysis after major incidents and sharing RCA with customers.
  • Query Performance : Analysing slow queries, deadlocks, and resource contention.
  • Soft Skills :

  • Communication Skills : Clear written and verbal communication with internal and external
  • stakeholders.

  • Problem-Solving : Ability to prioritise, troubleshoot critical issues, and bring them to closure.
  • Collaboration : Working closely with DevOps, Infrastructure, and Engineering teams.
  • ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Bangalore

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NexionPro ServicesBangalore Urban, Karnataka, India
    Bangalore (Work From Office — All 5 Days).Site Reliability Engineer (SRE).The ideal candidate will bring expertise in observability tools, incident management, and automation, ensuring high availab...Show moreLast updated: 8 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    AIONBengaluru, KA, IN
    Quick Apply
    AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance,...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    RecroBengaluru, Karnataka, India
    Skills - Reliability , Python , Bash, Bigquery, GCP or Azure Cloud.Proficient in scripting / programming languages such as Python, Bash. Experience with cloud platforms (Google Cloud Platform & Azure ...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    InOpTra DigitalBangalore
    About the job : Job Description : Site Reliability Engineer For this position, we're looking for talented & experienced engineers who have a passi...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesBengaluru, Karnataka, India
    Greetings from TATA Consultancy Services!!.Thank you for expressing your interest in exploring a career possibility with the TCS Family. AWS with Terraform coding,Linux administration & troubleshoot...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    RecrootsBangalore
    Why you should join us : - You will impact millions of people all over the globe with your creative solutions ...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Aqilea (formerly Soltia)Bangalore, Karnataka, India
    We are a consulting company with a bunch of technology-interested and happy people!.We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer - Kubernetes / Terraform

    Site Reliability Engineer - Kubernetes / Terraform

    Hire AlphaBangalore
    We're Hiring | Senior Site Reliability Engineer (SRE) Location : Bangalore | Hybrid.Are you ready to help shape the future of cloud contact centers? were buildin...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SynechronBangalore Urban, Karnataka, India
    We have immediate opportunity for.SRE (Senior Site Reliability Engineer) 5 to 9 years.SRE (Senior Site Reliability Engineer). We began life in 2001 as a small, self-funded team of technology special...Show moreLast updated: 8 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    CommendaKoramangala, Bangalore Division, IN
    Quick Apply
    Commenda is building the world's first global business console, allowing multinational businesses to seamlessly comply with regulations everywhere that they operate. With paying customers and real p...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PeoplefyBangalore
    Lead Site Reliability Engineer Location : Bangalore Experience : 10 - 15 Years (with 5+ years in Site Reliability Engineering) Job Sum...Show moreLast updated: 23 days ago
    • Promoted
    Site Reliability Engineer - AWS / Kubernetes

    Site Reliability Engineer - AWS / Kubernetes

    Varite IndiaBangalore
    Site Reliability Engineer (SRE) We are looking for an experienced Site Reliability Engineer (SRE) to ensure the reliabil...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer - Elastic Kubernetes Service

    Site Reliability Engineer - Elastic Kubernetes Service

    Scaling TheoryBangalore
    Responsibilities : - You will be responsible for understanding requirements and SRE goals in depth from both tech and business perspectives.You will provide solutions...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    noonBangalore, IN
    Job Title : Site Reliability Engineer.In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' onlin...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Whitefield CareersBangalore
    Experience : 6- 10 years of relevant Period : 15 days or Immediate Skills Required : - Strong hands-on experience in Linux tro...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Genius Business Solutions India Pvt. LtdBangalore
    About the Role : As a Site Reliability Engineer (SRE) at GBSI, youll be at the heart of our mission to build and maintain scalable, reliable, and secure systems.You w...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    EurofinsBengaluru, Karnataka, India
    Eurofins IT Solutions Bengaluru Karnataka India.With 36 facilities worldwide Eurofins BioPharma Product Testing (BPT) is the largest network of bio / pharmaceutical GMP product testing laboratories p...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CirrusLabsBengaluru, Karnataka, India
    Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our custome...Show moreLast updated: 8 days ago