Talent.com
This job offer is not available in your country.
Site Reliability Engineer - Kubernetes

Site Reliability Engineer - Kubernetes

HireloBangalore
19 days ago
Job description

Job Description :

The candidate will be required to have skills across the following :

Database Administration (DBA) Skills :

  • Relational Databases : MySQL, PostgreSQL, Oracle, MS SQL Server.
  • Database Backup and Recovery : Tools and strategies for database backups and disaster recovery.
  • Performance Tuning : Query optimization, indexing strategies, and database performance troubleshooting.
  • Database Security : User management, roles, access control, and auditing.

Infrastructure as a Service Knowledge :

  • Infrastructure as Code (IaC) : Terraform, CloudFormation, Kubernetes.
  • Kubernetes and Containers : Good Knowledge and Understanding of Kubernetes and the usage of Containers.
  • Observability Tools : ELK stack (Elasticsearch, Logstash, Kibana).
  • Database Migration : Migrating databases across different platforms or cloud environments.
  • Infrastructure Scaling : Vertical and horizontal scaling techniques in cloud environments.
  • SRE Principles and knowledge (Site Reliability Engineering) :

  • Strong hands-on experience in AWS and Azure cloud, and a fair understanding of Google Cloud would be required.
  • Experience in handling APIs, troubleshooting API calls, and ensuring seamless integration and performance.
  • Incident Management : Handling database outages, incident response, and on-call rotations.
  • Monitoring and Alerting : Tools like Prometheus, Grafana, Datadog, CloudWatch, suggest proactive monitoring for the application stack.
  • Understanding of core SRE principles : SLA, SLI, SLO, Error budgets, etc.
  • Disaster Recovery Planning : Ensuring high availability (HA) and disaster recovery (DR) solutions.
  • Performance Optimisation : Track latency, slow performance, high utilisation issues, and recommend optimisation as required.
  • Scripting and Automation :

  • Scripting Languages : Python, Shell scripting, Bash, PowerShell.
  • Automation Tools : Ansible, Puppet, Chef.
  • Infrastructure Automation : Automating database deployment, patching, and scaling.
  • Networking and Infrastructure :

  • Networking Basics : TCP / IP, DNS, Firewall, Load Balancers.
  • Database Connectivity : Connection pooling, failover strategies, and multi-region deployment.
  • Storage and Disk Management : Understanding IOPS, latency, and throughput.
  • Infrastructure : Familiarity with AWS services like EC2 S3 VPC, Security Groups, Private and
  • Public subnets, IAM, CloudWatch, Cloudtrail, etc., and Azure services like Virtual Machines, Azure Functions, Virtual Network, Resource Manager, Skills :

  • Expertise in Linux OS ( RHEL, Ubuntu, CentOS).
  • Understanding of file systems (ext4 XFS, etc. ), permissions, and ownerships.
  • Knowledge of process monitoring, management, and troubleshooting.
  • Proficiency with tools like top, htop, vmstat, iostat, sar, and dstat to monitor CPU, memory, disk I / O, and network usage.
  • Ability to analyze system logs ( / var / log / , journalctl, dmesg) for troubleshooting.
  • Understanding of resource limits (CPU, memory, disk, network) and how they impact database performance.
  • Knowledge of partitioning tools (fdisk, parted) and file system management (mkfs, mount, umount).
  • Understanding of RAID configurations and Logical Volume Management (LVM) for storage scalability.
  • Troubleshooting and Debugging :

  • Log Analysis : Reading and analysing database and system logs.
  • Root Cause Analysis (RCA) : Performing in-depth analysis after major incidents and sharing RCA with customers.
  • Query Performance : Analysing slow queries, deadlocks, and resource contention.
  • Soft Skills :

  • Communication Skills : Clear written and verbal communication with internal and external
  • stakeholders.

  • Problem-Solving : Ability to prioritise, troubleshoot critical issues, and bring them to closure.
  • Collaboration : Working closely with DevOps, Infrastructure, and Engineering teams.
  • ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Bangalore

    Related jobs
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    TRUGlobalBengaluru, Karnataka, India
    Site Reliability Engineer (SRE) with Python Development Expertise.Site Reliability Engineer (SRE).Python development experience to join our team. The ideal candidate will be responsible for ensuring...Show moreLast updated: 1 hour ago
    Site Reliability Engineer

    Site Reliability Engineer

    AIONBengaluru, KA, IN
    Quick Apply
    AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance,...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Kubernetes / Terraform

    Site Reliability Engineer - Kubernetes / Terraform

    Hire AlphaBangalore
    We're Hiring | Senior Site Reliability Engineer (SRE) Location : Bangalore | Hybrid.Are you ready to help shape the future of cloud contact centers? were buildin...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PeoplefyBangalore
    Lead Site Reliability Engineer Location : Bangalore Experience : 10 - 15 Years (with 5+ years in Site Reliability Engineering) Job Sum...Show moreLast updated: 26 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    CommendaKoramangala, Bangalore Division, IN
    Quick Apply
    Commenda is building the world's first global business console, allowing multinational businesses to seamlessly comply with regulations everywhere that they operate. With paying customers and real p...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Aqilea (formerly Soltia)Bangalore, Karnataka, India
    We are a consulting company with a bunch of technology-interested and happy people!.We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - AWS / Kubernetes

    Site Reliability Engineer - AWS / Kubernetes

    Varite IndiaBangalore
    Site Reliability Engineer (SRE) We are looking for an experienced Site Reliability Engineer (SRE) to ensure the reliabil...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer - Elastic Kubernetes Service

    Site Reliability Engineer - Elastic Kubernetes Service

    Scaling TheoryBangalore
    Responsibilities : - You will be responsible for understanding requirements and SRE goals in depth from both tech and business perspectives.You will provide solutions...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    MyRemoteTeam IncBengaluru, Karnataka, India
    MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent.We empower businesses by providing world-class software engineers, operations suppo...Show moreLast updated: 1 hour ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Whitefield CareersBangalore
    Experience : 6- 10 years of relevant Period : 15 days or Immediate Skills Required : - Strong hands-on experience in Linux tro...Show moreLast updated: 19 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    noonBangalore, IN
    Job Title : Site Reliability Engineer.In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' onlin...Show moreLast updated: 19 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    RecroBangalore Urban, Karnataka, India
    Infrastructure Engineer - Sarvam AI.The Sarvam AI Reasoning team is building sophisticated reasoning capabilities for India's first sovereign AI platform. We are seeking a talented Infrastructure En...Show moreLast updated: 26 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    EurofinsBengaluru, Karnataka, India
    Eurofins IT Solutions Bengaluru Karnataka India.With 36 facilities worldwide Eurofins BioPharma Product Testing (BPT) is the largest network of bio / pharmaceutical GMP product testing laboratories p...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Concentrix CatalystBengaluru, IN
    Senior Site Reliability Engineer.Remote (may need to travel to nearby Concentrix office as per business need).Minimum Experience required : 8+ Years. Stakeholder Management Working with key technolog...Show moreLast updated: 1 hour ago