Talent.com
SRE (Site Reliability Engineer)

SRE (Site Reliability Engineer)

KaitongoBengaluru, Republic Of India, IN
6 hours ago
Job description

About Kaitongo

Kaitongo is a Canadian GPT-powered Market Intelligence startup helping client-facing professionals stay ahead with relevant market insights and credible, personalized engagement. Our platform blends domain data, advanced AI, and engineering to deliver intelligence that drives business relationships.

We’re now looking for a DevOps Engineer to take full ownership of our infrastructure — scaling and securing the systems that power our AI and data-driven products.

Role Overview

As a DevOps Engineer , you’ll lead the design, automation, and monitoring of Kaitongo’s hybrid infrastructure across Hetzner bare-metal (RKE2 / Kubernetes) and AWS.

You’ll manage Kubernetes clusters, Docker-based workloads (build, harden, publish), AWS serverless (10+ Lambdas), CI / CD (Bitbucket Pipelines), and critical data infra (PostgreSQL + PgBouncer, OpenSearch, Weaviate, Redis) with a strong focus on security and cost efficiency.

You’ll work closely with backend, data, and AI engineers to ensure deployments are fast, stable, and secure.

This is a hands-on role with high autonomy and a chance to shape DevOps practices from the ground up.

Key Responsibilities

Infrastructure Management

  • Own RKE2 Kubernetes on Hetzner, including Docker runtime, Helm / YAML, Longhorn storage, ingress, and RBAC.
  • Manage core services : Django / React apps, PgBouncer, PostgreSQL, OpenSearch, Weaviate, Redis.
  • Implement Infra-as-Code with Helm and CloudFormation (Terraform welcome).
  • Administer Harbor (private Docker registry) : image scanning, retention policies, access control.

CI / CD & Automation

  • Evolve Bitbucket Pipelines for automated builds, tests, and deployments.
  • Build, harden, and publish Docker images for backend, frontend, and infra components;
  • implement multi-arch where needed.

  • Implement canary / blue-green strategies, version tagging, and safe rollbacks.
  • Containerization & Orchestration

  • Operate and optimize Docker + Kubernetes workloads (resource requests / limits, autoscaling, HPA).
  • Enforce best practices for Docker image security (minimal bases, SBOMs, scanning), service isolation, secrets management, and multi-env promotion (dev / stage / prod).
  • Monitoring, Security & Reliability

  • Ship observability with Prometheus / Grafana, centralized logs (OpenSearch), and actionable alerting.
  • Manage TLS / Let’s Encrypt (wildcard), network policies, IAM / RBAC, and routine security audits.
  • Cost monitoring and optimization across AWS (Lambda / S3 / SQS) and K8s resources.
  • Implement reliable backups / DR for PostgreSQL, OpenSearch, Weaviate (S3 & Longhorn).
  • Collaboration & Leadership

  • Partner with Data / AI teams on RAG pipelines and embeddings;
  • support backend releases and performance tuning.

  • Champion documentation, runbooks, automation-first mindset, and observability standards.
  • Qualifications

  • 4+ years in DevOps / SRE / Cloud Infrastructure.
  • Strong Kubernetes (RKE2 / EKS) and Docker experience;
  • solid Linux / Bash.

  • AWS (Lambda, S3, SQS, IAM, VPC, CloudWatch) proficiency.
  • PostgreSQL admin & performance tuning;
  • PgBouncer familiarity.

  • CI / CD with Bitbucket Pipelines (or GitHub / Jenkins / GitLab).
  • IaC with Helm and CloudFormation (Terraform a plus).
  • Web stack : Nginx, SSL / TLS.
  • Bonus : OpenSearch, Weaviate / vector DBs, Harbor, Hetzner, Celery / Django, legacy Puppet.
  • Bachelor’s degree in Computer Science, Engineering, or related discipline.
  • Why Join Kaitongo

  • Build the backbone of an AI-first market intelligence platform.
  • Own infrastructure strategy and shape DevOps culture from day one.
  • Work alongside senior AI and data engineers solving real-world challenges.
  • Competitive salary + growth opportunities in a fast-moving GenAI environment.
  • Create a job alert for this search

    Site Reliability Engineer • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    Media.netbangalore, karnataka, in
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Quant-data.ioBengaluru, Republic Of India, IN
    DevOps Engineer (AWS Focus | Bangalore | Full-Time).Drive scalable automation and cloud infrastructure for enterprise-grade data platforms. AI-native data platforms that power the next generation of...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    Tata Consultancy ServicesBengaluru, Karnataka, India
    Senior Site Reliability Engineer (SRE).Senior Site Reliability Engineer (SRE).Desired Experience Range : 7 - 10 yrs.Notice Period : Immediate to 90Days only. We are currently planning to do a Virtual....Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    JRD SystemsBengaluru, Karnataka, India
    Site Reliability Engineer (Windows / Cloud / Automation).We are seeking an experienced Site Reliability Engineer with a strong background in managing Windows infrastructure and cloud environments.T...Show moreLast updated: 22 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Capgeminihosur, tamil nadu, in
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmahosur, tamil nadu, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer (SRE II)

    Site Reliability Engineer (SRE II)

    greytHRBengaluru, Karnataka, India
    We are looking for a passionate and detail-oriented.Site Reliability Engineer (SRE).As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our infrast...Show moreLast updated: 24 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeBengaluru, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 16 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    BambooBoxBengaluru, Republic Of India, IN
    With its advanced AI and ML-driven platform,.We work with clients like Airtel Business, Darwinbox, Acalvio, and other global organizations. We are backed by investors such as Peak XV (earlier Sequoi...Show moreLast updated: 6 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    greytHRBengaluru, Republic Of India, IN
    We are looking for a passionate and detail-oriented.Site Reliability Engineer (SRE).As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our infrast...Show moreLast updated: 24 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ThalesBengaluru, Republic Of India, IN
    Apply SRE core tenets of measurement (SLI / SLO / SLA), eliminate toil, and reliability modeling.Enable and educate development teams on industry best practice design patterns, ways of working and oper...Show moreLast updated: 19 hours ago
    • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaServicehosur, tamil nadu, in
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    super.moneyBengaluru, Karnataka, India
    Site Reliability Engineer (SRE) Level 3.A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and...Show moreLast updated: 3 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-21295]

    Sr Engineer, Site Reliability [T500-21295]

    TMUS Global Solutionshosur, tamil nadu, in
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupBangalore, IN
    Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 2 days ago
    • Promoted
    SRE (Site Reliability Engineer)

    SRE (Site Reliability Engineer)

    TrantorBengaluru, Republic Of India, IN
    We are seeking a talented and experienced Senior Software Engineer to join our Cloud Engineering team.You will play a crucial role in implementing and supporting cloud infrastructure solutions that...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Alp Consulting Ltd.Bengaluru, Republic Of India, IN
    Terraform mastery for advanced infrastructure management and.Deep understanding of Kubernetes internals and cluster management at scale. Advanced scripting and automation skills (Python preferred).E...Show moreLast updated: 22 days ago