Commenda is building the world's first global business console, allowing multinational businesses to seamlessly comply with regulations everywhere that they operate. With paying customers and real production traffic, reliability is already a top-line business metric. You’ll be the first dedicated SRE, setting the vision and hands-on practices that keep our web app, backend services, and critical third-party integrations up and humming—even as we scale 10×.
This is a high-impact, high-autonomy role : you’ll report to the CTO, work shoulder-to-shoulder with every engineer, and have the authority to knock down whatever walls stand between our users and 99.99% availability.
What you'll do
First 90 days :
- Audit our current infra (AWS, Terraform, Docker, CI / CD) and spin up a pragmatic roadmap for reliability, observability, and incident response.
- Stand up monitoring & alerting (Prometheus / Grafana, OpenTelemetry, PagerDuty—or better tools you champion).
- Harden CI / CD pipelines and IaC templates so every deploy is boring and reversible.
- Draft, socialize, and enforce lightweight runbooks; mentor engineers to own their services in prod.
Next 12 months :
Drive our SLO / SLI program—own the numbers, surface them to the org, and course-correct when we drift.Lead game days, post-mortems, and deep-dive root cause analyses that actually change systems & culture.Evolve our capacity-planning, autoscaling, and cost-to-serve models as we land bigger customers.Champion security best practices (least privilege, secret management, vulnerability scanning).Continuously hunt for “toil” and automate it away—think one-click staging envs, zero-touch rollbacks, etc.Requirements
We care less about your education and work experience and more about culture fit, commitment and drive.
This is probably the right role for you if :
You read the Site Reliability Engineering book and felt something click—it wasn’t just a set of practices, it was a philosophy you’d been living all along. Keeping systems upright is, for you, an act of care : every nines-worth of uptime is a promise kept to real humans depending on your software.You’re happiest when you can tinker, prototype, and ship improvements daily—no permission slips needed. You see an incident post-mortem as an open invitation to automate, refactor, or re-architect whatever’s brittle, and you’re energized by turning lessons learned into lasting fixes.You feel alive when production is on fire at 2am, but you know better than to let it keep happening.Junior applicants who want to get good very very quickly are encouraged to apply, as are seniors who want to hit the ground running and do great work
Benefits
Competitive comp on our Bangalore pay scale, health insurance,