Key Responsibilities
- Platform Stability & Performance
- Own end-to-end platform stability across services and components.
- Monitor system performance, identify bottlenecks, and drive resolution.
- Define and track key metrics (latency, uptime, error rates) to measure platform health.
- Issue Identification & Technical Debt
- Build and maintain a backlog of platform issues and tech debt.
- Prioritize and coordinate resolution with development teams.
- Ensure timely closure of high-impact issues through structured triage.
- Platform Design & Architecture
- Collaborate on evolving platform architecture to meet scalability and reliability goals.
- Align platform design with non-functional requirements (NFRs) such as security, performance, and maintainability.
- Contribute to long-term platform roadmap and modernization initiatives.
- Tooling & Automation
- Evaluate and implement monitoring and observability tools.
- Automate health checks, performance benchmarks, and reporting dashboards.
What Success Looks Like
A stable, scalable platform with reduced incidents and faster recovery times.Clear governance and monitoring processes in place.A well-defined roadmap for platform evolution and technical debt reduction.Strong collaboration between platform, product, and engineering teams.Skills / Requirements : Certified Lead System Architect + Certified Decisioning Architect (w / hands on Decisioning experience)
Location - Hyderabad / Chennai
Show more
Show less
Skills Required
Certified Decisioning Architect, Certified Lead System Architect, Observability tools