About the Company
Trianz believes that companies around the world face three challenges in their digital transformation journeys - shrinking ‘time to transform’ due to competition & AI, lack of digital-ready talent, and uncertain economic conditions. To help clients leapfrog over these challenges, Trianz has built IP and platforms that have transformed the adoption of the cloud, data, analytics & insights AI.
Specifically, the following Trianz platforms are changing the way companies approach transformations in various disciplines :
- Concierto : A fully automated platform to Migrate, Manage, and Maximize the multi & hybrid cloud. A zero code and SaaS platform, Concierto allows teams to migrate to AWS, Azure and GCP and manage them efficiently from a single pane of glass. Visit www.concierto.cloud for more information.
- Extrica Data to AI Platform : Built on the concept of ‘federated or distributed data’, Extrica revolutionizes how users access data anywhere in the company’s ecosystems; productizes data and makes it available in a Netflix like user experience while delivering BI and AI powered insights. Visit www.extrica.io for more.
- Pulse : Recognizing that workforces will be distributed, mobile, and fluid, Trianz has built a ‘future of work’ digital workplace platform called Pulse. Visit www.trianz.com / Pulse .
Since the market launch of this strategy in mid-2023, Trianz has experienced enormous growth, success, and recognition. Some of Trianz’ built IP in data and analytics was acquired by Amazon. Since then, Trianz has been made an engineering partner of Amazon for building / supporting connected ecosystems across multiple AWS platforms.
Most recently, Trianz and AWS have signed a strategic collaboration agreement within which the two companies will work on joint roadmaps / solutions for the cloud; AWS will buy Trianz | Concierto in bulk for AWS partners to use for migrations; AWS will also recommend Concierto to their MSPs and finally, AWS Professional Services and Trianz have signed an agreement for joint solutioning and customer delivery. Read more : Trianz enters into a Strategic Collaboration Agreement with AWS to Revolutionize Cloud Adoption and Management (yahoo.com) .
About the Role
We're looking for a seasoned infrastructure leader to own and evolve our AWS cloud platform—the foundation that powers our business 24x7. In this role, you'll lead a high-performing team of CloudOps and SRE engineers, driving operational excellence while shaping our cloud architecture strategy and security posture for scale. This isn't just an operations role. You'll influence how we build, secure, and run our infrastructure—bridging the gap between reliability, innovation, and security. If you thrive on building resilient systems, mentoring technical teams, and making strategic architecture decisions that impact the entire organization, this is your role.
Responsibilities
Operational Excellence at Scale :Lead a unified CloudOps / SRE team across L1 / L2 / L3 support, ensuring seamless 24x7 operations through structured shift rotations and escalation frameworks.Drive incident management excellence—from first response to root cause analysis and continuous improvement.Maintain and exceed operational KPIs : MTTA, MTTR, uptime SLAs, and availability objectives.Oversee day-to-day operations across our AWS footprint : EC2, VPC, ELB / ALB, EKS / ECS, RDS / Aurora, S3, IAM, Lambda, CloudFront, and CloudWatch.Architecture Leadership & Platform Evolution :Provide architectural oversight for production workloads, guiding teams on scalable, cost-optimized, and secure AWS designs.Review and approve architecture patterns, deployment topologies, and infrastructure standards.Partner with Cloud Architects to establish guardrails, reference architectures, and reusable Infrastructure-as-Code modules.Create feedback loops where operational insights directly influence design decisions—ensuring we build for observability, resilience, and efficiency.Champion modernization initiatives : containerization, serverless adoption, and edge optimization strategies.Security Posture & Compliance :Own cloud security governance across IAM, network segmentation, encryption, logging, and compliance.Drive continuous security monitoring using AWS Security Hub, GuardDuty, IAM Access Analyzer, Config, Inspector, and third-party CSPM tools.Ensure automated remediation for vulnerabilities, misconfigurations, and security baseline drift.Maintain compliance with SOC2, ISO27001, CIS Benchmarks, and customer-specific security requirements.Lead operational security hygiene : identity lifecycle management, least privilege enforcement, secrets management, and patch compliance.Coordinate cloud security incident response with tight CloudOps-SecOps integration.Automation & Tooling Strategy :Drive automation and tooling adoption across :Monitoring & Observability : CloudWatch, Elastic Stack, distributed tracing.Logging & Analytics : CloudWatch Logs, ELK, OpenSearchITSM : ServiceNow, Jira Service ManagementIaC & Automation : CloudFormation, Terraform, Python, Shell scripting, GitOps workflowsBuild self-healing operations through automated provisioning, scaling, failover, and compliance checkingGovernance & Continuous Improvement
Establish and refine operational playbooks, runbooks, SOPs, and change control frameworksImplement ITIL-aligned processes for change, problem, and incident managementDrive continuous improvement through automation, operational analytics, and team feedback loops.Experience & Leadership
15–20 years in infrastructure / operations with 8+ years leading cloud or production operations teamsProven track record managing 24x7 support teams of 20+ engineers in high-availability AWS environmentsExperience scaling teams and operations while maintaining quality and reliabilityTechnical Expertise
Deep knowledge of AWS architecture, networking, security, and distributed systems designStrong understanding of cloud security posture management, identity governance, and compliance frameworks (SOC2, ISO27001, CIS Benchmarks)Expertise in incident management, SRE practices, reliability engineering, and operational KPIsHands-on experience with Infrastructure-as-Code (Terraform, CloudFormation), automation, and GitOps workflowsStrategic & Communication Skills
Ability to translate technical complexity into clear business impact for executive audiencesTrack record of building high-performing, collaborative teamsStrong stakeholder management and cross-functional partnership capabilitiesBias toward automation, continuous improvement, and operational excellence