Job Description – AWS Platform Engineer (Financial Systems)
About the Role
We are seeking a Platform Engineer to take full ownership of mission-critical financial systems within an investment banking environment. This role is not a traditional “support” function—it requires a hands-on engineer who can ensure system resilience, reliability, and operational excellence across on-premises and AWS-hosted environments.
The ideal candidate is a hardcore technical engineer who can automate, troubleshoot, and optimize platforms end-to-end while driving operational maturity.
Key Responsibilities
- System Ownership & Reliability
- Take complete responsibility for assigned financial systems and ensure high availability.
- Troubleshoot complex issues across infrastructure, applications, and integrations.
- Conduct root cause analysis and implement permanent fixes to prevent recurrences.
- Operational Excellence
- Design and manage Disaster Recovery (DR) strategies; conduct periodic DR drills.
- Ensure timely patching, upgrades, and compliance with security standards.
- Define and manage backup and restore strategies.
- Build monitoring, logging, and alerting frameworks to proactively detect issues.
- SRE & Automation
- Apply Site Reliability Engineering principles to improve system performance and resilience.
- Automate operational tasks (deployments, failover tests, log analysis, scaling).
- Develop tooling and scripts (Python, Shell, Ansible) for efficiency and reliability.
- Implement self-healing mechanisms and runbooks for predictable operations.
- Cloud & Hybrid Environments
- Engineer and operate systems deployed on On-Prem and AWS platforms.
- Leverage key AWS services such as EC2, ECS / Fargate, RDS, S3, CloudWatch, IAM, Lambda, and VPC networking.
- Work closely with infrastructure teams to optimize scalability and performance.
Required Skills & Experience
Technical Skills5–10 years in platform engineering within financial services or capital markets.Strong SRE (Site Reliability Engineering) experience focused on automation, observability, and resilience.Expertise in Linux / Windows environments.Hands-on with key AWS services (EC2, ECS / Fargate, RDS, S3, IAM, Lambda, VPC).Strong automation and scripting skills (Python, Shell, Ansible, Terraform preferred).Proficiency in monitoring / observability tools (CloudWatch, Prometheus, Grafana, ELK, etc.).Operational ExpertiseProven experience in patching, DR, backup, monitoring, and system hardening.Strong troubleshooting skills across applications, middleware, and databases.Familiarity with incident, problem, and change management frameworks (ITIL preferred).PreferredExposure to financial applications (Front Arena, Calypso, Murex, or similar).Strong background in automation-driven operations and performance tuning.Soft Skills
Strong sense of ownership and accountability.Calm, decisive, and resilient in high-pressure financial environments.Collaborative, with excellent communication skills across business and technical teams.Passionate about automation and continuous improvement.Why Join Us?
This is an opportunity for a technical leader who wants to own platforms end-to-end, applying SRE principles to financials systems that are critical to investment banking operations. You will play a pivotal role in making these systems highly reliable, scalable, and secure.