Job Description
Work Location Options :
Hybrid
At American Express, our culture is built on a 175-year history of innovation, shared and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career.
Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express.
Key Responsibilities :
- SRE Strategy and Leadership : Develop and implement a comprehensive SRE strategy aligned with the company's goals and objectives. Lead junior members of the team to drive the reliability, performance, and scalability of technology solutions.
- Observability and Monitoring : Establish observability practices to ensure real-time insights into system performance, availability, and customer experience. Implement monitoring tools, metrics, and dashboards to proactively identify and address potential issues.
- Reliability Engineering Best Practices : Promote and implement standard methodologies, including error budgeting, chaos engineering, and disaster recovery planning. Cultivate a culture of resilience and reliability within technology.
- Automation and Efficiency : Champion automation initiatives to streamline operational workflows, deployment processes, and incident response tasks. Leverage automation tools and orchestration to improve reliability and reduce manual intervention.
- Production Support Optimization : Lead all aspects of end-to-end production support process, including incident management, problem resolution, and service-level agreement (SLA) compliance. Drive continuous improvement initiatives to enhance operational effectiveness and reduce mean time to resolution (MTTR).
- Colleague Journeys : Collaborate with multi-functional teams to enhance colleague journeys through seamless and reliable technology experiences.
Qualifications :
8-13 years of experience and degree or equivalent experience in Computer Science, Information Technology, or related field. Advanced certifications in SRE or related are a plus.Leadership and people management skills, with the ability to inspire and empower successful SRE teams.Required Skills :
Hands-on coding of highly available distributed systems in any of the programming languages : Java / Python / JavaScriptKnowledge on modern observability stack splunk, elastic search, Prometheus, GrafanaKnowledge of cloud-based SRE practices and experience with public cloud platforms such as AWS, Azure, or Google Cloud.Familiarity with microservices architecture and design.Demonstrated expertise in driving culture change, DevOps practices, and continuous improvement in SRE and production support functions.Deep understanding of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms.Knowledge of ServiceNow or any other ticketing tools, ITIL experience.Join our innovative team and be at the forefront of advancing Site Reliability Engineering and production support in the Global Risk and Compliance Technology space. If you are passionate about driving reliability, observability, and excellence in customer experiences, we invite you to apply and join our mission to redefine the future of risk and compliance technology. Apply now and join us in shaping the reliability and performance of solutions for a secure and compliant world.
We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include :
Competitive base salariesBonus incentivesSupport for financial-well-being and retirementComprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)Flexible working model with hybrid, onsite or virtual arrangements depending on role and business needGenerous paid parental leave policies (depending on your location)Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)Free and confidential counseling support through our Healthy Minds programCareer development and training opportunitiesAmerican Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to application.
Other Jobs You May Be Interested In
Senior Software Engineer I
CHENNAI, Tamil Nadu, India
Senior Pega Software Engineer I - BPM - Global Services Group Technology
Sunrise, Florida, United States
Senior Software Engineer II
Bengaluru Urban, Karnataka, India
Software Engineer II
Phoenix, Arizona, United States
Software Engineer I
Bengaluru Urban, Karnataka, India
Senior Engineering Manager- Golang
Bengaluru Urban, Karnataka, India
Software Engineer (Java) Merchant Services Technologies
Phoenix, Arizona, United States
Software Engineer III
Bengaluru Urban, Karnataka, India
AWS DevOps Engineer
Phoenix, Arizona, United States
Slide 1 of 3When you become part of our Talent Community, well keep you posted about future job opportunities that you may be a match for, as well as career-related events.