Talent.com
Site Reliability Engineer-Vice President -Software Production Management & Reliability Engineering

Site Reliability Engineer-Vice President -Software Production Management & Reliability Engineering

Morgan StanleyMumbai, India
1 day ago
Job description

Morgan Stanley

Vice President - Software Production Management & Reliability Engineering

Profile Description

We're seeking someone to join our team as Vice President ( Site Reliability Engineer ) who will be responsible for providing swift, courteous, and knowledgeable customer service to end users of the production systems. This position is focused on user and systems support, monitoring systems alerts, and taking corrective action. IM Production Management Services is responsible for driving Resiliency, Automation, Performance, Stability, and Efficiency across Investment Management Technology.

Software Production Management & Reliability Engineering

This is Vice President position that oversees the production environment, ensuring the operational reliability of deployed software, and implements strategies to optimize performance and minimize downtime.

Investment_Management

In the Investment Management division, we deliver active investment strategies across public and private markets and custom solutions to institutional and individual investors.

IMIT App Dev

IMIT provides industry-leading strategies and solutions to enable business growth and deliver best in class functions to Morgan Stanley's Investment Management business.

Software Production Management & Reliability Engineering

This is Vice President position that oversees the production environment, ensuring the operational reliability of deployed software, and implements strategies to optimize performance and minimize downtime.

Morgan Stanley is an industry leader in financial services, known for mobilizing capital to help governments, corporations, institutions, and individuals around the world achieve their financial goals.

At Morgan Stanley India, we support the Firm's global businesses, with critical presence across Institutional Securities, Wealth Management, and Investment management, as well as in the Firm's infrastructure functions of Technology, Operations, Finance, Risk Management, Legal and Corporate & Enterprise Services. Morgan Stanley has been rooted in India since 1993, with campuses in both Mumbai and Bengaluru. We empower our multi-faceted and talented teams to advance their careers and make a global impact on the business. For those who show passion and grit in their work, there's ample opportunity to move across the businesses for those who show passion and grit in their work.

Interested in joining a team that's eager to create, innovate and make an impact on the world? Read on...

What you'll do in the role :

Proactively detecting, troubleshooting, and resolving all issues affecting production applications. This involves coordination with and escalation to development and external teams where necessary. This team owns all issues escalated to us until it is resolved or a workaround is provided for end user to continue functioning.

  • Responsible for maintaining clear, concise, and timely communications with affected parties during the investigation and resolution of any individual or system-wide outage. Responsible for the stability of the Production environment.
  • Develop and continually revise (in partnership with other teams where necessary) suitable policies and procedures to ensure appropriate application development standards are available to guide development for systems deployed to Production.
  • As the gatekeepers of the Production environment, responsible for ensuring the Change Implementation Management guidelines / policies are adhered to for all systems deployed to Production.
  • Responsible for servicing all requests for data or other activities that require access to Production systems
  • Work with development teams at the appropriate stages in application development to ensure any new systems or projects meet the Production standard
  • Responsible for maintaining and growing a body of knowledge that is accessible to all team members. Ensure information regarding any support related activities or issues are available and easily accessible. The goal is to improve self-reliance and reduce dependency on the availability of development or external team resources for the initial troubleshooting and resolution of problems.
  • As a team member with expertise in deep analytical triage, you will provide subject matter expertise in debugging, issue analysis and troubleshooting, working with business and technical colleagues to provide reviews and recommendations to avoid any future application issues. Produce guidance documentation, standards and procedures, products assessments, and training material including working with the various application and infrastructure support teams ensuring that they are documenting every single troubleshooting step in Morgan Stanley knowledge base system to resolve issues in a faster time frame. You will serve as a fully seasoned / proficient technical resource; provide technical knowledge in outage management and proactive solutions to improve.

What you'll bring to the role :

  • At least 7 years' relevant experience would generally be expected to find the skills required for this role
  • Minimum 7 years of experience in developing and / or supporting Enterprise Applications
  • Willingness to embrace Agile and DevOps / SRE concepts.
  • Solid analytical skills, problem determination, and resolution recovery processes
  • Have experience with observability tools such as Prometheus, Grafana , Loki, kibana, splunk etc,
  • Ability to interface and cultivate excellent working relationships with technology teams, business analysts, and vendors
  • Strong Unix Shell scripting experience required.
  • Have administrative competence in at least one major programming language or platform (for example : Perl, Powershell, Python or Java)
  • Should be a fast learner of technologies in a quick paced environment.
  • Have strong organizational skills and the ability to manage multiple tasks and high pressure situations for outage handling, management, or resolution
  • Is driven to learn new technologies, techniques and what it takes to be an integral member of this team

  • Hands-on experience administering large-scale, high-availability systems and the tools to monitor performance and availability
  • BS / MS or equivalent, preferably in quantitative discipline (Computer Science, Computer Engineering, EE, Math, Physics).
  • Experience with incident on call and ability to respond to emergencies on a 24 / 7 basis
  • Experience working with Financial Services area will be a plus
  • Experience with monitoring & incident response automation tools BigPanda, PagerDuty, Apica, and configuring monitors
  • Experience with business process automation tools UIPATH.
  • WHAT YOU CAN EXPECT FROM MORGAN STANLEY :

    We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - aren't just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, you'll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. There's also ample opportunity to move about the business for those who show passion and grit in their work.

    To learn more about our offices across the globe, please copy and paste https : / / www.morganstanley.com / about-us / global-offices into your browser.

    Morgan Stanley is an equal opportunities employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of the global communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a culture of inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.

    Create a job alert for this search

    Software Reliability • Mumbai, India

    Related jobs
    • Promoted
    Natobotics - Vice President - Site Reliability Engineering

    Natobotics - Vice President - Site Reliability Engineering

    NatoboticsMumbai, India
    Position : VP Site Reliability Engineering (SRE) Job Type : Full-time Executive Summary ...Show moreLast updated: 30+ days ago
    • Promoted
    RELX - Site Reliability Engineer - IAC Terraform

    RELX - Site Reliability Engineer - IAC Terraform

    REED ELSEVIER INDIA (a part of RELX India Pvt Ltd)Mumbai
    Job Description : - Lead initiatives to identify and eliminate manual, repetitive tasks through automation and tooling.Develop s...Show moreLast updated: 30+ days ago
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Session AIMumbai, MH, IN
    Quick Apply
    Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of.We work with some of the leading brands nationwide and we innovate how brands connect with and convert custo...Show moreLast updated: 30+ days ago
    • Promoted
    Akasa Air - Site Reliability Engineer

    Akasa Air - Site Reliability Engineer

    SNV AVIATION PRIVATE LIMITED / Akasa AirMumbai
    As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure. This includes troubleshooting issues, developing and maintaini...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    Media.netMumbai, Maharashtra, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CodeKarmadombivli, maharashtra, in
    Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 6 days ago
    • Promoted
    Natobotics - Vice President - Site Reliability Engineering

    Natobotics - Vice President - Site Reliability Engineering

    Natobotics Technologies Pvt LimitedMumbai
    Job Summary : We are seeking a visionary and strategic VP Site Reliability Engineering (SRE) to join the leadership team. This is a foundational role within the CTO o...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SynechronMumbai, Maharashtra, India
    We have immediate opportunity for.Site Reliability Engineer Devop 5 to 9 years.SRE (Senior Site Reliability Engineer) Devop. We began life in 2001 as a small, self-funded team of technology speciali...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    o9 Solutions, Inc.dombivli, maharashtra, in
    Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub Servicesdombivli, maharashtra, in
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 28 days ago
    • Promoted
    Zycus - Site Reliability Engineering Manager

    Zycus - Site Reliability Engineering Manager

    Zycus Infotech Pvt LtdMumbai
    Job Description : Zycus is looking for a Site Reliability Engineer (SRE) with deep expertise in Kubernetes, automation, and Linux systems. The ideal candidate will ha...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer / Lead - CI / CD Pipeline

    Site Reliability Engineer / Lead - CI / CD Pipeline

    SolutionTech HRMumbai
    Key Responsibilities : - Lead and mentor a team of SREs / DevOps Engineers, fostering a culture of ownership, reliability,...Show moreLast updated: 29 days ago
    • Promoted
    ▷ Only 24h Left : Sr Site Reliability Engineer

    ▷ Only 24h Left : Sr Site Reliability Engineer

    Media.netMumbai, Maharashtra, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 12 days ago
    • Promoted
    Media.net - Senior Site Reliability Engineer - IAC Terraform

    Media.net - Senior Site Reliability Engineer - IAC Terraform

    Media.netMumbai
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 22 days ago
    • Promoted
    MindCraft Software - Site Reliability Engineer - DevOps

    MindCraft Software - Site Reliability Engineer - DevOps

    MindCraft Software Pvt. Ltd.Thane
    SRE (Site Reliability Engineer) Exp : 5-7 years Location : Thane - 5+ years in SRE or DevOps roles supporting high-scale platforms (fint...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Kalyan-Dombivli, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Associate Platform Reliability Engineer (SRE)

    Associate Platform Reliability Engineer (SRE)

    JefferiesMumbai, Maharashtra, India
    Jefferies,’’ ‘‘we,’’ ‘‘us’’ or ‘‘our’’) is a U.Our largest subsidiary, Jefferies LLC, a U.Jefferies International Limited, a U. Our strategy focuses on continuing to build out our investment banking...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    XequalstoMumbai
    Description : Senior Site Reliability Engineer (SRE) Location : Mumbai , Navi Mumbai - Hybrid office visits will be scheduled as and when requi...Show moreLast updated: 7 days ago