Talent.com
This job offer is not available in your country.
Platform Engineering, Monitoring and Observability Lead - SRE Focus (Lead Systems Operations Engineer)

Platform Engineering, Monitoring and Observability Lead - SRE Focus (Lead Systems Operations Engineer)

WELLS FARGO BANKbangalore, India
4 hours ago
Job description

About this role :

"Wells Fargo is seeking a Lead Systems Operations Engineer. We believe in the power of working together because great ideas can come from anyone. Through collaboration, any employee can have an impact and make a difference for the entire company. Explore opportunities with us for a career in a supportive environment where you can learn and grow."

In this role, you will :

  • Lead complex, broad impact initiatives including provision of high level systems consultation for the technology teams
  • Work as key participant in large scale planning of computer systems and network infrastructure for Systems Operations functional area
  • Review and analyze complex technical challenges, as well as escalated support issues related to core business solutions that require in depth evaluation of multiple factors, such as alternatives, enhancements, periodic systems reviews, or improvements to existing systems
  • Make decisions on technical changes and enhancements
  • Consult with engineering team on change design requiring solid understanding of technical process controls or standards that influence and drive new initiatives
  • Collaborate and consult with technical peers, colleagues, and mid to more experienced level managers to resolve systems support issues and achieve goals

Required Qualifications :

  • 5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following : work experience, training, military experience, education
  • Desired Qualifications :

  • Lead the strategy and execution of monitoring and observability initiatives across infrastructure and applications.
  • Architect and maintain dashboards, alerts, and telemetry pipelines using tools like Grafana, Prometheus, and Elastic APM.
  • Integrate and optimize observability platforms including Splunk, AppDynamics, ThousandEyes, and ITRS Geneos.
  • Collaborate with SRE and DevOps teams to ensure system reliability, scalability, and performance.
  • Develop automation scripts in Python and Shell for data collection, analysis, and alerting.
  • Drive root cause analysis and incident response using observability data.
  • Evaluate and implement Gen AI solutions to enhance observability and predictive analytics.
  • Mentor junior engineers and promote best practices in monitoring and reliability engineering.
  • Bachelor's or Master's degree in Computer Science, Engineering, or related field.
  • 5+ years of experience in IT operations, with at least 3 years in a lead role focused on observability and SRE.
  • Proven expertise in tools such as :
  • Splunk, ITRS Geneos, Grafana, Prometheus, Elastic APM
  • ThousandEyes, AppDynamics
  • Strong scripting skills in :
  • Python (especially for data analytics and automation)
  • Shell scripting
  • Deep understanding of SRE principles including SLIs, SLOs, error budgets, and incident management.
  • Experience with cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes, Docker).
  • Certifications in observability tools or cloud platforms (e.g., Splunk Certified Admin, AWS Cloud Practitioner).
  • Experience with machine learning or Gen AI frameworks applied to observability (e.g., anomaly detection, predictive alerting).
  • Familiarity with CI / CD pipelines and infrastructure as code (Terraform, Ansible).
  • Strong analytical mindset with a passion for data-driven decision-making.
  • Excellent communication and stakeholder management skills.
  • Job Expectations :

  • The team operates on a 16x5 schedule , ensuring coverage across critical business hours and extended support windows.
  • Candidates must be willing to participate in weekend on-call rotations , providing support for high-priority incidents and system health checks.
  • As part of production management responsibilities , the lead is expected to be available during off-hours when necessary to support major incidents, deployments, or escalations.
  • Flexibility and responsiveness are key, especially in high-impact scenarios where rapid resolution is essential to maintaining system reliability and performance.
  • Posting End Date : 2 Oct 2025

  • Job posting may come down early due to volume of applicants.
  • We Value Equal Opportunity

    Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

    Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.

    Candidates applying to job openings posted in Canada : Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

    Applicants with Disabilities

    To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo .

    Drug and Alcohol Policy

    Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.

    Wells Fargo Recruitment and Hiring Requirements :

    a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

    b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.

    Create a job alert for this search

    Engineering Lead Platform • bangalore, India

    Related jobs
    • Promoted
    • New!
    Platform Engineer, SF Analytics and Integration

    Platform Engineer, SF Analytics and Integration

    Astellas Pharma Inc.bangalore, India
    Platform Development and Configuration : Design, develop, and configure business platforms to meet the specific needs of our organization. This could involve programming, configuring settings, and in...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Sr. Technical Engineer - Manage Engine Deployment

    Sr. Technical Engineer - Manage Engine Deployment

    POWER BRIDGE SYSTEMS PRIVATE LIMITEDbagalur, India
    Lead the deployment, configuration, and administration of Endpoint Management solutions such as Microsoft Endpoint Manager (Intune, SCCM), Manage Engine Endpoint Central, for a large and diverse de...Show moreLast updated: 4 hours ago
    • Promoted
    Lead Platform Engineer - Observability Services

    Lead Platform Engineer - Observability Services

    neemtreeBangalore
    Roles & Responsibilities : - Solution Packaging : Lead the end-to-end development of observability packages for 100+ standard technologies across infrastructure, d...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Lead Systems Engineer (DevOps & SRE)

    Lead Systems Engineer (DevOps & SRE)

    Epambangalore, India
    Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applic...Show moreLast updated: 4 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions in...Show moreLast updated: 17 days ago
    • Promoted
    Lead Platform Engineer - Site Reliability

    Lead Platform Engineer - Site Reliability

    NeemtreeBangalore
    Responsibilities : - Solution Packaging : Lead the end-to-end development of observability packages for 100+ standard technologies across infrastructure, data...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Lead Solutions Engineer

    Lead Solutions Engineer

    Netcore UnbxdBengaluru, Karnataka, India
    At Unbxd, we're building the world's largest search intelligence products.We are a bunch of close-knit, highly driven, and highly skilled engineers & Support Function teams who think big and execut...Show moreLast updated: 3 hours ago
    • Promoted
    • New!
    Sr. Systems Engineer, DNS, Platform Engineering

    Sr. Systems Engineer, DNS, Platform Engineering

    Netskopebangalore, India
    Today, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud...Show moreLast updated: 4 hours ago
    • Promoted
    Site Reliability Engineering Lead - Cloud Platform

    Site Reliability Engineering Lead - Cloud Platform

    Leap India Stack FoundationBangalore
    About the job : Position Purpose : At Brambles there is a need to make sure that platforms built on cloud hypervisors run smo...Show moreLast updated: 18 days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    XebiaBengaluru, Karnataka, India
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 8 days ago
    • Promoted
    Lead Platform Engineer - Observability Services

    Lead Platform Engineer - Observability Services

    Vunet SystemsBangalore
    Role : Lead Platform Engineer Experience : 7 -10 Years Education : B.Tech / Masters ...Show moreLast updated: 30+ days ago
    • Promoted
    Vulnerability Management - L3

    Vulnerability Management - L3

    ITC InfotechBengaluru, Karnataka, India
    On the portal where vulnerabilities are listed, each vulnerability must be analyzed;.Within each record of each vulnerability, analyze the required fixes and the vendor involved.Contact the vendor ...Show moreLast updated: 19 days ago
    • Promoted
    • New!
    Lead Engineer - DevOps & Platform Engineering

    Lead Engineer - DevOps & Platform Engineering

    Minderabangalore, India
    DevOps and Platform Engineering practice to power next-generation software delivery across GCP and Azure.Lead Engineer – DevOps & Platform Engineering. In this high-impact leadership role, you'll ow...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Specialist System Engineering

    Specialist System Engineering

    AT&Tbangalore, India
    Application Management (5-10 Years).Hands on enterprise application lifecycle management(On-prem, Cloud) experience.Application Deployment(DevOps),Maintenance, Monitoring and Performance Management...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Platform Engineer – Azure Monitoring

    Platform Engineer – Azure Monitoring

    5100 Kyndryl Solutions Private Limitedbangalore, India
    At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward – always pushing ...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    SRE Observability Architect

    SRE Observability Architect

    Virtusabangalore, India
    SRE Observability Architect - Description Experience : .Minimum 10 years of relevant work experience with monitoring setup using any product (Dynatrace, Datadog, ELK stack, Splunk, Grafana / Promethe...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Senior SRE Cloud (Site Reliability Engineering)

    Senior SRE Cloud (Site Reliability Engineering)

    Henkelbangalore, India
    A Cloud Site Reliability Engineering Engineer closely works with app developers to tide the cloud infrastructure to the application behavior or deployment like a software engineer.The close collabo...Show moreLast updated: 4 hours ago
    • Promoted
    • New!
    Senior Systems Engineer (DevOps & SRE)

    Senior Systems Engineer (DevOps & SRE)

    Epambangalore, India
    We are seeking a talented and motivated.Site Reliability Engineer (SRE).The SRE will play a crucial role in ensuring the Reliability, Scalability, Capacity Planning and performance of our infrastru...Show moreLast updated: 4 hours ago