Career Area :
Technology Digital and Data
Job Description :
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar yourejoining a global team who cares not just about the work we do but also about each other. We are the makers problem solvers and future world builders who are creating stronger more sustainable communities. We dontjust talk about progress and innovation here we make it happen with our customers where we work and live. Together we are building a better world so we can all enjoy living in it.
Job Posting Title
Manager Software Engineering
Job Description Summary
As a manager you will be a process technology and results oriented team member for Operations to deliver top notch service quality and metrics for Cat Digital data Platform and applications.
You will fit this role if you can
Think about systems - edge cases failure modes behaviors specific implementations.
Debug production issues across services and levels of the stack.
Make monitoring and alerting alert on symptoms and not on outages.
Have an enthusiastic go-for-it attitude. When you see something broken you cant help but fix it.
Have an urge to collaborate and communicate asynchronously.
Have an urge for delivering quickly and iterating fast.
Basic Qualifications :
- Bachelors degree preferably in Computer Science Software Engineering or any other Engineering field.
- 12 years with AWS DevOps with Production Support expertise.
Key Responsibilities :
Lead Tier-2 and Tier-2.5 across platform and application support ensuring timely resolution of day to day operational issues.Responsible for driving operational excellence streamlining support processes and mentoring the Tier-2 and Tier-2.5 team to ensure high service levels and continuous improvement.Maintain ownership of Tier-2 scope of work and ensure alignment with ITSM standards.Work closely with product owners development teams and subject matter experts to ensure seamless support handoffs and knowledge retention.Participate in sprint planning with the development team to stay informed of upcoming features and proactively prepare support strategies.Knowledge of the decision-making process and associated tools and techniques ability to accurately analyze situations and reach productive decisions based on informed judgement.Understanding of effective communication concepts tools and techniques; ability to effectively transmit receive and accurately interpret ideas information and needs through the application of appropriate communication behaviors.Monitor and troubleshoot production systems to identify and resolve performance scalability and reliability issues proactively.Work closely with developers to identify and fix bugs and performance bottlenecks in the application codeContinuously evaluate systems and processes to identify areas for improvement and implement changes as neededCollaborate with cross-functional teams to define and document operational processes best practices and procedures.Meeting SLO SLA SLIs defined in the Operations modelSetting task prioritization and troubleshoot to closure of incidentsImprove Service observability.Proactively testing the flexibility and resilience of the system.People management Effectively manages the team by providing technical guidance and offering the right direction whenever neededTechnical Experience :
12 years prior experience in DevOps / Operations Support and / or application development teams.Hands on experience using large scale softwaredevelopment preferably in one of these languages : Java Python scripting languages is a mustKnowledge of CI / CD solution on any platform with prior experience is must.Expert knowledge of Infrastructure components. (E.g. routers load balancers cloud products container systems compute storage and networks).Deep experience on Key AWS services : EC2 S3 VPC Route 53 RDS CloudFormation EC2 DynamoDB (NoSQL) Lambda logging / CloudWatch IAM Certificate Manager ELB EBS ECS CloudFront / WAF SQS SNS SES.Expertise in tools like Prometheus Grafana AppDynamics CloudWatch and Thousand Eyes for system health monitoring and alertingHands on experience on Docker and at least one Docker Container orchestration ECS KubernetesExpertise with configuration Management tools like Ansible / Puppet / Chef / PowerShell or Terraform.Skills in diagnosing and resolving production issues conducting root cause analysis and writing postmortemsExperience in ITSM tools like ServiceNow and incident response protocolsITIL Foundation V3 certification added advantage plus knowledge and process in ITSM ITIL is a mustPrecision in monitoring alerting and writing reliable code and continuous ImprovementA mindset geared toward reducing toil automating repetitive tasks and improving system reliabilityKnowledge on Azure Cloud an added advantage.Expertise inELKMonitoring Tool that ensure Open-Source IT monitoring network monitoring server and applications monitoring is an added advantage.Understanding of Restful API Apigee or any other API Gateway will be plus.Expertise with Git Bitbucket Jira Jenkins Sonar Splunk Maven AIM and / or Continuous Delivery tools.Soft Skills & Operational Mindset and Problem Solving & ResilienceAbility to think critically under pressure and resolve complex issues calmlyStrong interpersonal skills to work across engineering product and operations teamsPosting Dates :
November 14 2025 - November 27 2025
Caterpillar is an Equal Opportunity Employer. Qualified applicants of any age are encouraged to apply
Not ready to apply Join our Talent Community.
Required Experience :
Manager
Key Skills
Hospitality Experience,Go,Management Experience,React,Redux,Node.js,AWS,Mechanical Engineering,Team Management,Leadership Experience,Mentoring,Distributed Systems
Employment Type : Full-Time
Experience : years
Vacancy : 1