Who We Are Looking For
WebPT is seeking a TOC (Technology Operations Center) engineer to oversee our SaaS applications health and technology operational tasks. In this role, you will be responsible for monitoring the availability, performance, and capacity of our SaaS products. A successful Sr TOC engineer will follow SOP to assist with service recovery on Linux, Windows, and Cloud hosted applications, coordinate, collaborate, and tune monitoring-based alerts, participate in operational incidents, and become immersed in challenging technology problems. Additional responsibilities include working with the release management team to deploy applications and other operational changes into the environment.
What You'll Be Doing As A Part of Our Team
- Proactively monitor the availability and performance of infrastructure and SaaS application services hosted in the cloud and data center
- Serve as the first point of contact for incidents, escalations, and business-impacting technology issues
- Respond to monitoring events within defined SLAs and collaborate with partner teams to reach a resolution
- Manage operational incidents, communicate status updates, and facilitate incident bridges
- Monitor scheduled batch jobs (backups, data processing)
- Escalate operational tasks to partner teams as needed
- Administer the TOC dashboard, create, and update alerts
- Assist with root cause analysis and provide essential troubleshooting for networks, systems, and applications
- Write technical documentation, including incident details and root-cause analysis
- In addition to assisting the lead in teams progress, you will be responsible for identifying areas of improvement.
- Help with transition of newer tasks from partner teams to help grow TOC's scope.
- Support SRE and SysOps teams with deployments, patching, and backup activities
- Participate in 24x7 rotational shifts including on call and weekend coverage as required by business.
What You Should Have To Qualify
8+ years of experience working in a TOC, supporting a large US-based organization.3+ years of experience supporting a hosted application (SaaS) in some capacity.Experience using and configuring infrastructure and application performance monitoring tools like Dynatrace (preferred), Cloudwatch, Zabbix, or similar services.Knowledge of operating systems and backup tools (Windows / Linux).Experience working in hybrid-cloud environments.Exposure to cloud technology (AWS, Azure).Experience with application deployments and performing patch management.Exposure to AWS or Azure is a plus.Ideally, You Would Also Have These
Ability to conform to defined processesFollow defined SLAs for communications, responses, and schedulesGood verbal and written communication skills is a mustConversant will be Infrastructure and technology terms.Work in rotational shifts covering US and India business hours including handling oncallBasic understanding of information security best practicesGood critical thinking and decision-making skillExcellent time management, ability to juggle priorities while keeping organizedAbility to perform high-level troubleshooting of infrastructure and application issuesAble to multitask while staying focused on core operational prioritiesCulture is at our Core
Service : Create Raving FansAccountability : Follow Up; Own UpAttitude : Possess True GritPersonality : Be MintyWork Ethic : Be Rock SolidCommunity Outreach : Give BackHealth and Wellness : Live BetterResource Efficiency : Do Más With MenosAbout Us
Here, we work hard—but we have lots of fun doing it. We believe in equal opportunity for all, autonomy, trailblazing, and always doing right by our Members. Most importantly, though, we believe in empowering rehab therapy professionals to achieve greatness in practice. So, if you're a can-do kinda person who loves to help Members win and enjoys working from just about anywhere—then you'll fit right in. We've got big plans, but we can't achieve them without you. Join us, and let's achieve greatness.
Onsite
Skills Required
Cloudwatch, Patch Management, Windows, Zabbix, Cloud, Linux, Dynatrace, Azure, Aws