Talent.com
This job offer is not available in your country.
Software Engineer - Site Reliability

Software Engineer - Site Reliability

Workdaychennai, India
9 hours ago
Job description

About the Role

You will be a key contributor on the Analytics Delivery Engineering team. As an SRE, you will help in building a clean, scalable, reliable, and automated services framework.

The Delivery Engineering team for Workday Analytics is a team of engineers for engineers. We get the opportunity to support a lot of different groups including back-end, front-end, SDETs, etc. We spend our time writing software and manage how we deliver, deploy, monitor and everything in-between.

What You Will Do

Work on projects ranging from building out robust CI / CD to automation for both the Prism engineering team and SRE team

Work on enhancements and improvements for monitoring, alerting, and tracing of not only internal services but most importantly, production services.

Collaborate with others on the SRE team to help set technical direction while ensuring requirements are met.

Help enhance, build, and maintain internal infrastructure running in the public cloud as well as orchestration across the Workday Private Cloud.

Interact and be the primary contact with multiple teams both internally in Workday Prism Analytics alongside external Workday teams

Participate in and facilitate production on-call duties and events to ensure reliability of the analytics applications and play a pivotal role in building the foundation that delivers Workday Analytics to the cloud.

About You

Basic Qualifications :

3+ years experience working in Unix / Linux from kernel to shell, file systems, client-server protocols, etc.

2+ years coding experience and can utilize various languages (We focus and build tooling and automation using Python, GoLang and Java.)

Other Qualifications :

Experience in designing, analyzing, and troubleshooting large-scale distributed systems built on technologies like Spark, YARN, Hadoop, Kubernetes

Experience building infrastructure and tooling in the cloud and using managed services where possible, we focus on AWS

Working knowledge of building immutable services and functions utilizing Docker, Kubernetes and Serverless frameworks (AWS Lambda, API Gateway)

Working knowledge of building Highly Available, Scalable, Reliable multi-tenanted big data applications on Cloud (AWS, GCP) and / or Data Center architectures.

Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.

Create a job alert for this search

Site Reliability Engineer • chennai, India