At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better, too. Join us and build an exceptional experience for yourself, and a better working world for all.
JD for Resilience And Reliability M :
Resilience And Reliability Engineering Architect (M)
Description
- Resilience & Reliability are fundamental to ensure modern architectures are available, performant and fault aware.
- A Resilience & Reliability Architect / Consultant will help in designing the roadmap to achieve Resilience for Enterprise IT
- They will also implement various SRE Solutions across the enterprise / line-of-businesses
- They will be able to assess the Reliability Maturity of an IT Organization and provide strategy and roadmap to achieve higher maturity levels
Responsibilities
Defining SLA / SLO / SLI for a product / serviceEngineering in resilient design and implementation practices into solutions as they go through the product life cycleDesigning & implementing Observability Solutions to track, report, and measure SLA adherenceEngineering out manual effort (Toil) through the development of automated processes and services (, Automated Management of Systems, CI / CD improvements)Optimize Cost of IT Infra & Operations - FinOpsTypical Skills and Background
12+ years of experience in software product engineering principles, processes and systemsHands-on experience in Java / J2EE, one of web server (Apache Tomcat or IBM HTTP Server), one of the application servers (Tomcat / WebSphere), and any major RDBMS like OracleHands-on experience in at least one CI-CD (Azure DevOps, GitLab CI / CD, Jenkins) and IaC tools (Terraform, AWS CloudFormation, Ansible etc.)Experience in at least one cloud technology (AWS / Azure / GCP etc. and Docker, Pivotal, Kubernetes, OpenShift etc.) and its reliability tools (Azure AppInsight, CloudWatch, Azure Monitor etc.)Experience in Observability - APM tools (Dynatrace, AppDynamics etc.), metrics / log consolidation (Splunk) and ELK StackExperience in Linux (RHEL) operating system performance monitoring parameters and their interpretation, commands used for monitoringExperience in Web Services, SOA, ESB (DataPower), RESTFulDefining NFRs and SLA / SLO / SLI agreement for a product / platform / servicesKnowledge on queuing models used, thread pools, request servicing processes etc.Knowledge of application design patterns, J2EE application architectures, Microservices, Spring boot & Cloud native architecturesProficiency in Java runtimes, Core Java, Garbage collection, JVM parameters tuningExperience in performance tuning on Application Servers (Tomcat / WAS)Experience in trouble shooting Performance / Scalability / Availability issuesThread dump, heap dump generation & analysisKnowledge on Query tuning and database architectureKnowledge at least one automation scripting language like PythonMastery of collaborative software development using Git, Jira, Confluence etc.AI / ML & Data Analytics knowledge and experience is a desirableEY | Building a better working world