Responsibilities :
- Work with product team on the shared full stack ownership of a collection of services and / or technology areas
- Understand the end-to-end configuration, technical dependencies, and overall behavioural characteristics of production services
- Responsible for the mitigating critical customer incidents, or deployments or testing required to improve security, performance, availability, and scalability of service
- Authority for end-to-end performance and operability
- Partner with development teams in meeting SLA to unblock customers
- Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio
- Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack
- Demonstrate clear understanding of automation and orchestration principles
- Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs)
- Utilise a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations
- Understand and explain the effect of product architecture decisions on distributed systems
- Professional curiosity and a desire to a develop deep understanding of services and technologies
Required Skills :
6+ years overall experience in IT industryMinimum 4 years of experience as a Sys Admin / SupportStrong systems architecture skillsStrong Linux administration (Understanding of different Hardware family)Virtualisation TechnologiesScripting Language (Python / Bash / Shell etc, basic understanding of Java / Go will be good to have)Understanding of Networking, Cloud Computing, Load BalancersHands on experience at Monitoring / Instrumentation tools (Prometheus / Grafana, new relic, elastic or equivalent).Experience with maintaining high scale deployments, managing high throughput and IO intensive services.Strong knowledge of system configuration tools such as Chef, Terraform, GIT, Jenkins / Hudson, ArtifactoryContinuous Integration development / deployment, e.g. Docker, KubernetesSkills Required
Linux Administration, Python, Prometheus, Grafana, Terraform, Docker