Talent.com
Site Reliability Developer 3

Oracle, India
12 hours ago
Job description

The Network Reliability Engineering (NRE) team is accountable for the robustness of the Oracle Cloud Network Infrastructure. A Network Reliability Engineer (NRE) applies an engineering approach to measuring and automating network reliability so that it aligns with the organization's service-level objectives, agreements, and goals. The team's duties include promptly responding to network disruptions, pinpointing the underlying cause, and collaborating with internal and external stakeholders to fully restore functionality. NRE team members also play a critical role in automating recurring tasks in daily operations to streamline processes, improve workflow efficiency, and increase overall productivity.

Because OCI is a cloud-based network with a global footprint, this support covers hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, Clos network, and the Internet. Responsibilities include designing, writing, and deploying network monitoring and automation software to improve the availability, scalability, and efficiency of Oracle products and services.

Requirements:

  • Bachelor's degree in CS or a related engineering field with 5+ years of Network Engineering experience, or Master's with 5+ years of Network Engineering experience.
  • Experience working in a large ISP or cloud provider environment.
  • Experience working in a network operations role.
  • Strong knowledge of protocols such as MPLS, BGP, IPv6, DNS, DHCP, and SSL. Knowledge of VXLAN and EVPN is an added advantage.
  • Deep understanding of data center build and design, including Clos architecture.
  • Extensive experience with scripting or automation and data center design. Python is preferred, but candidates must demonstrate expertise in a scripting or compiled language.
  • Experience with network monitoring and telemetry solutions.
  • Hands-on experience with Prometheus or another network monitoring software stack.
  • Experience with network modeling and programming: YANG, OpenConfig, NETCONF.
  • Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways.
  • Capable of working under limited supervision.
  • Excellent organizational, verbal, and written communication skills.
  • Excellent judgment in influencing product roadmap direction, features, and priorities.
  • Bachelor's or Master's degree in Computer Science, Electrical / Hardware Engineering, or a related field.
  • Participate in an on-call rotation.

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and / or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.

Support the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI). The role is primarily focused on the development and support of network fabric and systems, combining a deep understanding of networking at the protocol level with programming skills. Because OCI is a cloud-based network with a global footprint, this support covers hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, Clos network, and the Internet.

  • Collaborate with program / project managers to develop milestones and deliverables.
  • Primarily use existing procedures and tools to develop and safely execute network changes, and contribute to developing new procedures from time to time.
  • Develop solutions to enable front line support teams to act on network failure conditions.
  • Mentor junior engineers.
  • Participate in the network solution and architecture design process.
  • Participate in operational rotations as either primary or secondary on-call.
  • Provide break-fix support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis.
  • Frequently develop scripts to automate routine tasks for the team and business units.
  • Coordinate with networking automation services for the development and integration of support tooling.
  • Coordinate with network monitoring to gather telemetry and create alert rules from it.
  • Build dashboards that represent data at various network layers and device roles to help identify network issues and anomalies.
  • Serve as SME on software development projects for network automation and network monitoring.
  • Collaborate with network vendor technical account team and internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and / or operating systems.

Career Level - IC3
