Cloud Site Reliability Engineer

Ford Motor
Chennai, Tamil Nadu, India
2 days ago
Job description

Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability Engineer!

Enterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine it. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and create vehicles as smart as you are.

As an SRE at Ford, you'll be instrumental in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll work at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!

Responsibilities

  • Write, configure, and deploy code that improves service reliability for existing or new systems; set the standard for others with respect to code quality.
  • Provide helpful and actionable feedback and review for code or production changes.
  • Drive repair / optimization of complex systems with consideration towards a wide range of contributing factors.
  • Lead debugging, troubleshooting, and analysis of service architecture and design.
  • Participate in on-call rotation.
  • Write documentation: design, system analysis, runbooks, and playbooks. Provide design feedback and uplevel the design skills of others.
  • Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
  • Work within GCP infrastructure, optimizing performance and cost and scaling resources to meet demand.
  • Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
  • Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
  • Troubleshoot and resolve issues in our dev, test, and production environments.
  • Participate in postmortem analysis and create preventative measures for future incidents.
  • Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
  • Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
  • Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
  • Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
  • Contribute to internal knowledge bases and documentation.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, Mathematics, or equivalent experience.
  • 3 years of experience as an SRE, DevOps Engineer, Software Engineer, or similar role.
  • Strong experience with cloud infrastructure.
  • Proficient with monitoring and observability tools, particularly OpenTelemetry or comparable tools.
  • Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
  • Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
  • Experience with relational and document databases.
  • Ability to debug, optimize code, and automate routine tasks.
  • Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
  • Excellent verbal and written communication skills.

Key Skills

Kubernetes, FMEA, Continuous Improvement, Elasticsearch, Go, Root Cause Analysis, Maximo, CMMS, Maintenance, Mechanical Engineering, Manufacturing, Troubleshooting

Employment Type: Full-Time

Experience: years

Vacancy: 1
