Talent.com
SRE (Site Reliability Engineering)

Confidential
Amritsar, India
6 days ago
Job description

Experience : 8+ years

Salary : Confidential (based on experience)

Shift : (GMT+05:30) Asia / Kolkata (IST)

Opportunity Type : Remote

Placement Type : Full time Permanent Position

(Note : This is a requirement for one of Uplers' clients - Forbes Advisor)

What do you need for this opportunity?

Must have skills required :

CDNs, Akamai, Cloudflare, AWS Certification, DevOps

Forbes Advisor is looking for :

Job Title : Staff Engineer - SRE

Forbes Advisor is a new initiative for consumers under the Forbes Marketplace umbrella that provides journalist- and expert-written insights, news and reviews on all things personal finance, health, business, and everyday life decisions. We do this by providing consumers with the knowledge and research they need to make informed decisions they can feel confident in, so they can get back to doing the things they care about most.

If you're looking for challenges and opportunities similar to those of a startup, with the benefits of a seasoned and successful company, then read on :

Responsibilities :

  • The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services.
  • They work with cross-functional teams to design, build and maintain systems and they troubleshoot issues when they arise. They bridge the gap between development and operations teams.
  • They work closely with business teams to define Service Level Objectives (SLOs) and Service Level Agreements (SLAs) for critical systems. They also monitor and maintain the uptime of these systems in line with the defined SLOs and SLAs.
  • They deploy and manage monitoring tools to gain insights on system health and performance.
  • They analyze performance, identify bottlenecks and implement solutions to improve a system's scalability and latency.
  • They develop scripts and implement tools and automation frameworks to reduce manual intervention in deployment, monitoring and scaling.
  • They work with development teams to design and develop observability practices such as logging, metrics and tracing. They aim to diagnose and troubleshoot issues proactively.
  • They create actionable alerts on monitoring systems to ensure rapid response to potential production incidents.
  • They forecast resource needs and provision adequately for current and future demand.
  • They design and execute 'chaos experiments' to test a system's resilience to failure.
  • They own, define and implement the Disaster Recovery (DR) processes for systems.
  • They also conduct planned and unplanned mock DR drills to test for response preparedness during production incidents.
  • They ensure that security best practices are followed and implemented during design and operations of systems.
  • They also own and maintain documentation of processes, playbooks, and systems.
  • They publish KPI reports and other system health updates on a regular basis to the business.
Requirements :

  • Must-have - Bachelor's degree, preferably in CS or a related field, or equivalent experience
  • Must-have - 12+ years of overall IT experience
  • Must-have - 7+ years of proven work experience as a Senior Site Reliability Engineer or in a similar position.
  • Must-have - 5+ years of AWS Cloud experience with an AWS certification (Certified DevOps Engineer, SysOps, Security, etc.).
  • Must-have - AWS experience - 3+ years of experience using a broad range of AWS technologies (e.g. EC2, RDS, ELB, S3, VPC, CloudWatch & Monitoring Tools) to develop and maintain an AWS-based cloud solution, with an emphasis on best-practice cloud security.
  • Must-have - 2+ years of experience with CDN and / or cache systems like Fastly, Akamai, CloudFront, etc.
  • Proven understanding of and strong experience with cloud deployments (AWS / Docker / Kubernetes)
  • Knowledge of IaC provisioning tools and languages such as Terraform, Chef, Ansible, shell, Groovy, Python, etc.
  • Experience with monitoring systems such as CloudWatch, New Relic, Datadog / Splunk, ELK stack.
  • Experience managing cloud network resources (AWS preferred) such as CloudWatch, VPC, URL proxies, private link, DNS, ACLs, firewalls, and C2S access points.
  • Platform or application engineering and operational knowledge of CI / CD tooling such as GitHub Actions, Jenkins, etc.
  • Experience with other tooling such as JIRA, Bitbucket, Jenkins, Fortify, SonarQube, Nexus, Nexus IQ
  • Experience with configuration automation tools like Puppet / Ansible / Chef / Salt
  • Scripting Skills : Strong scripting (e.g. Bash & Python) and automation skills.
  • Operating Systems : Windows and Linux system administration.
  • Problem Solving : Ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Strong attention to detail. Excellent verbal and written communication skills. Strong documentation skills.
Good To Have :

  • Experience with Terraform / Ansible / Chef / Puppet
  • Experience with GitHub Actions
  • Experience with CloudFront, Fastly
  • Oversees team members performing these functions.
  • Anticipates problems and future technical needs and takes the necessary steps to address them.
  • Works primarily in server-side technologies and is comfortable with client-side work whenever required.
  • Enthusiastically follows technology trends and software engineering best practices.
Perks :

  • Day off on the 3rd Friday of every month (one long weekend each month)
  • Monthly Wellness Reimbursement Program to promote health and well-being
  • Paid paternity and maternity leave

How to apply for this opportunity

  • Step 1 : Click on Apply, then register or log in on our portal.
  • Step 2 : Complete the Screening Form and upload your updated resume.
  • Step 3 : Increase your chances to get shortlisted & meet the client for the Interview!
About Uplers :

    Our goal is to make hiring reliable, simple, and fast. Our role is to help all our talents find and apply for relevant contractual onsite opportunities and progress in their careers. We will support you with any grievances or challenges you may face during the engagement.

    (Note : There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).

    So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

    Skills Required

    New Relic, Akamai, Chef, ELK Stack, Bash, Datadog, Jenkins, DevOps, CloudWatch, Docker, Terraform, Ansible, Splunk, Puppet, Python, Kubernetes, AWS
