Site Reliability Engineer

noon • Hyderabad, IN
21 days ago
Job description

Job Title: Site Reliability Engineer

About noon

noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.

noon operates without boundaries; we are aggressively and voraciously ambitious. Starting in 2017 with noon.com, the region’s homegrown e-commerce platform and leading online shopping destination, noon is now a digital ecosystem of products and services - noon, noon Food, Noon in Minutes, NowNow, SIVVI, noon One, and noon Pay.

At noon we have the courage to pursue what seems impossible, we work hard to get things done, and we go to great lengths to ensure a stellar experience for everyone, from our customers to our sellers and noon Bandidos. Above all, we are grateful for the opportunities we have. If these values resonate with you, you will enjoy this incredible journey with us!

Job Description

As a Site Reliability Engineer (SRE) at noon payments, you will play a crucial role in maintaining and enhancing the reliability, availability, and performance of our cloud-based infrastructure and services.

You will be responsible for automating deployments, optimizing systems, and ensuring seamless performance across our platforms. This position requires a strong foundation in cloud infrastructure management, particularly with Azure (AKS) and GCP (GKE), alongside hands-on experience with Azure DevOps and monitoring tools like Datadog.

You will:

  • Cloud Infrastructure Management: Manage and optimize cloud environments across Azure and GCP (AKS and GKE), ensuring efficient resource utilization, high system availability, and scalability.
  • Infrastructure as Code: Use Terraform for infrastructure provisioning, ensuring consistent and scalable deployments, and manage infrastructure via Azure DevOps pipelines.
  • Configuration Management: Implement and manage system configurations using Ansible to ensure consistency and streamline updates across different environments.
  • Continuous Integration / Continuous Deployment (CI/CD): Develop, maintain, and optimize CI/CD pipelines within Azure DevOps to automate testing and deployment processes, reducing time from development to production.
  • Monitoring and Observability: Set up and maintain comprehensive monitoring and observability solutions using Datadog to track system health and performance and to detect issues proactively.
  • Container Orchestration: Deploy, manage, and optimize Kubernetes clusters to support scalable and resilient application deployments.
  • Incident Management: Participate in a 24/7 on-call or roster-based team to respond to incidents, conduct root cause analysis, and implement solutions to minimize downtime and ensure system reliability.
  • Performance Tuning: Continuously monitor system performance, identify bottlenecks, and implement optimizations to improve efficiency and response times.
  • Capacity Planning: Plan and manage system capacity to ensure resources meet current and future demands, enabling seamless service delivery.
  • Collaboration: Work closely with Network Operations Center (NOC) and DevOps teams to troubleshoot issues, optimize deployment processes, and drive continuous improvement.
  • Documentation: Create and maintain detailed documentation for system configurations, deployment processes, and incident reports.

Skill Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related discipline, or equivalent experience.
  • Cloud, ITIL, CKA certifications are a plus.
  • 6+ years of directly related or relevant experience, preferably in information security.
  • Extensive experience with cloud platforms such as Azure, GCP, and Huawei Cloud.
  • Proficiency with Terraform for infrastructure automation and Ansible for configuration management.
  • Hands-on experience with Kubernetes for container orchestration, mainly AKS and GKE.
  • Expertise in monitoring and observability tools such as Datadog.
  • Familiarity with Azure VMSS and GCP MIG for virtual machine scaling and management.
  • Experience in a 24/7 on-call or roster-based team environment, focusing on system uptime and incident response.
  • Strong understanding of SRE processes and best practices for system reliability, availability, and performance.
  • Excellent problem-solving skills and the ability to handle complex technical issues under pressure.
  • Effective communication skills and a collaborative approach to working with diverse teams.
  • Experience with payment gateway projects or similar high-transaction systems is preferred.
  • Additional knowledge in advanced monitoring techniques, performance tuning, and capacity planning is a plus.

Who will excel?

We’re looking for candidates who thrive in a fast-paced, dynamic start-up environment. We’re searching for problem solvers, people who operate with a bias for action and have a deep understanding of the importance of resourcefulness over reliance.

Candor is our only default. Demanding unequivocal high standards should be non-negotiable because quality matters. We want people who are radically candid, cohorts who commit to settling for nothing but the best - in hiring, in accepting work from colleagues, and in your own work.

Ours is not an easy mission, but it is a meaningful one. Every hire must actively raise the bar of talent in the company to help us reach our vision.
