Talent.com
Site Reliability Engineer

Softensity Inc • Hyderabad, Telangana, India
1 day ago
Job description

Senior Site Reliability Engineer (SRE)

Who We Are?

Softensity is a US-based IT outsourcing company with global software teams. We are headquartered in Atlanta, GA, USA with development teams in LATAM, Eastern Europe and Türkiye. When you have better teams, you build better software. Let’s do this together!

The Opportunity

Location: Hyderabad, India (Hybrid)

Hiring Method: Contractor

The Role

Key Responsibilities:

Reliability and Performance Management

  • Design, implement, and maintain highly available, scalable, and resilient cloud-native architectures for mission-critical SaaS products.
  • Develop and implement SLOs, SLIs, and SLAs to measure and improve service reliability.
  • Continuously optimize system performance and resource utilization across multiple cloud platforms.
  • Fine-tune and optimize application performance by analyzing code, traces, and database queries.

Incident Management and Troubleshooting

  • Lead incident response efforts, effectively troubleshooting complex issues to minimize downtime and impact.
  • Reduce Mean Time to Recover (MTTR) through proactive monitoring, automated alerting, and efficient problem-solving techniques.
  • Conduct thorough Root Cause Analysis (RCA) for all major incidents and implement preventive measures.

Observability and Monitoring

  • Design and implement end-to-end observability solutions across our distributed systems.
  • Develop and maintain comprehensive monitoring strategies using tools such as the ELK Stack, Prometheus, and Grafana.
  • Create and optimize product status dashboards to provide real-time visibility into system health and performance.

Automation and Infrastructure as Code (IaC)

  • Implement Infrastructure as Code practices using tools like Terraform.
  • Develop and maintain automated deployment pipelines and CI/CD workflows.
  • Create self-healing systems and automate routine operational tasks to reduce manual intervention.

Cloud-Agnostic Architecture

  • Design and implement cloud-agnostic solutions that can operate efficiently across multiple cloud providers.
  • Develop expertise in event-driven architectures and related technologies (e.g., Apache Kafka / Event Hubs, Redis, MongoDB Atlas, IoT Hub).
  • Implement and manage containerized applications using Kubernetes across different cloud environments.

Continuous Improvement

  • Regularly review and refine operational practices to enhance efficiency and reliability.
  • Stay updated with the latest industry trends and technologies in SRE, cloud computing, and DevOps.
  • Contribute to the development of internal tools and frameworks to support SRE practices.

Main Qualifications

  • 7+ years of experience in cloud infrastructure management and optimization.
  • Solid experience with automating infrastructure processes.
  • Hands-on technical expertise in CI/CD pipeline implementation.
  • Strong background in microservices and Docker.
  • Mid-level experience supporting the deployment of Java or .NET applications.
  • Expertise in cloud platforms (AWS, Azure, GCP) and their associated services.
  • Strong understanding of networking concepts, load balancing, and security practices.

Why Join Us

We are passionate about top-quality talent and about giving our employees the tools they need to keep growing and learning.

The sky is truly the limit, and we want you to feel challenged and motivated in every project you are part of, all while working with cutting-edge technologies and amazing clients.

What to expect?

  • Measurable goals
  • Remote work
  • Paid time off
  • Coursera Credentials