Talent.com
Site Reliability Engineer

Xebia • India
6 hours ago
Job description

Performance & Reliability Engineer (Senior, Lead, Principal & Manager)

Hybrid

Location : Pune, Chennai, Bangalore & Gurgaon

Immediate joiners only.

Job Overview :

We are seeking a highly skilled and motivated Performance & Reliability Engineer to join our team. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. You will use tools such as Dynatrace, CloudWatch, and Python to monitor and optimize system performance, troubleshoot issues, and improve the overall reliability of our infrastructure by applying SRE best practices.

Key Responsibilities :

  • Performance Monitoring & Optimization :
      • Use Dynatrace and CloudWatch to monitor system performance and availability.
      • Implement performance tuning techniques to ensure high availability and optimal system performance.
      • Identify performance bottlenecks and optimize applications and infrastructure for scalability.
  • System Observability :
      • AppDynamics and monitoring dashboards.
      • Collaborate with development and operations teams to troubleshoot incidents and provide recommendations for performance improvements.
      • Proactively identify areas of risk and implement preventive measures.
  • Automation & Scripting :
      • Develop automation scripts in Python to enhance monitoring, incident response, and reporting processes.
      • Write and maintain Python-based tools for proactive monitoring, alerting, and issue resolution.
  • Cloud Monitoring & Alerts :
      • Configure CloudWatch for real-time monitoring and alerting of cloud infrastructure.
      • Develop and manage dashboards to visualize system health and performance metrics.
      • Prepare and present performance reports, incident post-mortems, and improvement recommendations to senior leadership.
  • Chaos Engineering, Fault Management :
      • Vulnerability identification, failure simulation, stress management.

Required Skills and Experience :

  • Strong experience with Dynatrace for application performance monitoring and root cause analysis.
  • Proficiency in CloudWatch for monitoring AWS cloud infrastructure, configuring alerts, and visualizing metrics.
  • Solid understanding of Python for automating tasks, building performance tools, and writing scripts to enhance operations.
  • Experience in analyzing system logs, troubleshooting performance issues, and providing technical recommendations.
  • Hands-on experience with cloud environments (AWS preferred), including development knowledge.
  • Experience with load testing and performance benchmarking.
About Xebia :
