Talent.com
This job offer is not available in your country.
Site Reliability Engineer

Site Reliability Engineer

Alter DomusHyderabad, IN
16 days ago
Job description

ABOUT US

We are Alter Domus. Meaning “The Other House” in Latin, Alter Domus is proud to be home to 85% of the top 30 asset managers in the alternatives industry, and more than 5,000 professionals across 23 countries.

With a deep understanding of what it takes to succeed in alternatives, we believe in being different. Invest yourself in the alternative, and join an organization where you progress on merit, where you can speak openly with whoever you are speaking to, and where you will be supported along whichever path you choose to take.

Find out more about life at Alter Domus at careers.alterdomus.com

Job Description :

DevOps Engineer - Site Reliability Engineering (Observability)

We are looking for an experienced and motivated DevOps Engineer to join our Site Reliability Engineering (SRE) team . This role involves spearheading the Grafana Cloud and Backstage implementations as part of our Observability project. The ideal candidate will bring a blend of technical expertise in observability tools, strong problem-solving skills, and a passion for creating efficient, reliable systems.

Key Responsibilities :

  • Configure and manage data sources, including Prometheus and Azure Monitor, to build dashboards in Grafana.
  • Collaborate with DevOps engineers, system administrators, and software developers to understand monitoring requirements and design robust observability solutions.
  • Customize and extend Grafana functionalities by developing and implementing plugins and scripts.
  • Enhance visualizations for observability solutions to meet organizational needs.
  • Optimize dashboard performance and usability by fine-tuning data queries.
  • Troubleshoot and resolve issues related to Grafana configuration, data ingestion, and visualizations.
  • Participate in the administration, maintenance, and development of observability tools, including Grafana and ELK stack.
  • Troubleshoot network communication problems and ensure smooth operations.
  • Support Backstage implementation to enhance developer experience within the organization.

Required Skills :

  • Familiarity with Event Management and Application Monitoring concepts.
  • Experience in building and enhancing visualizations for observability solutions.
  • Proficiency with observability tools such as Grafana , Prometheus , Dynatrace , Splunk , Azure Monitor , or AWS CloudWatch .
  • Expertise in scripting with one or more of the following languages : Unix Shell , Windows PowerShell , JavaScript , Python , or Go .
  • Strong problem-solving and analytical skills, with the ability to troubleshoot complex network communication issues.
  • Hands-on experience with the administration, maintenance, and development of Grafana or ELK stack.
  • Minimum of 5-7 years of domain experience in monitoring or related fields.
  • Comfortable working with both Windows and Linux command lines.
  • Excellent communication and collaboration skills, with the ability to work effectively within a team and interact with stakeholders.
  • Core / Must-Have Skills

  • Observability Subject Matter Expertise (SME)
  • Prometheus
  • Azure Monitor
  • Grafana
  • Open Telemetry
  • Good-to-Have Skills

  • Proficiency in Unix Shell, Windows PowerShell, JavaScript, Python, or Go.
  • Familiarity with Backstage implementation.
  • Experience troubleshooting network communication problems.
  • What We Offer

  • An opportunity to work with cutting-edge technologies in observability and developer experience.
  • A collaborative and dynamic team environment.
  • The chance to make an impact on critical infrastructure and monitoring solutions.
  • If you are passionate about observability, reliability, and working with advanced tools like Grafana and Backstage, we’d love to hear from you!

    WHAT WE OFFER :

    We are committed to supporting your development, advancing your career, and providing benefits that matter to you.

    Our industry-leading Alter Domus Academy offers six learning zones for every stage of your career, with resources tailored to your ambitions and resources from LinkedIn Learning.

    Our global benefits also include :

  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, birthday leave
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • Support with mental, physical, emotional and financial support 24 / 7 from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location
  • Equity in every sense of the word

    We are in the business of equity, in every sense of the word. For us, this means taking action to ensure every colleague has equal opportunity, valuing every voice and experience across our organisation, maintaining an inclusive culture where you can bring your whole self to work, and making Alter Domus a workplace where everyone feels they belong.

    We celebrate our differences, and understand that our success relies on diverse perspectives and experiences, working towards shared goals and a common purpose. Thanks to the work of our Group DE&I Committee and network of DE&I Champions, we empower all of our people to be truly invested in the alternative.

    We are committed to ensuring an inclusive recruiting and onboarding process. Please contact our hiring team if you require any accommodations to make our recruitment process more accessible for you.

    LI-HYBRID

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad, IN

    Related jobs
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HighRadiusHyderabad, Telangana, India
    Design, implement, and maintain scalable cloud infrastructure primarily on AWS, with some exposure to Azure.Manage and optimize CI / CD pipelines using Jenkins and Git-based version control systems (...Show moreLast updated: 13 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    trellixINDIA
    Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work.Our comprehensive, GenAI-powered platform helps organizations confronted by todays most advanced threats g...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    NatWest GroupINDIA
    Join us as a Site Reliability Engineer.In this key role, youll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change manage...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Alignity SolutionsHyderabad, Telangana, India
    Do you love a career where you Experience.If so we are excited to have bumped onto you.Learn how we are redefining the.Clients Job-seekers and Employees. If you are a Site Reliability Engineer.We ar...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Anicalls (Pty) LtdHyderabad, India
    Mentor teammates on SRE best practices and guide technical direction.Work closely with the product engineering team to rapidly deliver capabilities. Automate and optimize developer pipelines.Build m...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    GSPANNHyderabad, IN
    Description GSPANN is hiring a Site Reliability Engineer with to ensure high availability and performance of critical systems using tools like Prometheus and Nagios. The role involves developing rel...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    VistexHyderabad, Telangana, IND
    The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on r...Show moreLast updated: 16 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Talent WorxHyderabad, TS, IN
    Quick Apply
    Site Reliability Engineer (SRE).At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of o...Show moreLast updated: 30+ days ago
    • Promoted
    Assurant - Site Reliability Engineer

    Assurant - Site Reliability Engineer

    AssurantHyderabad
    Role : Staff Engineer-Site Reliability Engineering, Assurant, GCC-India This job is responsible for basic administration, support, planning, implementation and monit...Show moreLast updated: 19 days ago
    Site Reliability Engineer III

    Site Reliability Engineer III

    McDonalds in IndiaHyderabad, India
    One of the worlds largest employers with locations in more than 100 countries, McDonalds Corporation has corporate opportunities in Hyderabad. Our global offices serve as dynamic innovation and oper...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Concentrix CatalystHyderabad, IN
    Senior Site Reliability Engineer.Remote (may need to travel to nearby Concentrix office as per business need).Minimum Experience required : 8+ Years. Stakeholder Management Working with key technolog...Show moreLast updated: 19 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Whitefield CareersHyderabad, Telangana, India
    Required Education, Experience, Skills.Bachelor's degree in computer engineering, computer science or related field.Extensive experience with windows operating systems, IIS (Internet Information Se...Show moreLast updated: 19 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    HTC Global ServicesHyderabad, Telangana, India
    Positions available in Hyderabad (.GCP- GKE Google Kubernetes Engine.Datadog, Dynatrace or similar tools.Python or Any Scripting languages. If interested in the above requirement, please reply with ...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer - Splunk

    Site Reliability Engineer - Splunk

    Talent500Hyderabad
    What you will do : - Ensure key stakeholders, product owners, and platform owners are informed of reliability concerns...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Softensity IncHyderabad, Telangana, India
    Senior Site Reliability Engineer (SRE).US-based IT outsourcing company with global software teams.We are headquartered in Atlanta, GA, USA with development teams in LATAM, Eastern Europe and Türkiy...Show moreLast updated: 17 hours ago
    Site Reliability Engineer

    Site Reliability Engineer

    Unison Consulting Pte LtdHyderabad, TS, IN
    Quick Apply
    Experience with supporting Java (J2EE / Spring Boot) based multi-tier applications with complex upstream downstream interactions having expertise in understanding the application request flow and ana...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ABC FitnessHyderabad, Telangana, India
    ABC is the trusted provider to boost performance and create a total fitness experience for over 41 million members of clubs of all sizes whether a multi-location chain, franchise or an independent ...Show moreLast updated: 19 hours ago
    • New!
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Trigent Software Private LimitedTS, India
    Quick Apply
    We are seeking an experienced Senior Site Reliability Engineer (SRE) with 6+ years of hands-on experience to join our fast-paced and growing team. As an SRE, you will play a pivot...Show moreLast updated: 11 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    noonHyderabad, IN
    Job Title : Site Reliability Engineer.In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' onlin...Show moreLast updated: 19 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Synopsys IncHyderabad, Telangana, India
    Site Reliability Engineering, Sr Staff.The Engineering Excellence Group drives innovation velocity and enterprise infrastructure automation, which are critical elements of our growth and scaling st...Show moreLast updated: 17 hours ago