Talent.com
This job offer is not available in your country.
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Session AIMumbai, Maharashtra, India
8 hours ago
Job description

Are you ready to make your mark with a true industry disruptor? ZineOne, a subsidiary of Session AI , the pioneer of in-session marketing, is looking to add talented team members to help us grow into the premier revenue tool for e-commerce. We work with some of the leading brands nationwide and we innovate how brands connect with and convert customers.

Job Description

This position offers a hands-on, technical opportunity as a vital member of the Site Reliability Engineering Group. Our SRE team is dedicated to ensuring that our Cloud platform operates seamlessly, efficiently, and reliably at scale. The ideal candidate will bring over five years of experience managing cloud-based Big Data solutions, with a strong commitment to resolving operational challenges through automation and sophisticated software tools.

Candidates must uphold a high standard of excellence and possess robust communication skills, both written and verbal. A strong customer focus and deep technical expertise in areas such as Linux, automation, application performance, databases, load balancers, networks, and storage systems are essential.

Key Responsibilities :

As a Session AI SRE, you will :

  • Design and implement solutions that enhance the availability, performance, and stability of our systems, services, and products.
  • Develop, automate, and maintain infrastructure as code for provisioning environments in AWS, Azure, and GCP.
  • Deploy modern automated solutions that enable automatic scaling of the core platform and features in the cloud.
  • Apply cybersecurity best practices to safeguard our production infrastructure.
  • Collaborate on DevOps automation, continuous integration, test automation, and continuous delivery for the Session AI platform and its new features.
  • Manage data engineering tasks to ensure accurate and efficient data integration into our platform and outbound systems.
  • Utilize expertise in DevOps best practices, shell scripting, Python, Java, and other programming languages, while continually exploring new technologies for automation solutions.
  • Design and implement monitoring tools for service health, including fault detection, alerting, and recovery systems.
  • Oversee business continuity and disaster recovery operations.
  • Create and maintain operational documentation, focusing on reducing operational costs and enhancing procedures.
  • Demonstrate a continuous learning attitude with a commitment to exploring emerging technologies.

Preferred Skills :

  • Experience with cloud platforms like AWS, Azure, and GCP, including their management consoles and CLI.
  • Proficiency in building and maintaining infrastructure on :
  • AWS using services such as EC2, S3, ELB, VPC, CloudFront, Glue, Athena, etc.
  • Azure using services such as Azure VMs, Blob Storage, Azure Functions, Virtual Networks, Azure Active Directory, Azure SQL Database, etc.
  • GCP using services such as Compute Engine, Cloud Storage, Cloud Functions, VPC, Cloud IAM, BigQuery, etc.
  • Expertise in Linux system administration and performance tuning.
  • Strong programming skills in Python, Bash, and NodeJS.
  • In-depth knowledge of container technologies like Docker and Kubernetes.
  • Experience with real-time, big data platforms including architectures like HDFS / Hbase, Zookeeper, and Kafka.
  • Familiarity with central logging systems such as ELK (Elasticsearch, LogStash, Kibana).
  • Competence in implementing monitoring solutions using tools like Grafana, Telegraf, and Influx.
  • Benefits

  • Comparable salary package and stock options
  • Opportunity for continuous learning
  • Fully sponsored EAP services
  • Excellent work culture
  • Opportunity to be an integral part of our growth story and grow with our company
  • Health insurance for employees and dependents
  • Flexible work hours
  • Remote-friendly company
  • Create a job alert for this search

    Site Reliability Engineer • Mumbai, Maharashtra, India

    Related jobs
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WebMD Health CorpMumbai, Maharashtra, India
    Internet Brands Company, is the leading provider of health information services, serving patients, physicians, health care professionals, employers, and health plans through our public and private ...Show moreLast updated: 7 hours ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Mumbai, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne Solutionsdombivli, maharashtra, in
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer 1

    Senior Site Reliability Engineer 1

    RELXMumbai, Maharashtra, India
    We are looking for a Senior DevOps / Site Reliability Engineer (SRE) with 7+ years of experience to join our high-performing engineering team. This role is pivotal in building scalable systems, redu...Show moreLast updated: 8 hours ago
    • Promoted
    • New!
    Senior Site Reliability Engineer I

    Senior Site Reliability Engineer I

    RELXMumbai, Maharashtra, India
    LexisNexis Risk Solutions is looking for a Senior SRE / DevSecOps Engineer to join our collaborative and innovative SRE team. In this role, you’ll help design, build, and maintain secure, scalable s...Show moreLast updated: 8 hours ago
    • Promoted
    Sr Site Reliability Engineer

    Sr Site Reliability Engineer

    Media.netMumbai, Maharashtra, India
    Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaMumbai, IN
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 29 days ago
    • Promoted
    Reliability Engineer and Planning Engineer

    Reliability Engineer and Planning Engineer

    JobTravia Pvt. Ltd.Mumbai, IN
    Reliability / Planning Superintendent.Lead reliability and maintenance planning across the processing plant to ensure safe, efficient, and cost-effective operations. Drive continuous improvement, asse...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordThane, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 21 days ago
    • Promoted
    • New!
    Site Reliability Engineer III

    Site Reliability Engineer III

    RELXMumbai, Maharashtra, India
    We are seeking a Site Reliability Engineer (SRE) with experience in Azure and a track record of success in cloud migration project initiatives. The successful candidate will help design and coordina...Show moreLast updated: 7 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftKalyan-Dombivli, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersThane, IN
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 27 days ago
    • Promoted
    Site Reliability Engineer - Observability Services

    Site Reliability Engineer - Observability Services

    TeamWare SolutionsMumbai
    Role Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong focus on observability.The ideal candidate will have 5-8 years of experie...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer II

    Senior Site Reliability Engineer II

    RELXMumbai, Maharashtra, India
    DevOps / Site Reliability Engineer (SRE).Whether your background is software engineering or SRE-focused, what matters most is your ability to automate, optimize, and improve systems through smart scr...Show moreLast updated: 8 hours ago
    • Promoted
    • New!
    Reliability system engineer

    Reliability system engineer

    Anicalls (Pty) LtdMumbai, Maharashtra, India
    Experience with commonly used services,.Experience with Infrastructure as code.Terraform to define infrastructure standards for cloud services. Ability to use various technologies to host container ...Show moreLast updated: 8 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HaysMumbai, Maharashtra, India
    Required skills and qualifications.Experience : Proven experience in technical support or engineering, preferably in AI / ML / GenAI environments. Technical Proficiency : Expertise in GenAI models (e.GPT,...Show moreLast updated: 27 days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebianavi mumbai, maharashtra, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 11 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ISS | Institutional Shareholder ServicesMumbai, Maharashtra, India
    Senior Site Reliability Engineer.Working hours (10 AM IST to 7 PM IST).This role expects rotational on-call support 24X7. This role is critical in ensuring the reliability, scalability and performan...Show moreLast updated: 8 hours ago