Talent.com
Staff Site Reliability Engineer

Confidential • Chennai, India
3 days ago
Job description

We're looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams, from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component of operating large-scale systems.

6-Month Accomplishments

  • Become familiar with the Poshmark tech stack and functional requirements.
  • Get comfortable with the automation tools and frameworks used within the CloudOps organization and the associated deployment processes.
  • Gain in-depth knowledge of the relevant product functionality and the infrastructure it requires.
  • Start contributing by working on small- to medium-scale projects.
  • Understand and follow the on-call rotation as a secondary to become familiar with the on-call process.

12+ Month Accomplishments

  • Execute projects independently with little guidance from the lead.
  • Create meaningful alerts and dashboards for the various subsystems involved in the targeted infrastructure.
  • Identify gaps in infrastructure and suggest or implement improvements.
  • Get involved in the on-call rotation.

Responsibilities

  • Serve as a primary point of responsibility for the overall health, performance, and capacity of one or more of our Internet-facing services.
  • Gain deep knowledge of our complex applications.
  • Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth.
  • Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment.
  • Work closely with development teams to ensure that platforms are designed with 'operability' in mind.
  • Function well in a fast-paced, rapidly changing environment.
  • Participate in a 12x7 on-call rotation.

Desired Skills

  • 4+ years of experience in a Systems Engineering / Site Reliability Operations role is required, ideally in a startup or fast-growing company.
  • 4+ years in a UNIX-based large-scale web operations role.
  • 4+ years of experience providing 12x7 support for large-scale production environments.
  • Battle-proven, real-life experience running a large-scale production operation.
  • Experience working on cloud-based infrastructure, e.g., AWS, GCP, Azure.
  • Hands-on experience with continuous integration tools such as Jenkins, configuration management with Ansible, and systems monitoring and alerting with tools such as Nagios, New Relic, and Graphite.
  • Experience scripting / coding.
  • Ability to use a wide variety of open source technologies and tools.

Technologies we use:

  • Ruby, JavaScript, Node.js, Tomcat, Nginx, HAProxy.
  • MongoDB, RabbitMQ, Redis, Elasticsearch.
  • Amazon Web Services (EC2, RDS, CloudFront, S3, etc.).
  • Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible, and other DevOps tools.

Please note that Poshmark will not be able to sponsor a work-related visa for this position.

Skills Required

Nginx, Tomcat, Datadog, Elasticsearch, JavaScript, Docker, Terraform, Ruby, AWS, Node.js, Redis, Packer, UNIX, New Relic, Jenkins, RabbitMQ, GCP, HAProxy, Ansible, Graphite, MongoDB, Nagios, Azure, Kubernetes
