We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams, from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will believe that automation is a key component of operating large-scale systems.
6-Month Accomplishments
Familiarize yourself with the Poshmark tech stack and functional requirements.
Get comfortable with the automation tools and frameworks used within the CloudOps organization and with the associated deployment processes.
Gain in-depth knowledge of the relevant product functionality and the infrastructure it requires.
Start contributing by working on small- to medium-scale projects.
Join the on-call rotation as a secondary to become familiar with the on-call process.
12+ Month Accomplishments
Execute projects independently with little guidance from the lead.
Create meaningful alerts and dashboards for the various subsystems involved in the targeted infrastructure.
Identify gaps in the infrastructure and suggest or implement improvements.
Get involved in the on-call rotation.
Responsibilities
one or more of our Internet-facing services.
facilitate our rapid iteration and constant growth.
applications in a large-scale UNIX environment.
"operability" in mind.
Desired Skills
required, ideally in a startup or fast-growing company.
management with Ansible, systems monitoring and alerting with tools such as Nagios, New Relic, and Graphite.
Technologies we use:
tools.
Please note that Poshmark will not be able to sponsor a work-related visa for this position.
Site Reliability Engineer • India