Apply SRE core tenets of measurement (SLI / SLO / SLA), eliminate toil, and reliability modeling
Enable and educate development teams on industry best practice design patterns, ways of working and operational knowledge to ensure platform continuity
Develop and architect solutions to infrastructure and operational aspects of new products and feature sets
Assist with go / no go preplanning, verification / validation, and review of existing and new product / services
Proactively analyze data and test the integrity of network / systems to ensure production applications and services are operating optimally
Work within development teams to troubleshoot and resolve business affecting issues
Escalations, incident response, RCA, and blameless postmortem
Participate in on-call rotation
Qualifications
At least 3 years of professional experience within a cloud / web / CDN scale infrastructure
Experience with Python and Go. C / C++ a plus
Expert knowledge of Linux systems, network programming and protocols TCP, UDP, DNS, TLS / SSL, HTTP
Experience with BGP and Anycast routing is a plus
Experience with DevOps principles and concepts such as Infrastructure as Code (Ansible / Saltstack), CI / CD (Gitlab, Jenkins, Git), monitoring and visualization (Prometheus, Grafana)
Experience with big data technologies such as NoSQL / RDBMS, Redis, ElasticSearch, Kafka
Experience with containers and container management (Docker, Kubernetes)
Experience analyzing and building data telemetry, modeling, pipelines, UI visualization
Experience in developing software, troubleshooting, and monitoring large scale distributed systems
Implement software engineering best practices / standards and software development life cycle
Working knowledge and experience of Agile software development methodologies
Create a job alert for this search
Senior • kanpur, India
Related jobs
Promoted
Sr Engineer, Site Reliability [T500-21295]
TMUS Global Solutionskanpur, uttar pradesh, in
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 1 day ago
Promoted
Lead Sustenance Engineer - Storage
DDNKanpur, IN
This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades.
DataDirect Networks (DDN) is a globa...Show moreLast updated: 30+ days ago
Promoted
Senior Networking Software Developer - SONiC / SAI
ACL Digitalsonic, uttar pradesh, in
ACL Digital is actively hiring for experienced.Senior Networking Software Developer -SONiC / SAI Architecture with strong networking operating system development background.Job Requirement - Senior S...Show moreLast updated: 30+ days ago
Promoted
Senior MLOps Engineer
Mitchell Martin Inc.Kanpur, IN
Include, but are not limited to, the following : .Own productionizing models—from tracked experiments to governed releases—ensuring resilient services with clear SLOs, runbooks, and fast, safe rollba...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer / Senior Cloud Engineer
CloudHirekanpur, uttar pradesh, in
The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture.Repo...Show moreLast updated: 2 days ago
Promoted
Full Stack Engineer
MerilKanpur (division)
Job Title : Full stack Developer.Location : Kanpur (New Technopark @ IIT Kanpur).We are seeking a motivated and detail-oriented Full Stack Developer to join our dynamic team.The ideal candidate will ...Show moreLast updated: 1 day ago
Promoted
Senior Site Reliability Engineer- ELK Expert
iVedha Inc.Kanpur, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone.
Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
Promoted
New!
Site Reliability Engineer (SRE) – Infrastructure & Automation
InstaServiceKanpur, Uttar Pradesh, India
About InstaService InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly.
We’re growing fast across 23+ ...Show moreLast updated: 17 hours ago
Promoted
Sr Systems Engineer Linux – AI Infrastructure
DC Tech ConsultingKanpur, IN
Position : Senior Linux Administrator – AI / ML Infrastructure.We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises...Show moreLast updated: 30+ days ago
Promoted
Sr. / Software Engineer
BrightEdgeKanpur, IN
BrightEdge is a global leader in enterprise SEO and content performance solutions, driving AI-powered digital marketing success for the world’s top brands.
Our culture is product-first, innovation-d...Show moreLast updated: 30+ days ago
Promoted
Senior Solutions Engineer
Zeotapkanpur, uttar pradesh, in
Founded in Berlin in 2014, Zeotap started with a mission to provide high-quality data to marketers.As we evolved, we recognized a greater challenge : helping brands create personalized, multi-channe...Show moreLast updated: 30+ days ago
Promoted
Civil Site Supervisor
Best Infosystems Ltd.Rura, Uttar Pradesh, India
Job Opening : Civil Site Supervisor / Junior Engineer – Finishing, Plumbing, Electrical & Landscaping (School Building) Location : Kanpur Dehat Project : Final Furnishing & Completion of a 60,000 sq.S...Show moreLast updated: 30+ days ago
Promoted
Senior Site Reliability Engineer (Sre) – Datadog Observability
Jade GlobalKanpur, Republic Of India, IN
Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote.
Site Reliability Engineer (SRE).SRE...Show moreLast updated: 3 days ago
Promoted
Senior Site Reliability Engineer
IntraEdgeKanpur, IN
Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 16 days ago
Promoted
Senior Site Reliability Engineer
Nebula Tech SolutionsKanpur, Uttar Pradesh, India
At Nebula Tech Solutions , we’re building a high-performing SRE team supporting mission-critical applications for our US-based enterprise clients.
We’re now looking for engineers who can go beyond...Show moreLast updated: 3 days ago
Promoted
Senior Software Engineer DevRel
ApplicantzKanpur, IN
THIS IS A LONG TERM CONTRACT POSITION WITH ONE OF THE LARGEST, GLOBAL, TECHNOLOGY LEADER.Partner with application teams to.
Accelerate application onboarding.Troubleshoot platform integration issues...Show moreLast updated: 1 day ago
Promoted
Senior Site Reliability Engineer (SRE) – Datadog Observability
Jade Globalkanpur, uttar pradesh, in
Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote.
Site Reliability Engineer (SRE).SRE...Show moreLast updated: 3 days ago
Promoted
Senior Distributed Systems Engineer
INDI Staffing Serviceskanpur, uttar pradesh, in
At INDI, we're passionate about empowering individuals and businesses worldwide.Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovati...Show moreLast updated: 2 days ago