Talent.com
Site reliability engineer
No longer accepting applications
Sails Software Inc • Vizag, Andhra Pradesh, India
2 days ago
Job description

SRE - AWS

Job Summary

We are looking for an experienced and driven Senior Site Reliability Engineer (SRE) to architect, implement, and maintain robust cloud infrastructure. This role demands a deep understanding of AWS, Kubernetes, and ECS, and the ability to build scalable, secure, and highly available infrastructure from scratch. The ideal candidate will be a strong advocate for DevOps principles, automation, and reliability, and will possess the skills to support and optimize complex microservices-based architectures.

Key Responsibilities

Infrastructure Design & Implementation

  • Design and build highly scalable, fault-tolerant, and secure cloud infrastructure using AWS, Kubernetes, and ECS.
  • Lead efforts in infrastructure as code (IaC) using tools like Terraform or CloudFormation.
  • Develop and enforce best practices for infrastructure provisioning, security, and cost optimization.

System Reliability & Performance

  • Ensure availability, performance, scalability, and security of production systems.
  • Implement observability strategies including monitoring, logging, and alerting using tools such as Prometheus, Grafana, ELK, or Datadog.
  • Analyse system performance metrics and proactively identify potential issues and bottlenecks.

DevOps & Automation

  • Build and maintain CI/CD pipelines to streamline code deployments across environments.
  • Drive automation in infrastructure provisioning, configuration management, and operational tasks.
  • Ensure repeatable and reliable deployments using containers and orchestration tools like Kubernetes and ECS.

Service Management

  • Own the SRE lifecycle, including incident management, postmortems, root cause analysis, and runbook creation.
  • Collaborate closely with development and QA teams to ensure seamless microservices integration, deployment, and lifecycle management.
  • Maintain service-level objectives (SLOs), service-level agreements (SLAs), and error budgets.

Security & Compliance

  • Implement and enforce cloud security best practices for networking, identity and access management, and data protection.
  • Support audits, compliance assessments, and vulnerability remediation.
  • Monitor for security anomalies and work with security teams to respond to threats.

Technical Skills

  • 6+ years of hands-on experience in Site Reliability Engineering, DevOps, or Cloud Engineering.
  • Expertise in AWS services such as EC2, S3, RDS, IAM, VPC, Lambda, and CloudWatch.
  • Strong knowledge of Kubernetes and container orchestration best practices.
  • Experience managing services on Amazon ECS (Fargate or EC2).
  • Proficient in infrastructure-as-code tools like Terraform, CloudFormation, or Pulumi.
  • Skilled in scripting languages such as Python, Bash, or Go.
  • Solid grasp of networking, load balancing, DNS, and firewall rules in cloud environments.
  • Deep understanding of microservices architectures, API gateways, and service meshes.

Soft Skills

  • Proven leadership and cross-functional collaboration skills.
  • Strong problem-solving and incident-resolution mindset.
  • Clear communication, documentation, and stakeholder reporting abilities.
  • Passion for continuous improvement and automation.

Preferred Qualifications

  • AWS certifications such as AWS Certified DevOps Engineer, Solutions Architect – Professional, or equivalent.
  • Familiarity with service meshes like Istio or Linkerd.
  • Experience with serverless architectures and event-driven systems.
  • Knowledge of regulatory compliance (SOC 2, ISO 27001, GDPR) in cloud environments.

Skills – AWS Cloud, CI/CD, EC2, Kubernetes, Grafana, Datadog, Python

Key Responsibilities

Cloud Platform: GCP

  • Infrastructure Automation: Design, implement, and manage infrastructure as code using Terraform to provision and manage GCP resources.
  • Container Orchestration: Deploy and manage Kubernetes clusters, ensuring efficient operation of containerized applications.
  • Continuous Integration / Continuous Deployment (CI/CD): Develop and maintain CI/CD pipelines using Jenkins to automate application build, test, and deployment processes.
  • Containerization: Collaborate with development teams to containerize applications using Docker and manage deployments with Helm charts.
  • Code Quality Assurance: Integrate and manage SonarQube to ensure code quality and security standards are met.
  • Monitoring and Logging: Implement and manage monitoring solutions using Datadog to ensure system health, performance, and security.
  • Collaboration: Work closely with cross-functional teams, including developers, QA, and operations, to streamline processes and improve productivity.

Requirements

  • Experience: 5+ years in DevOps or cloud engineering roles, with at least 3 years of relevant experience in the specified technologies.
  • Technical Proficiency:
    o Hands-on experience with GCP services and architecture.
    o Proficiency in Terraform for infrastructure as code implementations.
    o Strong understanding and experience with Kubernetes and Docker.
    o Experience in setting up and managing CI/CD pipelines using Jenkins.
    o Familiarity with Helm charts for application deployment.
    o Experience with SonarQube for code quality analysis.
    o Proficiency in monitoring and logging tools, particularly Datadog.
  • Scripting Skills: Proficiency in scripting languages such as Bash or Python is an added advantage.
  • Strong problem-solving abilities and analytical thinking.
  • Excellent communication skills, both verbal and written.
  • Ability to work collaboratively in a team environment.
  • Strong organizational and time management skills.

Skills – Terraform, Kubernetes, Cluster, Docker, GCP, Sonar
