DevOps / Site Reliability Engineer - Configuration Management Tools

QUARKS TECHNOSOFT PRIVATE LIMITED, Bangalore
18 days ago

Description :

  • Design and implement robust monitoring, alerting, and reliability solutions for infrastructure.
  • Support and manage diverse platforms, including but not limited to service / microservice-based architectures, Oracle ATG, ReactJS, and hybrid environments (on-premise and cloud).
  • Develop and maintain tooling to enable shorter release cycles, while improving performance, supportability, and scalability of the ecommerce platform.
  • Collaborate with cross-functional teams for delivery of large-scale, complex projects, often involving multiple internal and external stakeholders.
  • Assist in provisioning environments and software necessary for various programs and projects.
  • Optimize and manage infrastructure-related costs effectively.
  • Maintain a high level of security across infrastructure and applications.
  • Collaborate with development teams to improve CI / CD workflows and pipelines.

Qualifications & Skills :

Education & Experience :

  • Bachelor's or Master's degree in Engineering or a related field.
  • 4 to 8 years of experience in software development, infrastructure setup on cloud platforms, build / release engineering, CI / CD, and configuration / change management.

Technical Proficiency :

  • Strong experience in Linux / Unix administration.
  • Hands-on experience with relational databases such as PostgreSQL.
  • Practical knowledge of Docker and related tools : Cassandra, Rancher, Kubernetes
  • Exposure to configuration management tools such as : Ansible, Chef, Puppet, Terraform
  • Experience with cloud technologies, especially Microsoft Azure.
  • Familiarity with monitoring and alerting tools : TICK Stack, ELK Stack, Nagios, PagerDuty
  • Understanding of cloud networking concepts : Subnets, Routing Tables, Security Groups (or equivalents).
  • Experience with container networking.
  • Ability to design, implement, and test Disaster Recovery plans.
  • Familiarity with Access Control management in infrastructure environments.

Desirable / Nice-to-Have :

  • Experience in both on-premise and cloud infrastructure environments.
  • Prior exposure to the eCommerce domain.
  • Experience with distributed systems and messaging technologies (e.g., NSQ, RabbitMQ, SQS).
  • Experience in scaling data stores (PostgreSQL, Scylla, Redis).

(ref : hirist.tech)
