Job Title : DevOps & Site Reliability Engineer
Experience : 4+ Yrs
Qualification : B.E. / B.Tech / M.E. / M.SC IT / MCA
Duties and responsibilities :
- Engage, Improve, develop, measure, and implement processes and tools for Continues Integrations and Delivery, Site Reliability Engineering, and automation of deployment and support of products into the cloud.
- Coordinate with product, development, support, operations, quality, and management teams to ensure installation and integration efforts are automated and well documented.
- Deliver industry-leading public cloud CI / CD pipelines using cutting-edge cloud-native technologies.
- Identify and analyze production events and outcomes to improve SaaS application reliability, operability, and customer experience and lead the SLI / SLO / SLA efforts.
- Develop and improve production site monitoring and application performance insights tools to improve MTTD and MTTR and bring operational excellence via automation.
- Identify operational efficiencies, potential outages, and security threats in Development, UAT, Staging, and Production environments and provide Incident response and resolution support for infrastructure with product teams, Software, and QA engineers.
- Participate in operational on-call support rotations, triaging and resolving issues and requests.
- Ensure delivered product has passed all required gates such as QA and customer acceptance testing.
- Participate in software testing efforts and provide feedback to the development community.
- Provide technical expertise and ownership in diagnosing and resolving issues, including the determination and provision of workaround solutions or escalation to service owners.
Minimum Qualifications :
Experience in SRE / DevOps delivering continuous integration / continuous delivery tools, REST API deployment, containerization, IaaS / PaaS, public / private cloud, data pipelines, application observability, and monitoring using site reliability engineering methodology. Knowledge of Linux system administration and automation.Experience creating, executing, and troubleshooting complex, automated processes to configure servers and / or deploy code.Experience in building CI / CD pipelines for large-scale applications on AWSExperience with AWS resources.Experience with infrastructure as code tools, such as Terraform or CloudFormation.Experience with build automation and continuous integration and continuous delivery ecosystem : using GitHub, AWS CI / CD pipeline or any tools like Jenkins, GitLab, etc., and deployment management tools like Packer, Ansible, Salt, Chef, and / or Puppet.Experience with monitoring and troubleshooting solutions using cloud-native tools, New Relic, and Alert Logic.Experience in migrating & supporting the applications on Docker / Kubernetes.Experience programming skills in Unix, scripting, and automation.Experience in release management and improving rollback solutions for deployments.Experience working in a fast-paced, collaborative environment utilizing the efficiencies of an agile, IterateFast mentality in a remote and multi-time-zone environment.
Exceptional collaboration and communication skills across distributed teams.Experience in clinical research or Healthcare tech is strongly preferred.Experience in HIPAA / HITECH / HITRUST / PHI / PII and PCI DSS working environment.Preferred Qualification :
Demonstrated expertise in DevOps concepts and the SRE lifestyle.Experience managing a large infrastructure on AWS.Ability to understand the application requirements (Performance, Security, Scalability, etc.) and assess the suitable services / topology on AWS Deploy & Support applications.Familiarity with security benchmarks such as OWASP, PCI-DSS, CIS, SOC2, FedRAMP, etc.Ability to troubleshoot application-specific, core network, system & performance issues.Solid Experience in deployment, automatic, and delivery of large-scale, durable, and reliable full-stack SaaS and mobile applications in a cloud environment.Excellent written and verbal communication skills with the ability to present complex technical information clearly and concisely to various audiences.Comfortable multi-tasking and working as part of a global team, as well as working independently andtaking ownership
A deep understanding of the software development lifecycle and a track record of shipping software on time.(ref : hirist.tech)