Talent.com
This job offer is not available in your country.
Site Reliability Engineer - CI / CD Pipeline

Site Reliability Engineer - CI / CD Pipeline

CoreLogicNoida
30+ days ago
Job description

Job Responsibilities :

We are seeking a skilled Site Reliability Engineer to manage the day-to-day operations and performance of multiple critical applications in a dynamic, high-demand environment.

The ideal candidate will have hands-on experience with SQL databases, cloud platforms, and a range of other enterprise applications, combined with problem-solving skills and the ability to troubleshoot and resolve issues with minimal Description Production Support Processes and SLAs :

  • Document production support processes that encompass the full lifecycle of a delivery request through to the development team and a production release.
  • Support defined SLAs based on severity and work with DevOps and Engineering to meet those SLAs.

System and Application Deployments :

  • Plan and execute application and database deployments following established processes with adherence to Corporate Change Management standards.
  • Incident Management :

  • Participate in the troubleshooting, and resolution of production issues in real time with timely communication to affected parties.
  • Ensure that incidents are logged, tracked, and escalated as & Alerting :
  • Implement and optimize monitoring tools to proactively detect issues and ensure the health and performance of production Stability & Performance :
  • Work closely with the development, infrastructure, and operations teams to ensure the stability and scalability of production systems.
  • Recommend and implement improvements to increase system Cause Analysis (RCA) :
  • Contribute to post-incident reviews, drive root cause analysis efforts, and ensure that lessons learned are shared across teams.
  • Continuous Improvement :

  • Engage in continuous improvement efforts by identifying gaps in the support process and implementing best practices.
  • Optimize incident response times and overall system with Stakeholders :
  • Engage with business stakeholders, product owners, and other cross-functional teams to ensure effective communication and Management :
  • Maintain and update documentation for support procedures, system configurations, and incident management.
  • Create knowledge-based articles and ensure the team is well-trained on new systems and procedures.
  • On-Call Rotation :

  • Participate in on-call rotation for critical incidents, ensuring that production environments are supported 24 / 7 / 365.
  • Job Qualifications Skills & Qualifications :

  • Bachelors degree in computer science, Information Technology, or a related field.
  • 2+ years of experience in production support, system administration, or related technical roles with a focus on cloud-based systems management (GCP and Azure)
  • Proven experience in a production support or IT operation team.
  • Knowledge of incident management, system monitoring, and troubleshooting methodologies.
  • Understanding of production systems, system architectures, and distributed systems.
  • Hands-on experience with monitoring tools.
  • Familiarity with scripting languages (e.g., Python, Shell) for automation and troubleshooting.
  • Solid communication and interpersonal skills to engage with stakeholders.
  • Ability to work under pressure and manage incidents in a fast-paced production environment.
  • Proficiency in Windows / Linux / Unix environments and system administration.
  • Familiarity with CI / CD pipelines and tools (e.g., Jenkins, GitHub).
  • Hands-on experience with .NET Core, .NET Framework, Apache, IIS, PowerShell, and Python
  • for application support.

  • Ability to query SQL databases for application troubleshooting, reporting and deployments.
  • Additional technologies : JIRA, Confluence, Pager Duty, Uptrends, Teams, O365
  • ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Noida