Description GSPANN is hiring Azure Data Engineers with expertise in Site Reliability Engineering (SRE) to optimize and automate large-scale data applications. The role involves ensuring system reliability and performance using Azure Data Factory, Databricks, Cosmos DB, and Power BI.
Role and Responsibilities
- Develop a deep understanding of the business and analyze the end-to-end customer journey.
- Collaborate with stakeholders to enhance the design, visibility, availability, scalability, and performance of services.
- Work closely with Data Engineering teams to implement necessary improvements and enhancements.
- Identify and escalate potential production-impacting issues proactively in collaboration with Engineering teams.
- Automate manual processes efficiently, conduct in-depth incident analysis, and drive blameless postmortems.
- Optimize alert management, decision-making, and performance analysis by leveraging standardized telemetry data.
- Assist in deployment, post-deployment monitoring, and dashboard / alert creation to track system changes effectively.
- Ensure adherence to critical company controls required for internal and external audit compliance.
- Develop value-proposition presentations, case studies, and accelerators to drive business impact.
Skills and Experience
8+ years of experience in software development, technical operations, and managing large-scale applications.7+ years of experience in developing or supporting Azure Data Factory (Application Programming Interface / API & API Management / APIM), Azure Databricks, Azure DevOps, Azure Data Lake Storage (ADLS), SQL, Synapse Data Warehouse, and Azure Cosmos DB.5+ years of hands-on experience in Data Engineering and coding.Experience in data virtualization products like Denodo is desirable.Holding an Azure Data Engineer or Solutions Architect certification is a plus.Work with container platforms such as Docker and Kubernetes, assessing applications / platforms periodically for architectural improvements.Apply strong troubleshooting skills to quickly identify and resolve application issues with minimal business impact.Handle high-volume, mission-critical applications with hands-on experience.Utilize expertise in IT tools, techniques, systems, and solutions to drive operational excellence.Lead triage calls with multiple technical stakeholders and communicate effectively across teams.Solve cross-functional issues creatively, adapting to changing priorities.Manage escalations effectively, taking full ownership of critical issues to ensure timely resolution.Basic knowledge of Data Science and Machine Learning (DSML), Artificial Intelligence / Machine Learning (AI / ML), and Machine Learning Operations (ML Ops).Familiarity with Azure Event Hub, IoT Hub, and Azure Application Insights.Knowledge of the IT Infrastructure Library (ITIL) framework and IT Service Management (ITSM) tools.Understanding of SAP HANA and willingness to work in rotational shifts.