Description :
In this role, we are seeking a data engineer to design, implement, and optimize cloud-based
data pipelines using Microsoft Azure services including ADF, Synapse, and :
- Collaborate with our business analytics and data science teams, gathering requirements and
delivering complete business intelligence solutions
Mentor junior software developers and build a strong teamModel data and metadata to support discovery, ad-hoc, and pre-built reportingDesign and implement data pipelines using Hadoop, Spark, and Azure services such as Blob Storage, SQL Database, Event Hubs, Data Factory, Synapse Analytics, and DatabricksShould be able to write Programs & Scripting, Strong in SQL, Proficiency in Python or Scala.Experience with PowerShell or Azure CLI for automation is a plusPartner with security, privacy, and legal teams to deliver solutions that comply with security and privacy policiesOwn the design, development, and maintenance of datasets our business analytics teams will use to drive key business decisionsDevelop and promote standard methodologies in data engineering, including scalability,reusability, maintainability, and usability
Tune and ensure compute performance by optimizing queries, databases, files, tables, Ensure data and report service level agreements are metAnalyze and solve problems at their root, stepping back to understand the broader contextOwn continuous engineering operational excellence of the datasets that drive key business decisionsLearn and understand a broad range of data resources and know when, how, and which to use and which not to useKeep up to date with advances in big data technologies and run pilots to design the dataarchitecture to scale with increased data volume using Azure
Continually improve ongoing reporting and analysis processes, automating or simplifyingself-service support for datasets
Triage many possible courses of action in a high-ambiguity environment, making use of bothquantitative analysis and business we seek in you! : Qualifications / Skills :
Bachelors degree in computer science or related technical field10+ years of experience in data architecture and business intelligence5+ years of experience in developing solutions in distributed technologies such as Hadoop, SparkExperience in delivering end-to-end solutions using Azure services Blob Storage, SQL Database, Event Hubs, Data Factory, Synapse Analytics, and HDInsightExperience in programming using Python, Java, or ScalaExpert in data modeling, metadata management, and data qualitySQL performance tuningStrong interpersonal and multitasking skills with the ability to balance competing prioritiesExcellent communication (verbal and written) and interpersonal skills and an ability to effectively communicate with both business and technical teamsAn ability to work in a fast-paced ambiguous environment where continuous innovation is occurringExperience with a business intelligence reporting Qualifications / Skills :Experience with Databricks for advanced analytics and data processingUnderstanding of well-architected data pipeline designsExpertise in monitoring and fault-tolerant pipeline designsKnowledge of cluster configuration for optimal performanceAbility to create cost-optimal solutionsExperience in exploratory data analysis (dashboarding, plotting) using machine learning technologies and algorithms is desirableGood knowledge of standard machine learning techniques (like regression, classification, anomaly detection, forecasting) by using standard machine learning libraries part of Spark, Python is desirablePrior experience in gen AI and related tools and techniques (such as large language models, prompt engineering) is desirable(ref : hirist.tech)