Talent.com
Sr. Data Engineer

Confidential • Pune
30+ days ago
Job description
  • We are seeking a Data Engineer to help build and integrate a Generative AI-powered conversational assistant into our website and mobile app. This role is crucial in handling data pipelines, model training, and infrastructure setup to deliver a seamless, privacy-compliant experience for users seeking personalized health insights. The Data Engineer will work closely with our AI and software development teams to design scalable data solutions on Google Cloud Platform (GCP) to support this next-generation AI service.
  • Key Responsibilities

    • Data Integration & Pipeline Development: Design and implement data pipelines that supply knowledge base and user data for model training and fine-tuning, ensuring data quality, scalability, and efficiency.
    • Data Processing & Transformation: Develop data transformation processes to prepare data for Natural Language Processing (NLP) models, facilitating personalized and accurate health recommendations.
    • Privacy & Security Compliance: Ensure all data handling practices comply with privacy and security standards, focusing on user data protection during AI model training and deployment.
    • Infrastructure Setup & Management: Build and maintain foundational cloud infrastructure on GCP to host, deploy, and scale the assistant securely and efficiently across platforms.
    • Collaboration with AI & DevOps Teams: Partner with AI/ML and DevOps teams to fine-tune, test, and optimize NLP models for production, focusing on deployment performance and user experience.
    • Website & Mobile Integration Support: Work alongside frontend developers to ensure smooth data flow and integration between the backend, website, and mobile app.
    • Monitoring & Optimization: Implement monitoring, logging, and automated alerts to ensure data pipelines, model interactions, and infrastructure meet performance and reliability requirements.
  • Qualifications

    • Education: Bachelor's or Master's in Computer Science, Data Engineering, or a related field.
    • Experience:
      • 3+ years in data engineering, preferably within Generative AI or NLP-focused projects.
      • Hands-on experience with Google Cloud Platform (GCP), including BigQuery, Dataflow, and Cloud Storage.
      • Proven ability in data pipeline design and data transformations for AI model training.
    • Skills:
      • Strong programming skills in Python and familiarity with SQL.
      • Experience with DevOps tools (e.g., Kubernetes, Docker) and CI/CD pipelines in GCP.
      • Proficiency in data management practices, data privacy, and security protocols.
      • Familiarity with AI/ML workflows, specifically NLP model training and fine-tuning.
    • Nice to Have:
      • Experience working with Contentful or React Native integrations.
      • Knowledge of MLOps practices to support continuous model training and deployment.
    • Role: Data Engineer
    • Industry Type: IT Services & Consulting
    • Department: Engineering - Software & QA
    • Employment Type: Full Time, Permanent
    • Role Category: Software Development
    • Skills Required: SQL Queries, DevOps, Delta, Scala, Spark, Microsoft Azure, Data Validation, Troubleshooting, Monitoring
