Talent.com
No longer accepting applications
Data Engineer II

ClearDemand, Vellore, India
5 days ago
Job description

Job Summary

  • Building on the foundation of the SDE-I role, the DE-II position takes on a greater level of responsibility and leadership.
  • You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, capable of handling terabyte-scale data and billions of data points.

Key Responsibilities

  • Lead the design, development, and optimization of large-scale data pipelines and infrastructures using technologies like Apache Airflow, Spark, Kafka, and more.
  • Architect and implement distributed data processing solutions to handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DigitalOcean).
  • Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies like Spark Streaming and Kafka.
  • Optimize data storage strategies using technologies such as Amazon S3, HDFS, and Parquet / Avro file formats for efficient querying and cost management.
  • Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance.
  • Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices.
  • Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment.
  • Participate in technical discussions, contribute to architectural decisions, and proactively identify improvements for scalability, performance, and cost-efficiency.
  • Ensure application performance monitoring (APM) is in place, using tools like Datadog, New Relic, or similar to proactively monitor and optimize system performance, detect bottlenecks, and maintain system health.
  • Implement effective data partitioning strategies and indexing for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase.
  • Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilities.

Qualifications & Experience

  • 4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines.
  • Expertise in AWS technologies (Athena, AWS Glue, DynamoDB), Apache Spark, PySpark, SQL, and NoSQL databases.
  • Experience in designing and managing distributed data processing systems that scale to terabyte-scale datasets and billions of records using cloud platforms like AWS, GCP, or DigitalOcean.
  • Proficiency in web crawling technologies, including Node.js, HTTP, Puppeteer, Playwright, and Chromium, for large-scale data extraction.
  • Experience with monitoring and observability tools such as Grafana, Prometheus, and Elasticsearch, plus familiarity with monitoring and optimizing resource utilization in distributed systems.
  • Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka.
  • Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC.
  • Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale.
  • Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments.
  • Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments.
  • Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems.
  • Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.