Talent.com
No longer accepting applications
Lead Data Pipeline Engineer

Truecaller, Bengaluru, Republic Of India, IN
30+ days ago
Job description

Hello, Truecaller is calling you from Bangalore, India! Ready to pick up?

Our goal is to make communication smarter, safer, and more efficient, while building trust across the world. With our roots in Sweden and a global reach, we deliver smart services that create meaningful social impact. We are committed to protecting you from fraud, harassment, scam calls, and unwanted messages, so you can focus on the conversations that matter.

  • Top 20 most downloaded apps globally, and world’s #1 caller ID and spam-blocking service for Android and iOS, with extensive AI capabilities, with more than 450 million active users per month.
  • Founded in 2009, listed on Nasdaq OMX Stockholm and is categorized as a Large Cap. Our focus on innovation, operational excellence, sustainable growth, and collaboration has resulted in consistently high profitability and strong EBITDA margins.
  • An ambitious team of 400 people from ~45 different nationalities, spread across our headquarters in Stockholm and offices in Bangalore, Mumbai, Gurgaon and Tel Aviv.

We in the Insights Team are responsible for SMS Categorization, Fraud detection and other Smart SMS features within the Truecaller app. OTP and bank notifications, and bill and travel reminder alerts, are some examples of the Smart SMS features. The team has developed a patented offline text parser that powers all these features, and is also exploring cutting-edge technologies such as LLMs to enhance them. The team’s mission is to become the world’s most loved and trusted SMS app, which is aligned with Truecaller’s vision to make communication safe and efficient. Smart SMS is used by over 90M users every day.

As a Senior Data Engineer, you will play an important role in developing data pipelines, frameworks and models that help us understand our users and make better product decisions. You will help empower the product teams with a complete self-serve analytics platform by working on scalable and robust solutions while collaborating with data engineers, data scientists and data analysts across the company.

What you bring in:

  • 6+ years of experience as a Data Engineer
  • Hands-on experience with Airflow for managing workflows and building complex data pipelines in a production environment.
  • Experience working with big data and ETL development.
  • Strong proficiency in SQL and experience working with relational databases
  • Programming skills in PySpark, Spark with Scala, Apache Spark, Kafka, or Flink.
  • Experience working with cloud computing services (e.g., GCP, AWS, Azure).
  • Experience with Data Science workflows.
  • Experience in data modeling and creating data lakes using GCP services like BigQuery and Cloud Storage.
  • Expertise in containerization and orchestration using Docker and Kubernetes (GKE) for scaling applications and services on GCP.
  • Build data models and transformations using DBT following software engineering best practices (modularity, testing).
  • Version control experience with Git and familiarity with CI/CD pipelines (e.g., GitHub Actions).
  • Strong understanding of data security, encryption, and GCP IAM roles to ensure privacy and compliance (especially in relation to GDPR and other regulations).
  • Experience in ML model lifecycle management (model deployment, versioning, and retraining) using GCP tools like AI Platform, TensorFlow Extended (TFX), Kubeflow, or Vertex AI.
  • Experience in working with Data Analysts and Scientists in building Systems in Production.
  • Excellent problem solving and communication skills both with peers and experts from other areas.
  • Self-motivated and have a proven ability to take initiative to solve problems.
The impact you will create:

  • Design, develop, and maintain scalable data pipelines to process and analyze large data sets in real-time and batch environments.
  • Play a crucial role in the team and own ETL pipelines.
  • Collaborate with data scientists, analysts, and stakeholders to gather data requirements, translate them into robust ETL solutions, and optimize the data flows.
  • Implement best practices for data ingestion, transformation, and data quality to ensure data consistency and accuracy.
  • Develop, test, and deploy complex data models and ensure the performance, reliability, and security of the infrastructure.
  • Own the architecture and design of data pipelines and systems, ensuring they are aligned with business needs and capable of handling growing volumes of data.
  • Make data-driven decisions informed by past experience.
  • Monitor data pipeline performance and troubleshoot any issues related to data ingestion, processing, or extraction.
  • Work with big data technologies to enable storage, processing, and analysis of massive datasets.
  • Ensure compliance with data protection and privacy regulations, particularly in regions like the EU where GDPR compliance is essential.
It would be great if you also have:

  • Familiarity with event-driven architecture and microservices using Cloud Pub/Sub, Cloud Run, or GKE to build highly scalable, resilient, and loosely coupled systems.
  • Proficiency in backend programming languages like Go, Python, Java, or Scala specifically for building highly scalable, low-latency data services and APIs.
  • Hands-on experience in designing and implementing RESTful APIs or gRPC services for seamless integration with data pipelines and external systems.
  • Hands-on experience with GCP-native tools for advanced analytics, such as Looker, Data Studio, or BigQuery BI Engine, for building visualizations and reporting dashboards.
  • Knowledge of real-time data processing and analytics using Apache Flink, Kafka Streams, or Druid for ultra-low latency use cases.
  • Experience with data observability tools such as Monte Carlo, Databand.ai, or OpenLineage, ensuring the integrity and quality of data across pipelines.
  • Experience optimizing Cloud Storage, BigQuery partitioning, and clustering strategies for large-scale datasets, ensuring cost-effectiveness and query performance.
  • Domain knowledge in specific industries (e.g., telecom, calls, and message communication) where large-scale data pipelines and regulatory compliance are critical, allowing you to bring domain-specific expertise to complex challenges.
Life at Truecaller - Behind the code: https://www.instagram.com/lifeattruecaller/

    Sounds like your dream job?

    We will fill the position as soon as we find the right candidate, so please send your application as soon as possible. As part of the recruitment process, we will conduct a background check.

    This position is based in Bangalore, India.

    We only accept applications in English.

    What we offer:

  • A smart, talented and agile team: An international team where ~35 nationalities work together across several locations and time zones in a learning, sharing and fun environment.
  • A great compensation package: Competitive salary, 30 days of paid vacation, flexible working hours, private health insurance, parental leave, telephone bill reimbursement, a Udemy membership to keep learning and improving, and a wellness allowance.
  • Great tech tools: Pick the computer and phone that you fancy the most within our budget ranges.
  • Office life: We strongly believe in in-person collaboration and follow an office-first approach while offering some flexibility. Enjoy your days with great colleagues you can learn a lot from, daily lunch and breakfast, and a wide range of healthy snacks and beverages. In addition, every now and then check out the playroom for a fun break, or join our exciting parties and team activities such as Lab days and sports meetups. There is something for everyone!
  • Come as you are: Truecaller is diverse, equal and inclusive. We need a wide variety of backgrounds, perspectives, beliefs and experiences in order to keep building our great products. No matter where you are based, which language you speak, your accent, race, religion, color, nationality, gender, sexual orientation, age, marital status, etc. All those things make you who you are, and that’s why we would love to meet you.
