Job Title : Data Engineer
Location : [Remote]
Reporting to : VP Client Success
Employment Type : Full-Time
About Us
Datastruct accelerates third-party data. We help companies discover, test, and productionalize external (third-party) data - quickly, reliably, and safely.
We’re a lean, high-performance company building data infrastructure and tooling for a small but growing client base. With a footprint spanning 6 enterprise clients and over 50 third-party data vendors, our operations are complex, dynamic, and evolving fast.
About the Role
As a Data Engineer at Datastruct, you will be powering the latest technology from leading insurance and fintech institutions around the world. Using innovative data sets you will use your expertise and creativity to build best-in-class solutions. You will see projects through from start to finish, assisting in every stage from testing to integration. This is a fully remote position working with engineers, client success, and product across the globe.
Key Responsibilities
- Developing python data solutions to solve customer data integration challenges
- Building platform features that increase the efficiency and quality of our deliverables
- Collaborating with project management, and sales teams to solve clients' business and technical problems
- Performing data appends, extracts, and analysis to deliver curated data to clients
- Understanding wide arrays of data provider landscapes including consumer, business, property, and digital data
- Opportunities to work on entity detection, record linking, and NLP projects will also be available
- Transforming one-time projects into repeatable code for production
- Rolling out changes to production use cases in close coordination with clients' IT teams
- Other tasks as necessary
Required Qualifications
Computer Science, Data Science, or related engineering degree (and / or commensurate work experience); Master's degree preferred3 - 5 years of Python programming (with Pandas experience)Experience with CSV, JSON, parquet, Avro, and other common formatsData cleaning and structuring (ETL experience)Knowledge of API (REST and SOAP), HTTP protocols, API Security and best practicesExperience with SQLExperience with GitExperience with AirflowInterest or experience in machine learning and data modelingWhy Join Us
Own a high-impact domain at the core of how our product creates value.Work closely with founders and leadership on strategic decisions.Join a nimble, high-performing team tackling complex data infrastructure challenges - and automate the boring stuff while you're at it.