YASH Technologies is a leading technology integrator specializing in helping clients reimagine operating models, enhance competitiveness, optimize costs, foster exceptional stakeholder experiences, and drive business transformation.
At YASH, we’re a cluster of the brightest stars working with cutting-edge technologies. Our purpose is anchored in a single truth – bringing real positive changes in an increasingly virtual world and it drives us beyond generational gaps and disruptions of the future.
We are looking forward to hire Python Professionals in the following areas :
Job description :
Job Title : Senior Data Engineer / DevOps - Enterprise Big Data Platform
In this role, you will be part of a growing, global team of data engineers, who collaborate in DevOps mode, to enable business with state-of-the-art technology to leverage data as an asset and to take better informed decisions.
The Enabling Functions Data Office Team is responsible for designing, developing, testing, and supporting automated end-to-end data pipelines and applications on Enabling Function’s data management and analytics platform (Palantir Foundry, AWS and other components).
The Foundry platform comprises multiple different technology stacks, which are hosted on Amazon Web Services (AWS) infrastructure or own data centers. Developing pipelines and applications on Foundry requires :
- Proficiency in SQL / Scala / Python (Python required; all 3 not necessary)
- Proficiency in PySpark for distributed computation
- Proficiency in Ontology, Slate, Familiarity with Workshop App basic design / visual competency
- Familiarity with common databases (e.g. Oracle, mySQL, Microsoft SQL). Not all types required
This position will be project based and may work across multiple smaller projects or a single large project utilizing an agile project methodology.
Roles & Responsibilities :
Tech / B.Sc. / M.Sc. in Computer Science or related field and overall 6+ years of industry experienceStrong experience in Big Data & Data AnalyticsExperience in building robust ETL pipelines for batch as well as streaming ingestion.Experience with Palantir FoundryMost important Foundry apps : Code Repository, Data Lineage and Scheduling, Ontology Manager, Contour, Object View Editor, Object Explorer, Quiver, Workshop, VertexExperience with Data Connection, external transforms, Foundry APIs, SDK and Webhooks is a plusInteracting with RESTful APIs incl. authentication via SAML and OAuth2Experience with test driven development and CI / CD workflowsKnowledge of Git for source control managementAgile experience in Scrum environments like JiraExperience in visualization tools like Tableau or Qlik is a plusExperience in Palantir Foundry, AWS or Snowflake is an advantageBasic knowledge of Statistics and Machine Learning is favorableProblem solving abilitiesProficient in English with strong written and verbal communicationPrimary Responsibilities
Responsible for designing, developing, testing and supporting data pipelines and applicationsIndustrialize data pipelinesEstablishes a continuous quality improvement process to systematically optimize data qualityCollaboration with various stakeholders incl. business and ITEducation
Bachelor (or higher) degree in Computer Science, Engineering, Mathematics, Physical Sciences or related fieldsProfessional Experience
6+ years of experience in system engineering or software development4+ years of experience in engineering with experience in ETL type work with databases and Hadoop platforms.Skills
Hadoop General|Deep knowledge of distributed file system concepts, map-reduce principles and distributed computing. Knowledge of Spark and differences between Spark and Map-Reduce. Familiarity of encryption and security in a Hadoop cluster.Data management / data structures|Must be proficient in technical data management tasks, i.e. writing code to read, transform and store dataXML / JSON knowledgeExperience working with REST APIs.Spark Experience in launching spark jobs in client mode and cluster mode. Familiarity with the property settings of spark jobs and their implications to performance.Application Development- Familiarity with HTML, CSS, and JavaScript and basic design / visual competencySCC / Git Must be experienced in the use of source code control systems such as GitETL Experience with developing ELT / ETL processes with experience in loading data from enterprise sized RDBMS systems such as Oracle, DB2, MySQL, etc.Authorization Basic understanding of user authorization (Apache Ranger preferred)Programming Must be at able to code in Python or expert in at least one high level language such as Java, C, Scala.Must have experience in using REST APIsSQL Must be an expert in manipulating database data using SQL. Familiarity with views, functions, stored procedures and exception handling.AWS General knowledge of AWS Stack (EC2, S3, EBS, …)IT Process Compliance|SDLC experience and formalized change controlsWorking in DevOps teams, based on Agile principles (e.g. Scrum)ITIL knowledge (especially incident, problem and change management)Languages Fluent English skills.Specific information related to the position :
Physical presence in primary work location (Bangalore)Flexible to work CEST and US EST time zones (according to team rotation plan)Willingness to travel to Germany, US and potentially other locations (as per project demand)At YASH, you are empowered to create a career that will take you to where you want to go while working in an inclusive team environment. We leverage career-oriented skilling models and optimize our collective intelligence aided with technology for continuous learning, unlearning, and relearning at a rapid pace and scale.
Our Hyperlearning workplace is grounded upon four principles
Flexible work arrangements, Free spirit, and emotional positivityAgile self-determination, trust, transparency, and open collaborationAll Support needed for the realization of business goals,Stable employment with a great atmosphere and ethical corporate culture