Job Summary : We are seeking a highly skilled Database Software Engineer with 4-5 years of experience to join our team. The successful candidate will be responsible for designing, developing, and maintaining our database systems and ETL pipelines, ensuring high performance, scalability, and reliability. The Database Software Engineer will work closely with our development team to ensure that our database systems meet the requirements of our applications and services.
Key Responsibilities :
- Database Design and Development :
- Design and develop database architectures to support business requirements
- Develop and maintain database models, including entity-relationship diagrams and schema definitions
- Implement database security measures to ensure data integrity and confidentiality
- ETL Pipeline Development :
- Design and develop ETL pipelines to extract, transform, and load data from various sources
- Develop and maintain ETL scripts using Python and other programming languages
- Ensure data quality and integrity throughout the ETL process
- Data Integration and Migration :
- Integrate data from various sources, including APIs, files, and databases
- Develop and execute data migration plans to ensure seamless data transition
- Database Performance Optimization :
- Analyze and optimize database performance, including query optimization and indexing
- Implement caching and other techniques to improve database performance
- Collaboration and Communication :
- Work closely with development teams to ensure database systems meet application requirements
- Communicate database design and implementation plans to stakeholders
- Provide technical guidance and support to junior team members
Technical Requirements :
Programming Languages :Python (required)PySparkDatabase Management Systems :Experience with relational databases (e.g. MySQL, PostgreSQL, Oracle)ETL Tools :Experience with any open source ETL toolsData Formats :Experience with various data formats (e.g. CSV, JSON, Avro, Parquet)Operating Systems :Experience with Linux / Unix operating systemsFamiliarity with AWS cloud-based platformsNice to Have :
Certifications :Experience with :Data warehousing and business intelligence tools (e.g. Tableau, Power BI)Containerization and orchestration tools (e.g. Docker, Kubernetes)Example of Python skills we are looking for :
Python 3.xPandas, NumPy, and other data science librariesExperience with Python ETL frameworks : PySparkExample of ETL skills we are looking for :
Experience with any open source ETL toolsFamiliarity with data integration frameworks (e.g. Apache NiFi, AWS Glue)Knowledge of data quality and data governance best practicesExperience with data migration and data warehousing conceptsSkills Required
Unix, Csv, Postgresql, Pyspark, Json, Avro, Linux, Mysql, Oracle, Python, Aws