Data Engineer
Check us out!
👇 https://holisticon.pl/holisticon-insight/
🚀 We are looking for a Data Engineer who will serve as the foundational builder of this project. You will build and maintain the data infrastructure that ingests, standardizes, and moves data into the ML training environment. The core challenge is handling highly sparse, fragmented biological data spread across disconnected systems (such as Genesys, GRIN-Global, and GIGWA).
As a Data Engineer, You Will
- Develop API and SQL-based connectors for diverse data sources and build automated ingestion scripts to centralize tables.
- Standardize raw data into a unified staging format, implement version control for raw datasets, and establish data lineage tracking from source to staging.
- Map data schemas across different tables to identify overlaps, quantify sparsity levels, and identify missingness patterns.
- Implement secure credential management for database access.
What You Bring
- Core Tech: Advanced SQL, Python (Pandas, PySpark), and deep experience building REST API connectors.
- Data Architecture: Proven track record of designing data staging environments and managing complex data topologies.
- Domain Context: Previous experience handling fragmented, sparse datasets; familiarity with biological or agricultural data schemas is a strong plus.
What We Offer
- Life insurance
- Multisport card
- Fully remote job
- Private medical care
- Flexible working hours
- B2B contract
- Amazing integration events on a regular basis
- Training budget (e.g. Microsoft Azure Certifications)
- Opportunity to help shape our company culture
- Work equipment (laptop, 2 monitors, and accessories)