Senior Data Engineer

  • Full time
  • Prague
  • Posted 2 months ago
PySpark (nice to have)
kubeflow (nice to have)
CDK (regular)
REST API (advanced)
Pandas (advanced)
Python (advanced)
AWS (master)
We’re looking for a Senior Data Engineer, who’s ready to join the project from healthcare/pharma industry!
100% remote
Salary – depending on skills and experience – 
1000 – 1400 PLN net+VAT/day/B2B
(contract of employment is also possible)
You’ll be responsible for:
– the data on-boarding and hence custom integration work
– building different connectors such as FTP, API or JDBC integrations (therefore, a solid knowledge of AWS infrastructure is a must)
– making the data available in the data lake through AWS Glue, Appflow and Lake formation 
– writing unit/data tests and monitoring the quality of the overall on-boarding process

We require:
– experience building data pipelines in Python (experience with PySpark is a plus)
– understanding on AWS Cloud fundamentals (AWS certification is an advantage)
– solid knowledge in infrastructure as code -> CDK
– git & CI/CD knowledge is a must
– experience with common data Python libraries (Pandas, awswrangler, etc.)
– understanding of REST APIs from the consumer perspective
– ML knowledge is a plus (Kubeflow)
– good English – at least B2 (writing & speaking)

To apply for this job please visit