BI Data Engineer
We are looking for an BI Data Engineer to help build a cloud-native petascale+ data platform. Your contributions will enable data-centric AI and drive our mission to diagnose and treat cancer at it's most actionable and early stages.
How you’ll contribute:
? Develop internal tooling and analytic utilises
? Holistically visualize, analyze and curate the data lake
? Engineer robust and reproducible data pipelines that enable collaboration within Kaiko and with external parties
? Involve in the design and development of data-centric processes, as well as in their integration with existing information systems
? Continuously improve our security policies
? Involve into the coding process (reviews, testing, CI/CD)
What you’ll bring:
? 3+ years of experience with production infrastructure, automation, and software engineering
? B.S or M.S. in computer science, a related technical field, or comparable experience
? Experience with cloud providers (e.g., Azure, AWS, or GCP) ? Software design and development expertise, especially in Python
? Knowledge of open-source data lake(house) architectures like delta.io
? Proven track record in real-time streaming frameworks like Kafka
? Experience in BI tooling
? Experience building and launching projects in a production software environment, including use of automated testing, version control, and deployment systems
? A systematic problem-solving approach, effective communication skills and a strong desire to own and drive your work
Note: this position does not involve data analyst-related activities such statistical analysis or application of machinelearning methods.
Nice to Haves:
• Experience with (functional) programming languages such as Scala and Haskell • Distributed systems like Kafka, Spark, Beam, or Flink
• Azure experience • Experience with software in a regulated environment • Genomics or bioinformatics backgroun