Original listing text, shown exactly as published by the company.
About the Role
As a Senior Data Engineer, you will help build and scale our GCP Data Platform, reporting directly to the Data Platform Lead. You will own the full data lifecycle—from building our data lake and architecting ingestion pipelines for quantum computing experiments, to powering company-wide analytics and ML. In this role, you will drive platform infrastructure, data governance, quality, and engineering best practices.
Main Responsibilities
Data Platform & Lakehouse
- Design and operate a scalable GCP data lake (storage, partitioning, lifecycle, governance).
- Build ELT pipelines from APIs, event streams, internal systems, and scientific instruments.
- Develop lakehouse architectures using GCP services such as BigQuery, Composer/Airflow, Dataflow, Dataproc, Pub/Sub, and GCS.
- Work with open table formats such as Apache Iceberg, Delta Lake, or Hudi.
Quantum Experiment Data Ingestion
- Co-design ingestion pipelines for quantum computing experiment data alongside physicists and hardware engineers.
- Handle high-throughput scientific data, heterogeneous formats, metadata management, and traceability.
- Define scalable metadata standards and schemas.
Cross-Team Enablement
- Support engineering teams in designing reliable and maintainable data pipelines.
- Create reusable patterns, standards, and best practices.
- Mentor engineers on orchestration, data modeling, and testing.
Data Quality & Governance
- Implement validation, monitoring, data contracts, and governance standards.
- Own the data catalog and lineage documentation.
- Partner with analysts, ML engineers, and stakeholders to deliver trusted datasets and optimize BigQuery performance/costs.
Profile
- 5+ years of experience in Data Engineering and production-scale data platforms
- Strong expertise in SQL, Python, and GCP data services (BigQuery, Airflow, GCS, Pub/Sub)
- Experience with distributed processing tools and modern lakehouse technologies (Spark, Iceberg, Delta Lake, Hudi)
- Solid understanding of data modeling, orchestration, CI/CD, and data governance
- Experience with scientific, hardware, or experimental data pipelines is a plus
- Strong communication skills, analytical mindset, and ability to mentor cross-functional teams
- Fluent English required; experience in deeptech or quantum environments is a plus
Recruitment Process
Screening call with Doriane (30 min)
Hiring Manager Interview (60 min)
Technical Interview with the Team (90 min)
Leadership Interview (30 min)…