Original listing text, shown exactly as published by the company.
What you will do
As a Platform Engineer, your responsibilities will include (but may not be limited to):
- Designing and implementing complex systems (e.g. scaling our research CI with a strong focus on reliability, reproducibility and speed).
- Building a flexible yet solid and accessible development environment for researchers, so they can focus on their core mission.
- Designing, implementing and advocating for solutions that handle large amounts of data and keep data pipelines maintainable.
- Optimizing a variety of builds: container images, compilation times for large libraries, Python environments...
- Building strong relationships with researchers, understanding their workflow and enabling them to achieve more by leveraging your expertise.
- Communicating and producing documentation or any other content that will help researchers make the most of the tools and systems you'll build.
- Being part of the team that "platformizes" research and constantly improves the daily experience for researchers while avoiding future roadblocks.
About you
- 5+ years of successful experience in a similar DX / DevOps / SRE role.
- Proficiency in software development (Python, Go...) and programming best practices.
- Exposure to site reliability engineering (root cause analysis, in-production troubleshooting, on-call rotations...).
- Exposure to infrastructure management (CI/CD, containerization, orchestration, infra-as-code, monitoring, logging, alerting, observability...).
- Technical product mindset (e.g. understanding how to debug poor adoption).
- Excellent problem-solving and communication skills (ability to contextualize, gauge risks and get buy-in for high-stakes, impactful solutions).
- Ownership, high agency and a constant drive to learn and improve things for others.
- Autonomous, self-driven and able to work well in a fast-paced startup environment.
- Low ego and a team-spirit mindset.
Your application will be all the more interesting if you also have:
- First-hand Bazel (or equivalent) experience.
- Strong knowledge of Python's ecosystem.
- Familiarity with GPU-based workloads and ecosystems.
- Experience with fully remote environments (you're comfortable with having some of your users on the other side of the globe).
Hiring Process
- Intro Call - 30 min
- Tech Culture Interview - 30 min
- Technical Rounds - 2 x 45 min
- Culture-fit Discussion - 30 min
- Reference Calls
By applying, you agree to our Applicant Privacy Policy.