Original listing text, shown exactly as published by the company.
Responsibilities
- Build and own the infrastructure and platform capabilities that power Stytch’s identity platform as it scales across Twilio—ensuring security, reliability, and performance for every customer.
- Design, implement, and operate scalable cloud infrastructure (AWS/EKS, ECS, networking, data stores), balancing uptime, cost, and developer velocity.
- Partner closely with Product and Engineering leadership to set infrastructure direction, translate platform needs into technical plans, and deliver high-impact roadmap work.
- Collaborate across Twilio and Stytch teams to align on architecture, integrate platform capabilities, and unblock cross-team initiatives.
- Operate with deep technical ownership: author design docs, drive key technical decisions, review code, and stay close to the systems you ship.
- Build in ambiguity—break down complex problems, make pragmatic tradeoffs, and adopt new technologies or strategies when they improve outcomes.
- Improve production quality and resilience through strong observability, incident response, automated remediation, and continuous reliability engineering.
- Make developers’ lives easier by building self-service tooling, safer deployment patterns, and reliable platform primitives that accelerate product teams.
- Mentor and support other engineers through pairing, feedback, and knowledge-sharing, helping raise the team’s technical bar and culture.
Qualifications
Twilio values diverse experiences from all kinds of industries, and we encourage everyone who meets the required qualifications to apply. If your career is just starting or hasn't followed a traditional path, don't let that stop you from considering Twilio. We are always looking for people who will bring something new to the table!
*Required
- 6+ years of experience as an Infrastructure or Platform Engineer building and operating high-scale, mission-critical cloud production systems.
- Strong experience with containerization and orchestration (Kubernetes/EKS, Docker), Infrastructure as Code (Terraform, GitOps, or similar) and AWS.
- Hands-on proficiency in at least one modern programming language used in production.
- Experience designing and running observability and on-call systems (e.g., Datadog, ELK, Prometheus/Grafana).
- Experience scaling cloud infrastructure for distributed systems, including relational databases and high-availability service architectures.
- Excellent written and verbal communication skills; comfortable writing design docs and leading technical discussions.
- Bachelor’s degree in Computer Science or equivalent practical experience.
- Schedule: ability to work non-standard, on-call rotation weekend and holiday hours.
Desired
- Experience with multi-region or global infrastructure, including disaster recovery and data replication strategies.
- Familiarity with enterprise-scale platform challenges: multi-tenant infrastructure, compliance, and cost/performance optimization.
- Builder at heart. Through a hobby or your profession, you are passionate about being hands on and seeing your work come to life.
Location
This role will be remote, but is not eligible to be hired in San Francisco, CA, Oakland, CA, San Jose, CA, or the surrounding areas.
Travel
We prioritize connection and opportunities to build relationships with our customers and each other. For this role, you may be required to travel occasionally to participate in project or team in-person meetings.