Original listing text, shown exactly as published by the company.
Responsibilities
In this role, you’ll
- Lead the execution end-to-end: Design and lead the implementation of scalable, high-availability cloud infrastructure, moving beyond feature work to drive long-term platform strategy while championing Twilio’s "progress over perfection" mindset to deliver continuous, high-impact shipments.
- Operational Excellence: Own the reliability of services handling billions of weekly requests, setting the standard for operational best practices and incident response.
- Infrastructure Strategy: Drive the evolution of our Infrastructure as Code (IaC) patterns using Terraform, ensuring modularity, security, and reusability across the organization. Always aim for a self -service approach such that it keeps the team’s TOIL at minimum.
- Continuous optimization: Continuously looks for optimizations in our pipelines/releases/deliveries that balance rapid deployment with rigorous safety checks.
- Technical Mentorship and Reviews: Actively mentor L1 and L2 engineers as well as help with code reviews, design docs, and pair programming to foster a culture of technical excellence.
- Cross-Functional Influence: Collaborate with Cross teams, Product and Engineering leadership to align technical roadmap, debt reduction and new feature deliveries.
- Continuous Innovation: Be an owner and continuously research and prototype to optimise Twilio’s API infrastructure to provide best in class service to the customers
Qualifications
Twilio values diverse experiences from all kinds of industries, and we encourage everyone who meets the required qualifications to apply. If your career is just starting or hasn't followed a traditional path, don't let that stop you from considering Twilio. We are always looking for people who will bring something new to the table!
*Required
- 5+ years of professional experience in Cloud, DevOps, or Site Reliability Engineering (SRE), with deep proficiency in Python, Java, Go or another other language of choice.
- Architectural Depth: Proven track record of designing and deploying complex AWS cloud-native solutions (e.g., CloudFront, Multitenancies, DNS, Caching strategies, WAF, Lambda, S3, developing hosting solutions, etc) at scale.
- IaC Expert: Advanced experience with Terraform, including writing custom providers or managing state at scale across multiple environments.
- System Reliability: Deep understanding of SLIs, SLOs, Golden signals, Error Budgets and establishing monitoring strategies; in depth experience using Datadog, Grafana, or Athena to drive data-informed engineering decisions.
- Strong background in microservices architecture, specifically regarding traffic routing, rate limiting, and service discovery.
- Demonstrated ability to lead technical projects from conception to completion, navigating trade-offs and communicating complex technical concepts to non-technical stakeholders.
Desired
- Experience implementing security-at-scale, including WAF management, DDoS mitigation, and Zero Trust architectures.
- Platform Engineering previous experience, passion for building "Internal Developer Platforms" that abstract infrastructure complexity for product teams.
- A proven track record of contributing to the cloud-native or networking ecosystem, also has contributions to public Terraform providers or similar open-source initiatives
Location
This role will be remote, and based in Spain.
Travel
We prioritize connection and opportunities to build relationships with our customers and each other. For this role, you may be required to travel occasionally to participate in project or team in-person meetings.