Original listing text, shown exactly as published by the company.
About the Role
As an SRE 2 for Managed Gateways, you will be pivotal in ensuring the rock-solid reliability, scalability, and performance of Kong's critical managed services. Your expertise will directly impact customer trust and position Kong as a leader in the Agentic Era through unparalleled product stability.
What You’ll Do
- Implement and maintain robust automation for deploying and operating Kong's Managed Gateways across various cloud environments.
- Monitor system health, performance, and uptime, striving for 99.99% availability for our core infrastructure.
- Resolve complex production incidents efficiently, participating actively in on-call rotations to maintain service continuity.
- Build resilient tools and systems that enhance the overall reliability and operational efficiency of our platform.
- Contribute proactively to the prevention of technical debt, ensuring sustainable and scalable operations as Kong grows.
- Collaborate closely with engineering teams to design, review, and implement resilient and highly scalable services.
What You’ll Bring
- 2+ years of experience applying Site Reliability Engineering (SRE) principles and practices in a production environment.
- Proficiency in at least one of Golang or Python for automation, tooling, and infrastructure as code.
- Hands-on experience with Kubernetes and major cloud platforms such as AWS, GCP, or Azure.
- Familiarity with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Datadog).
- Solid understanding of networking concepts, distributed systems, and API gateways.
The Kong DNA
- Own the reliability and performance of critical production systems with a strong sense of accountability.
- Drive urgent resolution of issues, demonstrating a bias for action and minimizing customer impact.
- Collaborate effectively with cross-functional teams, fostering an environment of shared understanding and collective success.
Bonus Points
- Experience with Kong Gateway or other API management platforms.
- Relevant cloud certifications (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator).
- Active contributions to open-source projects or developer communities.
#LI-PC1