Sydicom insightsSydicom overview

A remote DevOps & Infrastructure role at Drivetrain. Experience: 5+ years of hands-on experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles, preferably within a fast-paced SaaS…

Keywords this role’s ATS scans for

Sydicom tailors your CV and cover letter to match these.

PythonSQLAWSGoogle CloudDockerKubernetesTerraform

Level

Mid-level

Work

Remote

Focus

DevOps & Infrastructure

Pay

Est. $44k-$60k/yr

How Sydicom helps: we read this listing’s requirements and tune your CV and cover letter to the keywords its ATS (Lever) is scanning for, for candidates in India, then help you apply.

Related roles

Original listing text, shown exactly as published by the company.

Key Responsibilities

Cloud Infrastructure & Orchestration

•

Multi-Cloud Management: Architect, manage, and continuously optimize highly available cloud infrastructure across both AWS and GCP. Balance workload demands to ensure maximum cost-efficiency, scalability, and strict security compliance across both platforms.

•

Advanced Kubernetes Orchestration: Lead the design, deployment, and management of scalable Kubernetes clusters. Utilize configuration management tools like Kustomize to enforce standardized, repeatable, and automated deployment configurations across all environments.

•

Service Mesh & Security Integration: Implement and maintain service mesh technologies (e.g., Istio, Linkerd) to secure, control, and observe service-to-service communication. Drive container security best practices, including image scanning, runtime protection, and strict RBAC enforcement.

CI/CD & Automation

•

Pipeline Engineering: Architect, maintain, and optimize robust CI/CD pipelines using Git and Jenkins. Focus on reducing deployment friction, accelerating release velocity, and enforcing automated testing and security gates.

•

Infrastructure as Code (IaC): Treat infrastructure as software. Write, review, and maintain Terraform modules to provision and manage cloud resources predictably and safely.

•

Operational Automation: Aggressively reduce operational toil. Develop robust Python scripts and tooling to automate routine maintenance, data backups, scaling operations, and system recovery processes.

Observability & Reliability

•

Comprehensive Monitoring: Design and enhance our observability stack to provide deep, real-time insights into system health. Manage and scale tools including Prometheus, Grafana, ELK/EFK stack, AWS CloudWatch, and GCP Operations Suite.

•

Reliability Engineering: Spearhead reliability initiatives critical to a scaling SaaS platform. Drive rigorous capacity planning exercises to stay ahead of growth.

•

Incident Management & SLOs: Own the incident response lifecycle. Facilitate blameless postmortems to extract actionable learnings. Define, track, and enforce SLIs, SLOs, and SLAs, ensuring the platform consistently meets its reliability guarantees.

Collaboration & Leadership

•

DevOps Culture: Act as an embedded reliability advocate. Collaborate closely with software engineers early in the development lifecycle to ensure applications are designed for deployability, scalability, and resilience.

•

Continuous Improvement: Proactively identify system bottlenecks and architectural weaknesses. Contribute to process improvements, build internal developer tooling, and maintain comprehensive documentation to elevate team productivity and system understanding.

Required Proficiency & Qualifications

•

Experience: 5+ years of hands-on experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles, preferably within a fast-paced SaaS environment.

•

Cloud Platforms: Deep, proven proficiency in AWS (EC2, EKS, RDS, VPC, IAM, S3) AND GCP (GKE, Compute Engine, Cloud SQL, IAM, Cloud Storage). Ability to navigate and optimize multi-cloud architectures.

•

Containerization: Expert-level knowledge of Docker and Kubernetes, including advanced deployment strategies and lifecycle management.

•

Automation/IaC: Strong programming skills in Python and extensive experience with Terraform.

•

Observability: Hands-on expertise building dashboards and alerting systems using Prometheus, Grafana, and log aggregation stacks (ELK/EFK).

•

Networking & Security: Solid understanding of cloud networking (VPC peering, load balancing, DNS) and zero-trust security principles in a containerized environment.

About Drivetrain

Drivetrain

DevOps & Infrastructure

29 open roles on Sydicom

A drivetrain, also known as a transmission system, is the group of components that deliver mechanical power from the prime mover to the driven components. In automotive engineering, the drivetrain is the components of a motor vehicle that deliver power to the drive wheels. This excludes the engine or motor that generates the power. In marine applications, the drive shaft will drive a propeller, thruster, or waterjet rather than a drive axle, while the actual engine might be similar to an automotive engine. Other machinery, equipment and vehicles may also use a drivetrain to deliver power from the engine(s) to the driven components.

Source: Wikipedia

Site Reliability Engineer - SRE

Key Responsibilities

Required Proficiency & Qualifications

About Drivetrain

About Drivetrain