Senior DevOps Engineer
Overview
We’re seeking an experienced Senior DevOps Engineer to maintain and expand our multi-cloud infrastructure, provide on-call support, and collaborate closely with our engineering team through pair programming. You will be a key contributor to ensuring uptime, stability, and scalability for our clients’ systems.
This role is best suited to a senior engineer who is comfortable with fractional ownership and with delivering high-impact support in a low-hour, on-call engagement.
Key Focus Areas
Multi-cloud infrastructure (AWS primary; Azure and GCP)
Container orchestration (Docker, Docker Swarm; familiarity with Podman or Kubernetes a plus)
Networking for client connectivity (VPCs, VPNs, security groups, subnets)
Rapid incident response and troubleshooting
Collaborative pair programming and knowledge sharing
Key Responsibilities
Provide on-call support for production incidents and infrastructure issues, troubleshooting and resolving them rapidly
Pair program with engineers to troubleshoot issues and share infrastructure best practices
Maintain and troubleshoot cloud infrastructure, including Docker containers and Docker Swarm orchestration
Manage networking and client connectivity patterns
Maintain Terraform/Terragrunt configurations and automate deployment processes
Monitor system health and performance, and implement alerting improvements
Implement security best practices, manage IAM roles, and configure secrets
Document infrastructure architecture, runbooks, and operational procedures
Collaborate on CI/CD workflows (primarily GitHub Actions) and support automation initiatives
Required Skills & Experience
3+ years of production experience with AWS (VPC, EC2, S3, IAM, CloudWatch); familiarity with Azure or GCP
Production experience with Docker and Docker Swarm, including container networking and service discovery
Strong Terraform experience; Terragrunt preferred
CI/CD experience with GitHub Actions and scripting (Bash, Python)
Experience participating in on-call rotations and incident response
Strong communication skills and comfort with collaborative pair programming
Preferred Skills
Python and/or Bash for infrastructure automation
Multi-cloud architecture patterns
Data pipeline infrastructure experience
Security best practices and cost optimization experience
Strong technical documentation skills
Technical Stack
Cloud: AWS (primary), Azure, GCP
Containers: Docker, Docker Swarm, Podman, Kubernetes
IaC: Terraform, Terragrunt
CI/CD: GitHub Actions
Scripting: Python, Bash
Monitoring: CloudWatch, Azure Monitor
Storage: S3, ADLS Gen2
Secrets: AWS Secrets Manager, Azure Key Vault
Engagement Details
Approximately 5–15 hours per month
Contractor/Consultant role, fully remote
On-call availability required; flexible schedule
Long-term engagement with opportunity for expanded scope and responsibility
Originally posted on Himalayas