Staff Platform Engineer
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Platform Engineer in the United States.
This role sits at the core of a high-scale, cloud-native infrastructure powering real-time streaming and advertising systems used across global audiences. You will define and evolve the platform layer that enables multiple engineering teams to deploy, scale, and operate high-throughput, low-latency services. The environment is highly technical and reliability-driven, with a strong emphasis on automation, observability, and performance optimization. You will act as a senior technical leader shaping infrastructure standards, Kubernetes-based architectures, and GitOps practices. The role blends hands-on systems engineering with strategic platform design and cross-team influence. You will also play a key role in incident response, reliability engineering, and mentoring other engineers to raise operational excellence across the organization.
Accountabilities:
- Define, architect, and evolve cloud-native platform infrastructure using Infrastructure as Code (Terraform or CDK)
- Design and implement scalable Kubernetes (EKS) environments with GitOps-driven deployment workflows
- Improve platform reliability, scalability, and cost efficiency through metrics-driven automation and autoscaling strategies
- Build and maintain observability systems using monitoring, logging, and distributed tracing tools such as Prometheus and Grafana
- Define and drive SLOs, capacity planning, and reliability standards across engineering teams
- Lead production incident response efforts, root-cause analysis, and blameless postmortems
- Design and optimize CI/CD pipelines enabling safe, fast, and reliable deployments
- Champion automation, platform standardization, and low-ops engineering practices
- Collaborate with application teams to support high-performance streaming and ad-serving systems at scale
- Mentor engineers and contribute to a strong engineering culture focused on reliability and excellence
Requirements:
- Extensive experience in platform engineering, SRE, or DevOps roles supporting distributed systems in production
- Strong expertise in cloud-native architecture, particularly AWS and Kubernetes (EKS preferred)
- Proficiency in systems programming languages such as Go, Python, or TypeScript for infrastructure automation
- Deep experience with Infrastructure as Code tools (Terraform, CDK) and GitOps methodologies
- Strong understanding of CI/CD systems and deployment pipelines (e.g., GitHub Actions, CodePipeline)
- Experience building and operating observability stacks including metrics, logging, and tracing systems
- Background in high-throughput or low-latency systems such as streaming, ad-tech, or event-driven architectures
- Strong knowledge of cloud security principles including IAM, encryption, and network segmentation
- Proven ability to analyze and optimize system performance, reliability, and cost efficiency
- Experience mentoring engineers and influencing technical direction across teams
Benefits:
- Competitive salary with equity package
- Fully remote work environment within the United States
- Comprehensive medical, dental, and vision insurance (often fully covered for employees)
- 401(k) retirement plan with company matching
- Flexible time off policy plus paid holidays
- Paid parental leave and family support programs
- Home office stipend to support remote setup
- Wellness benefits including mental and physical health subscriptions
- Career growth opportunities in a high-scale, modern infrastructure environment