Head of Infrastructure
Indexed description
We need someone who can design, build, and own complex integration systems where reliability directly impacts clinical workflows. This person won't just execute tasks — they'll shape how the team thinks and builds. Reliability here has real-world consequences for clinical teams.
What You'll Do
- Compliance & Security: Ensure our infrastructure and processes are compliant with HIPAA and other relevant frameworks. Build and enforce security, audit, and data protection practices.
- Multi-Cloud Architecture: Design, operate, and evolve resilient infrastructure across AWS, GCP, and potentially other providers. Balance portability, cost, and compliance requirements.
- Kubernetes Operations: Own our Kubernetes clusters (multi-cloud), including networking, scaling, upgrades, security hardening, and workload orchestration.
- Observability & Reliability: Define and enforce standards for logging, metrics, tracing, dashboards, and alerting. Make sure teams have visibility into systems and can respond effectively.
- Incident Management: Build and maintain a strong incident response process, including runbooks, on-call practices, and postmortems.
- Automation & Infrastructure as Code: Lead the adoption and best practices for Pulumi (or equivalent), CI/CD pipelines, and self-service infra tooling.
- Proven experience building and running production infrastructure in regulated environments (HIPAA, SOC2, GDPR, etc).
- Deep expertise with Kubernetes in production, ideally across multiple cloud providers.
- Strong multi-cloud experience, especially AWS and GCP.
- Hands-on mindset — comfortable digging into systems, writing Pulumi modules, debugging incidents, or optimizing CI/CD pipelines.
- Track record of implementing observability, monitoring, and incident response systems that scale.
- Familiarity with zero-trust networking, secrets management, and compliance frameworks.
- Ability to balance technical depth with pragmatism: you know where to build from scratch vs. where to leverage vendor solutions.
- Experience managing backend engineers who build infrastructure services — not as a people manager, but as a technical leader who can guide, unblock, and raise the bar for engineering teams building infra-adjacent systems.
- Hands-on experience with graph databases (e.g. Neo4j, Amazon Neptune) and document storage systems (e.g. Firestore, MongoDB) in production environments.
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search