Back to search
jobgether Lever · Posted 21d ago

Senior DevOps & Site Reliability Engineer

US Full-time

IT Security & IT Lever
Continue to application Add your email once, then Caio opens the original posting.

Indexed description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior DevOps & Site Reliability Engineer in the United States.

This role is a high-impact engineering position focused on building, scaling, and optimizing complex cloud environments across hybrid and multi-cloud infrastructures. You will operate at the intersection of DevOps, SRE, and software engineering, driving automation-first strategies to reduce operational toil and improve platform reliability. The environment blends legacy systems with modern cloud-native architectures, requiring strong problem-solving and modernization skills. You will contribute directly to CI/CD evolution, observability frameworks, and infrastructure automation initiatives that enable engineering teams to deliver faster and more safely. This is a hands-on, deeply technical role where reliability, performance, and security are central to everything you build. You will also collaborate closely with cross-functional teams to embed DevOps best practices across the software lifecycle.

Accountabilities

You will be responsible for ensuring the reliability, scalability, and operational efficiency of large-scale cloud platforms while driving automation and modernization initiatives. This includes designing CI/CD systems, reducing manual operational work, and improving infrastructure observability across hybrid environments. You will also play a key role in incident response, performance troubleshooting, and cloud security enforcement.

    • Design, build, and maintain CI/CD pipelines and self-service deployment frameworks
    • Automate operational workflows to eliminate manual “toil” and improve system efficiency
    • Manage and optimize cloud infrastructure across Azure and/or Google Cloud Platform
    • Implement Infrastructure as Code solutions using Terraform or Bicep
    • Develop automation and tooling using PowerShell and Python
    • Build and maintain observability systems for logs, metrics, and tracing across distributed systems
    • Perform root cause analysis for complex production incidents across application and infrastructure layers
    • Ensure compliance with security and governance standards such as SOC2, HIPAA, and ISO 27001
    • Collaborate with engineering teams to embed reliability and DevOps best practices into development workflows

    Requirements

    The ideal candidate has deep experience in DevOps or Site Reliability Engineering within complex, cloud-native and hybrid environments. You should be highly technical, automation-driven, and comfortable working across infrastructure, application, and security domains. Strong troubleshooting skills and the ability to reason through distributed systems are essential.

      • 6+ years of experience in DevOps, SRE, or related engineering roles
      • Strong expertise in Microsoft Azure and/or Google Cloud Platform
      • Advanced scripting skills in PowerShell and Python
      • Hands-on experience with Terraform or Bicep for infrastructure as code
      • Strong knowledge of Kubernetes (AKS/GKE), containerization, and orchestration tools
      • Experience working with Windows and Linux server environments
      • Familiarity with distributed systems and middleware (e.g., Event Hub, Service Bus, RabbitMQ, CosmosDB, MongoDB)
      • Strong analytical and troubleshooting skills in complex production environments
      • Experience working with CI/CD tools such as GitHub Actions or Bamboo
      • Strong understanding of cloud security and compliance frameworks
      • Excellent communication and collaboration skills

      Benefits

        • Competitive compensation package with performance-based considerations
        • Comprehensive health coverage including medical, dental, and vision plans
        • Disability insurance and employer-paid life insurance
        • 401(k) retirement savings plan
        • Paid parental leave program
        • Generous paid time off and paid company holidays
        • Flexible work schedules and fully remote work options
        • Mental health and wellness resources
        • Collaborative and modern engineering culture with strong autonomy
How Jobgether works: We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Why Apply Through Jobgether? Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1
Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search
Want help applying to roles like this? Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent