SRE Cloud Engineer- Azure
Indexed description
Key Responsibilities
- Build and deliver a new platform on Microsoft Azure; continuously improve it through automation and standardization.
- Own day-to-day operations, monitoring, and support of the Azure environment (compute, storage, networking, security).
- Partner with development teams to ensure safe, stable releases with strong monitoring and tested rollback readiness.
- Implement Infrastructure as Code (Terraform) to provision and manage resources consistently across environments.
- Build and maintain automation (Python preferred) to reduce manual work, improve repeatability, and increase reliability.
- Participate in a rotating on-call schedule and troubleshoot and resolve issues across availability, performance, networking, and security; drive follow-ups to prevent recurrence.
- Support Kubernetes/AKS operations and troubleshooting (pods/logs, probes, scaling, resource limits, node issues).
- Improve observability by reducing alert noise, building dashboards, and adding synthetic checks for key user workflows.
- Perform application and OS lifecycle tasks (patching, upgrades, maintenance, vulnerability remediation, and operational readiness).
- Document designs, configurations, runbooks, and operational procedures to enable consistent on-call response.
- 5+ years in Cloud Engineering, SRE, DevOps, or a similar production operations role.
- Strong hands-on experience with Microsoft Azure in production environments.
- Hands-on experience with Terraform.
- Strong scripting/automation skills (Python preferred) with real operational automation examples.
- Proven incident ownership end-to-end (detect → triage → fix → prevent) with clear communication.
- Hands-on monitoring/observability experience in Azure (logs, dashboards, alert tuning / noise reduction).
- Strong Linux and/or Windows Server administration skills, including patching and lifecycle activities.
- Solid networking fundamentals (TCP/IP, DNS, VPNs, firewalls).
- Strong troubleshooting skills and ability to stay calm under pressure.
- Kubernetes and AKS operational experience.
- Familiarity with CI/CD tools (GitHub Actions, Azure DevOps, Jenkins, GitLab CI).
- Experience with Azure Monitor, Log Analytics, Application Insights, KQL; Prometheus/Grafana is a plus.
- Synthetic monitoring experience for login/API/workflow validation with alerting.
- AI-assisted ops experience for troubleshooting/automation (with strong validation habits).
- Azure certifications are a plus (e.g., Azure Administrator, Azure DevOps Engineer).
- Problem-solver: Works to find root cause and prevent repeat issues, not just apply temporary manual fixes.
- Proactive: Spots toil and automates it to improve reliability and speed.
- Collaborative: Communicates clearly during incidents and supports teammates.
- Adaptable: Stays calm in fast-changing environments and handles priority shifts.
- Detail-oriented: Validates changes, avoids risky shortcuts, and documents outcomes.
- Customer-focused: Understands impact and communicates clearly to restore service quickly.
- On-call: Rotating 24×7 coverage; lead/assist response and keep communication clear.
- Collaboration: Work closely with dev/product/security; prioritize automation and permanent fixes over repeat manual work.
- Growth & Culture: Ownership-driven environment where AI-assisted learning and experimentation is encouraged—move faster, but validate results and share learnings with the team.
Min – Max :
$85,276.80 - $127,915.20 (CAD)
Benefits
The benefits described represent the current offerings at our organization, however, benefits are subject to change and may vary by location and employment status. We strive to provide a comprehensive benefits package that supports our employee’s health, wellness, and financial goals. Please note that benefits may be discussed in more detail during the hiring process.
- Vacation to help you rest, recharge, and connect with loved ones
- Paid leave benefits
- Extended health, paramedical, dental, and vision benefits
- Registered retirement and tax-free savings plans
- Tuition reimbursement, life insurance, EAP – and more!
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search