OpsWerks LinkedIn · Posted 27d ago

Senior Kubernetes Engineer

Mandaluyong, National Capital Region (Metro Manila), Philippines

Your Role

As a Senior Kubernetes Engineer (EKS), you will be part of a managed services team supporting Kubernetes clusters running production workloads. You will handle incident response and troubleshooting, drive operational improvements, and mentor junior engineers, while coordinating closely with internal teams and client stakeholders.


  • Provide hands-on incident response and troubleshooting for Kubernetes/EKS production environments, including investigation, mitigation, and follow-through actions.
  • Act as a senior escalation point for complex cluster and application platform issues (networking, DNS, ingress, autoscaling, scheduling, node issues).
  • Support and maintain EKS platform operations such as cluster upgrades, add-on management, node group/launch template changes, security patching, and capacity planning, following client change management processes.
  • Improve platform reliability by enhancing observability (metrics/logs/traces), alert quality, and runbook maturity.
  • Identify recurring issues and implement preventative actions through automation, standardization, and documentation.
  • Create and maintain runbooks, troubleshooting guides, operational checklists, and platform standards.
  • Participate in post-incident reviews (RCA) and ensure corrective and preventive actions are tracked to completion.
  • Mentor and guide junior engineers through reviews, pair troubleshooting, knowledge sharing, and operational best practices.
  • Demonstrate leadership during incidents and projects by coordinating tasks, communicating clearly, and keeping teams aligned on priorities.
  • Participate in an on-call rotation as part of a 24/7 operations model, with proper handoffs and team support.


Your Qualifications

  • Minimum of 5 years of hands-on Kubernetes experience supporting production environments.
  • Strong experience with Amazon EKS and common Kubernetes components (CoreDNS, kube-proxy, CNI, Ingress controllers, autoscaling).
  • Proven experience in incident response, troubleshooting, and production operations (debugging pods, networking/DNS issues, node problems, resource constraints, rollout failures).
  • Working knowledge of Kubernetes fundamentals: deployments, services, ingress, configmaps/secrets, RBAC, namespaces, quotas/limits, PDBs, and upgrade readiness.
  • Familiarity with observability and troubleshooting tools (CloudWatch, Prometheus/Grafana, Splunk/ELK, kubectl debugging, etc.).
  • Basic scripting/automation ability (e.g., Python or Bash) to reduce repetitive operational tasks.
  • Solid Linux and networking fundamentals (TCP/IP basics, DNS, TLS, load balancing concepts).
  • Excellent written and oral communication skills: able to write clear incident updates and documentation, and to explain technical issues to stakeholders.
  • Preferably has leadership skills (formal or informal), with the ability to guide others, lead discussions, and influence improvements.

Plus points if you have:

  • Kubernetes certifications (CKA/CKAD) or AWS certifications
  • Terraform or Crossplane
  • CI/CD (GitHub Actions, GitLab CI, Jenkins, etc.)
  • Networking (VPC design, routing, DNS, load balancers, troubleshooting)
  • Envoy / Nginx / Proxy concepts (Ingress, service routing, L7 behavior, TLS)
  • Experience with service mesh (Istio/Linkerd) and advanced traffic management