Senior DevOps Engineer
Indexed description
Why does MWDN rock?:
Here’s what you can expect when you join MWDN:
- Security: We carefully vet our clients to minimize risks and ensure reliability and timely payments - no fraud or unpleasant surprises.
- Career support: If a project isn’t the right fit, we support you and actively help find new opportunities that match your skills and career goals.
- Legal assistance: We provide guidance on legal matters, including opening and managing your independent contractor or sole proprietorship status, taxes, and related processes.
- Professional development: We offer English courses and professional growth opportunities, as well as team-building events.
What is your new project?:
- Domain: Computer and network security
- Location: Israel
- Company size: 51-200 employees
- Founded in: 2009
Are you ready to join an innovative force in cybersecurity backed by one of the industry's biggest names? Our client, recently acquired by Check Point for $200 million, is on a mission to transform external risk management.
Imagine being part of a team that uses cutting-edge technology to protect businesses from the most dangerous cyber threats out there—monitoring the dark web, pinpointing vulnerabilities, and preventing data breaches.
This is more than just a job; it’s an opportunity to make a real impact in the world of cybersecurity. The pace is fast, the challenges are thrilling, and the solutions are AI-driven, putting you at the forefront of real-time threat detection.
What’s more, with the support of a global powerhouse like Check Point, you'll have the stability, resources, and career growth opportunities that only come with being part of a leader in the cybersecurity field!
Must Have
What makes you a great fit:
- 5+ years of experience as a DevOps / SRE / Infrastructure engineer.
- Proven experience managing large-scale SaaS systems on AWS (EKS, RDS, Kafka, Redis, S3, Lambda, CloudWatch).
- Deep understanding of Kubernetes architecture and container orchestration at scale, Karpenter.
- Hands-on experience with Terraform, Helm, and CI/CD automation (GitHub Actions, Jenkins, or ArgoCD).
- Strong scripting skills in Python, Bash, or Go.
- Familiarity with monitoring and alerting tools (Prometheus, Grafana, Loki, ELK).
- Experience using or integrating AI-assisted tools (e.g., for observability, auto-remediation, or developer productivity).
- Excellent troubleshooting skills and a proactive mindset for reliability and performance optimization
- Experience in multi-environment / multi-tenant SaaS or cybersecurity / threat intelligence systems.
- Knowledge of AI/ML pipelines or AIOps concepts..
- Background in cost optimization and FinOps practices.
- Familiarity with Kafka scaling, Redis clustering, and AWS service-level tuning.
- Be a key player in scaling and modernizing a global cyber intelligence SaaS serving leading enterprises.
- Collaborate with top-tier engineers and architects driving automation and intelligent operations.
- Take ownership and lead initiatives that directly affect uptime, reliability, and efficiency.
- Work in an environment that encourages innovation, experimentation, and adoption of AI and automation in day-to-day operations.
- Lead the DevOps domain: define architecture, automation strategy, and reliability goals for the entire R&D organization.
- Own infrastructure scalability and performance: ensure our Kubernetes (EKS)-based environments are resilient, efficient, and cost-optimized.
- Develop and maintain CI/CD pipelines using GitHub Actions, Jenkins, or ArgoCD to support fast, reliable, and automated delivery.
- Drive observability and reliability initiatives: monitor system health via Prometheus, Grafana, and CloudWatch; define metrics, alerts, and SLOs.
- Leverage AI/automation tooling (e.g., anomaly detection, alert classification, cost prediction) to enhance monitoring, response, and efficiency.
- Manage infrastructure as code (Terraform, Helm, CloudFormation) and enforce IaC best practices.
- Collaborate with engineering teams to design infrastructure for new services, improve developer experience, and ensure secure deployments.
- Ensure system uptime and production readiness: lead root cause analysis, incident response, and capacity planning.
- Continuously evaluate emerging technologies, including AI-driven ops tools, to improve scalability, reliability, and delivery velocity.
- People-first management with minimal bureaucracy
- A friendly company culture, proven by employees who choose to return
- Flexible working hours
- 29 days of PTO (18 working days per year pluse all national holidays)
- 10 paid recovery days
- Full financial and legal support for independent contractors
- Free English classes, with native speakers or Ukrainian teachers
- Dedicated HR support
Create a free Caio profile to unlock the full index and keep your job-search signal for future recommendations.
Unlock free search