Site Reliability Engineer

Date:  Aug 16, 2022
Location:  Israel, Petach Tikva

Job Category:  R&D
Department:  Product & Technology

Who we are:

CyberArk (NASDAQ: CYBR) is the global leader in Identity Security. Centered on privileged access management, CyberArk provides the most comprehensive security offering for any identity – human or machine – across business applications, distributed workforces, hybrid cloud workloads and throughout the DevOps lifecycle. The world’s leading organizations trust CyberArk to help secure their most critical assets.

 

What will you do:

CyberArk SREs are coders who enjoy a challenge and own the availability of CyberArk SaaS, from infrastructure to application. They measure failures and availability against SLIs and SLOs and take a proactive approach: prevention over mitigation, and mitigation over fixing. SREs collaborate with Dev and work with PM to continuously improve service availability and quality. They share ownership with the Dev team: the SRE owns the availability of the service, proactively prevents issues, and performs deliberate, structured troubleshooting to mitigate them.

CyberArk Cloud Engineering is looking for a Site Reliability Engineer with an "automation first" mindset who is passionate about performance, stability and security to share responsibility for CyberArk SaaS reliability. The Site Reliability Engineer will work closely with the Dev teams and the DevOps Engineers to ensure the security, performance, resiliency and scale of production services.

 

  • Monitor and improve the availability, performance and security of production services
  • Apply preventive measures to improve the reliability of production services
  • Mitigate issues on production systems and build automated solutions to prevent them from recurring
  • Apply the latest OS and security patches while ensuring compatibility with the running applications
  • Work collaboratively with application development teams throughout the organization to resolve mission-critical problems
  • Automate common, repeatable tasks using Ansible and scripting languages
  • Triage and manage escalation of cases 
  • Influence design / architecture of services to proactively prevent system failures

 

What you need to succeed:

  • 3+ years of experience as a DevOps / SRE / Production Engineer
  • 3+ years of cloud provider experience (primarily AWS)
  • 2+ years of experience with Python, Ruby, PowerShell, Bash
  • Strong hands-on experience with Linux/Unix and Windows OS
  • Strong hands-on experience with network architecture and security configurations
  • Hands-on experience with automation/configuration management using Ansible, Puppet, Chef or an equivalent
  • Bachelor’s Degree in Computer Science or related field
  • Excellent communication skills
  • Strong attention to detail
  • Strong hands-on technical abilities
  • Ability to keep track of numerous detail-intensive, interdependent tasks and ensure their accurate completion
  • A team player mentality with a strong sense of ownership

