Site Reliability Engineering Manager
2 weeks ago
At JFrog, we're reinventing DevOps to help the world's greatest companies innovate -- and we want you along for the ride. This is a special place with a unique combination of brilliance, spirit and just all-around great people. Here, if you're willing to do more, your career can take off. And since software plays a central role in everyone's lives, you'll be part of an important mission. Thousands of customers, including the majority of the Fortune 100, trust JFrog to manage, accelerate, and secure their software delivery from code to production -- a concept we call "liquid software." Wouldn't it be amazing if you could join us in our journey?
We are looking for a Site Reliability Engineering Manager to lead our Israel SRE team. In this role, you'll drive best practices in reliability engineering, ensuring the stability, availability, and performance of JFrog's SaaS services. You'll collaborate with global SRE leaders, refine processes, and foster a culture of accountability and continuous improvement.
*As a Site Reliability Engineering Manager at JFrog you will…*
- Lead, mentor, and develop a high-performing SRE Israel team, fostering collaboration, innovation, and accountability
 - Ensure SaaS reliability, performance, and availability, meeting or exceeding service-level objectives
 - Drive SRE best practices, including capacity planning, incident management, chaos engineering, and disaster recovery
 - Implement proactive monitoring, alerting, and anomaly detection aligned with SaaS standards
 - Collaborate with P&E and Cloud engineering teams to embed reliability into the SDLC
 - Oversee incident management, ensuring swift identification, escalation, and resolution
 - Maintain comprehensive SRE documentation, including processes, incident reports, and system architecture
 - Evaluate and adopt tools, technologies, and methodologies to enhance uptime and reliability
 
*To be a*
Site Reliability Engineering Manager
at JFrog you need…****
- 3+ years of management experience leading a team of SRE, DevOps, or a similar SaaS role
 - Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
 - Strong expertise in cloud platforms (AWS, GCP, or Azure), containers (Kubernetes, Docker), and configuration management (Terraform, Ansible)
 - Proficiency in Python or Go for automation and system optimization, as well as GitOps experience with SCM tools (e.g., Git, Bitbucket)
 - Strong leadership, communication, and collaboration skills, working across globally distributed teams
 - Familiarity with Agile methodologies, CI/CD pipelines, and orchestration tools (Jenkins, ArgoCD, StackStorm)
 - Familiarity with Chaos Engineering (e.g., Gremlin, Litmus, Chaos Toolkit)
 - Hands-on with alerting & observability tools (e.g., PagerDuty, OpsGenie, New Relic, Coralogix)
 - Strong understanding of scalability, high availability, and security best practices in cloud & Kubernetes environments
 
- 
					
						Certified Site Manager
2 weeks ago
Tel Aviv, Tel Aviv, Israel Yaniv Engineering Full time ₪80,000 - ₪120,000 per yearCompany DescriptionAt Yaniv Engineering, established by Gal Yaniv in 1998, we pride ourselves on maintaining high standards of quality and excellence. Our company has earned a reputation as a leading firm in our industry due to our innovative approach and commitment to advancing technology and control systems. We believe that our team's personal excellence,...
 - 
					
Site Reliability Engineer
2 weeks ago
Tel Aviv, Tel Aviv, Israel Shavit Software Full time ₪90,000 - ₪120,000 per yearWe're Hiring: Site Reliability Engineer Responsibilities:Ensure availability, reliability, and performance of cloud-based systemsMonitor, troubleshoot, and investigate incidentsImprove deployment, scaling, and self-healing processesManage full lifecycle of applications and systems through codeWork with Kubernetes and microservices-based environmentsWrite and...
 - 
					
						Site Reliability Engineer
2 days ago
Tel Aviv, Tel Aviv, Israel Cato Networks Full time ₪120,000 - ₪180,000 per yearWelcome to the future of cloud networking and securityCato Networks is the first company to converge enterprise networking and security into one centralized and global service that is delivered by cloud. It is led by networking and security pioneer Shlomo Kramer (Check Point, Imperva) and early investor (Palo Alto Networks, Exabeem, Trusteer and more)....
 - 
					
						Senior Site Reliability Engineer
4 days ago
Tel Aviv, Tel Aviv, Israel Aerospike Full time ₪900,000 - ₪1,200,000 per yearAerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank,...
 - 
					
						Senior Site Reliability Engineer
2 days ago
Tel Aviv, Tel Aviv, Israel Aerospike Full time ₪120,000 - ₪180,000 per yearAerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS...
 - 
					
						Site Reliability Engineer
2 weeks ago
Tel Aviv, Tel Aviv, Israel Finubit Full time ₪80,000 - ₪120,000 per yearAbout Finubit:Finubit is a fast-moving startup creating the bank's next-generation cloud platform — a modern, Kubernetes-native and AI-driven foundation that powers engineering for over a thousand developers.We're rethinking how banks build, deploy, and operate systems at scale — combining GitOps, ChatOps, and AI automation to enable...
 - 
					
Site Reliability Engineer
2 days ago
Tel Aviv, Tel Aviv, Israel Wiz Full time ₪90,000 - ₪120,000 per yearCome join the company that is reinventing cloud security and empowering businesses to thrive in the cloud. As the fastest-growing startup ever, Wiz is on a mission to help organizations secure cloud environments that will accelerate their businesses. Trusted by security teams all over the world, we have a proven track record of success and a culture that...
 - 
					
Sr. Site Reliability Engineer
2 weeks ago
Tel Aviv, Tel Aviv, Israel Navan Full time $104,000 - $130,878 per yearAt , we're building the next generation of AI-powered workforces. As a dedicated team within Navan, our mission is to advance the state of agentic AI. We are the builders of Navan Cognition: a multi-agent AI platform that has already transformed our internal operations by handling challenging, real-world business processes with a focus on reliability and...
 - 
					
						Sr. Site Reliability Engineer
2 weeks ago
Tel Aviv, Tel Aviv, Israel Tripeur - a Navan company Full time $104,000 - $130,878 per yearAt , we're building the next generation of AI-powered workforces. As a dedicated team within Navan, our mission is to advance the state of agentic AI. We are the builders of Navan Cognition: a multi-agent AI platform that has already transformed our internal operations by handling challenging, real-world business processes with a focus on reliability and...
 - 
					
						Lead Site Reliability Engineer
2 weeks ago
Tel Aviv, Tel Aviv, Israel Grubhub Full timeWhy Work For UsGrubhub, part of Wonder Group Inc, is all about connecting hungry diners with our network of over 375,000 merchants nationwide. Innovative technology, user-friendly platforms and streamlined delivery capabilities set us apart and make us an industry leader in the world of online food ordering. When you join our team, you become part of a...