
Reliability Engineer
1 week ago
We're building the next generation of AI-powered workforces. As a dedicated team within Navan, our mission is to advance the state of agentic AI. We are the builders of Navan Cognition: a multi-agent AI platform that has already transformed our internal operations by handling challenging, real-world business processes with a focus on reliability and accuracy. Now, we're taking the next step by opening this technology up to other companies.
Joining our team means joining the frontline of AI innovation, crafting the foundation for a rapidly unfolding, AI-powered business era.
What You'll Do
- Design, build, and support tooling, automation, and infrastructure to maximize the reliability, scalability, and performance of Navan Cognition.
- Proactively identify, mitigate, and resolve issues, leveraging AI-driven insights and automation where possible.
- Develop robust monitoring, alerting, and incident response strategies; ensure actionable observability across all critical systems.
- Drive best practices in CI/CD, Infrastructure-as-Code, environment provisioning, and disaster recovery.
- Collaborate closely with engineering teams to build, deploy, and maintain highly available services in production.
- Take responsibility for uptime, reliability, and the operational excellence of Navan Cognition.
- Help define and measure SLOs/SLAs to ensure world-class service delivery.
What We're Looking For
- 3+ years in Site Reliability, DevOps, or related Infrastructure Engineering roles in 24/7 production environments.
- Deep experience operating, automating, and supporting distributed systems on AWS or similar clouds.
- Experience with Infrastructure-as-Code (e.g., Terraform, CloudFormation) and CI/CD tooling (e.g., Jenkins, GitHub Actions).
- Strong skills in Python, Bash, or comparable scripting languages for automation.
- Hands-on experience with observability stacks (e.g., New Relic, Grafana, CloudWatch, Datadog) and incident response.
- Familiarity with microservices architectures and patterns for resilience/scalability (e.g., throttling, retries, circuit breakers).
- Experience with common data stores (MySQL/RDS, DocumentDB, Elasticsearch, Redis).
- Working knowledge of backends (bonus: performance optimization and monitoring); experience with Java, Python, or Go is a plus.
- Interest or experience in applying AI for infrastructure automation, monitoring, or optimization (a strong plus).
- A collaborative mindset with strong communication skills, able to work independently and comfortably across teams and disciplines.
- Thrives in a fast-paced, high-growth environment and is ready to tackle complex system challenges at scale.
- Data-driven, analytical thinker with the ability to dive into metrics, identify insights, and drive product improvements.
- Startup-ready: thrives in fast-paced, ambiguous environments, with a bias for learning, action, and innovation.