Senior Software Development Engineer, Annapurna Labs, Elastic Collectives

14 hours ago


Tel Aviv-Yafo, TA, ISR, Israel Amazon Full time ₪90,000 - ₪120,000 per year
We seek an experienced engineer to work on distributed Artificial Intelligence/Machine Learning (AI/ML) systems. This role focuses on developing high-performance collective operations - the fundamental operations that enable AI to scale efficiently across multiple accelerators and servers. Most of our stack uses C/C++ at a relatively low level, requiring knowledge of Linux systems and performance-optimized code.

We value experience with ML frameworks, performance tuning and optimization techniques, embedded systems, and high-speed networking interconnects. Experience optimizing ML workloads is particularly valuable for this role.
If you enjoy solving complex performance challenges, want to work with ML customers, iterate quickly, and deliver optimized solutions at scale, join us. You'll work at the forefront of AI/ML, developing high-throughput, low-latency features for the largest clusters, with the largest customers, for the largest AI models.

Key job responsibilities
You'll work across the stack, from ML collective frameworks to the libfabric and Elastic Fabric Adapter (EFA) stacks. Your focus will be designing and implementing Application Programming Interfaces (APIs) and features, as well as optimizing performance at every layer, reducing latency, and maximizing throughput for ML workloads on AWS.

A day in the life
Annapurna Labs, a crucial part of AWS, is responsible for developing hardware and software components for EC2 infrastructure. Our team focuses on building networking solutions for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS.

We have mixed-discipline orgs: you'd be working side by side with infrastructure experts, hardware engineers, RTL engineers, scientists, and architects. Our workforce spans the globe and is truly international, so you'll find yourself working with individuals from numerous countries. We take mentorship seriously: you can expect senior mentorship and will be expected to mentor new and junior engineers.

The pace is fast as we work on the latest advancements in AI/ML, but we take the time to bond as a team and enjoy the successes. We offer flexibility in working hours and respect work-life balance (WLB) as a core org tenet. The team works with numerous principal-level engineers and closely with directors, so career growth opportunities are certainly available. This is a role where you will always be encouraged to keep learning, as the AI/ML field is fast moving and constantly evolving.

About the team
Annapurna Labs, an integral part of Amazon Web Services (AWS), develops hardware and software components that serve as critical building blocks for Elastic Compute Cloud (EC2) infrastructure. Every instance in EC2 runs hardware designed by Annapurna Labs. We specialize in designing performance-optimized software, systems, and chips that enhance the AWS customer experience.

Basic qualifications
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- 5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience as a mentor, tech lead, or leader of an engineering team

Preferred qualifications
- Master's degree in computer science or equivalent

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.

