Senior Software Development Engineer, Annapurna Labs, Elastic Collectives

6 days ago


Tel Aviv, Tel Aviv, Israel Amazon Full time
We seek an experienced engineer to work on distributed Artificial Intelligence/Machine Learning (AI/ML) systems. This role focuses on developing high-performance collective operations - the fundamental operations that enable AI to scale efficiently across multiple accelerators and servers. Most of our stack uses C/C++ at a relatively low level, requiring knowledge of Linux systems and performance-optimized code.

We value experience with ML frameworks, performance tuning and optimization techniques, embedded systems, and high-speed networking interconnects. Experience optimizing ML workloads is particularly valuable for this role.
If you enjoy solving complex performance challenges, want to work with ML customers, iterate quickly, and deliver optimized solutions at scale, join us. You'll work at the forefront of AI/ML, developing high-throughput, low-latency features for the largest clusters, with the largest customers, for the largest AI models.

Key job responsibilities
You'll work on the stack from ML collective frameworks to the libfabric and Elastic Fabric Adapter (EFA) stacks. Your focus will be designing and implementing Application Programming Interfaces (APIs) and features, as well as optimizing performance at every layer, reducing latency, and maximizing throughput for ML workloads on AWS.

A day in the life
Annapurna Labs, a crucial part of AWS, is responsible for developing hardware and software components for EC2 infrastructure. Our team focuses on building networking solutions for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS.

We have mixed-discipline orgs: you'd be working side by side with infrastructure experts, hardware engineers, RTL engineers, scientists, and architects. Our workforce spans the globe and is truly international; you'll find yourself working alongside individuals from numerous countries. We take mentorship seriously: you can expect senior mentorship and will be expected to mentor new and junior engineers.

The pace is fast as we work on the latest advancements in AI/ML, but we take the time to bond as a team and enjoy the successes. We offer flexibility in working hours and respect work-life balance (WLB) as a core org tenet. The team works with numerous principal-level engineers and closely with directors, so career growth opportunities are available. This is a role where you will always be encouraged to keep learning, as the AI/ML field is fast-moving and constantly evolving.

About the team
Annapurna Labs, an integral part of Amazon Web Services (AWS), develops hardware and software components that serve as critical building blocks for Elastic Compute Cloud (EC2) infrastructure. Every instance in EC2 runs hardware designed by Annapurna Labs. We specialize in designing performance-optimized software, systems, and chips that enhance the AWS customer experience.

Basic Qualifications:
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience with at least one software programming language
- 5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems
- 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience as a mentor, tech lead or leading an engineering team

Preferred Qualifications:
- Master's degree in computer science or equivalent

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
