Senior Software Development Engineer, Annapurna Labs, Elastic Collectives
10 hours ago
Description
We seek an experienced engineer to work on distributed Artificial Intelligence/Machine Learning (AI/ML) systems. This role focuses on developing high-performance collective operations - the fundamental operations that enable AI to scale efficiently across multiple accelerators and servers. Most of our stack uses C/C++ at a relatively low level, requiring knowledge of Linux systems and performance-optimized code.
We value experience with ML frameworks, performance tuning and optimization techniques, embedded systems, and high-speed networking interconnects. Experience optimizing ML workloads is particularly valuable for this role.
If you enjoy solving complex performance challenges, want to work with ML customers, iterate quickly, and deliver optimized solutions at scale, join us You'll work on the forefront of AI/ML, developing high-throughput, low-latency features for the largest clusters, with the largest customers, for the largest AI models.
Key job responsibilities
You'll work on the stack from ML collective frameworks to the libfabric and Elastic Fabric Adapter (EFA) stacks. Your focus will be designing and implementing Application Programming Interfaces (APIs) and features, as well as optimizing performance at every layer, reducing latency, and maximizing throughput for ML workloads on AWS
A day in the life
Annapurna Labs, a crucial part of AWS, is responsible for developing hardware and software components for EC2 infrastructure. Our team focuses on building networking solutions that for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS.
We have mixed discipline orgs, you'd be working side by side with infrastructure experts, hardware engineers, RTL engineers, scientists & architects. Our workforce spans the globe and is truly international, you'll find yourself working side by side with individuals from numerous countries. We take mentorship seriously, you can both expect senior mentorship and will be expected to mentor new and junior engineers.
The pace is fast as we work on the latest advancements of AI/ML, but we take the time to bond as a team and enjoy the successes. We offer flexibility in working hours, and respect WLB as a core org tenet. The team enjoys working with numerous principal-level engineers and closely with directors, career growth opportunities are certainly available. This is a role where you will always be encouraged to keep learning, the AI/ML field is fast moving and constantly evolving.
About The Team
Annapurna Labs, an integral part of Amazon Web Services (AWS), develops hardware and software components that serve as critical building blocks for Elastic Compute Cloud (EC2) infrastructure. Every instance in EC2 runs hardware designed by Annapurna Labs. We specialize in designing performance-optimized software, systems, and chips that enhance the AWS customer experience.
Basic Qualifications
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Experience as a mentor, tech lead or leading an engineering team
Preferred Qualifications
- Master's degree in computer science or equivalent
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Company
- Annapurna Labs Ltd.
Job ID: A3098714
-
Tel Aviv, Tel Aviv, Israel Amazon Web Services (AWS) Full time ₪120,000 - ₪240,000 per yearDescriptionWe are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important....
-
Tel Aviv, Tel Aviv, Israel Amazon Full time ₪120,000 - ₪240,000 per yearWe are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience...
-
Tel Aviv, Tel Aviv, Israel Amazon Full time ₪120,000 - ₪240,000 per yearDESCRIPTIONWe are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important....
-
Tel Aviv, Tel Aviv, Israel myGwork - LGBTQ+ Business Community Full time ₪120,000 - ₪240,000 per yearThis job is with Amazon, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.DescriptionWe are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that...
-
Verification Tech Lead
1 week ago
Tel Aviv, Tel Aviv, Israel Annapurna Labs Ltd. Full time ₪180,000 - ₪250,000 per yearDESCRIPTIONAWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the...
-
SW Automation Student, Annapurna Labs
1 week ago
Tel Aviv, Tel Aviv, Israel Amazon Web Services (AWS) Full time ₪40,000 - ₪80,000 per yearDescriptionAnnapurna Labs, an Amazon company, plays a significant role in AWS's success with developing Graviton servers.Graviton servers are ARM-based processors that provide better performance and cost-effectiveness for running various compute-intensive applications on the cloud.As a Software Test Development Engineer in our team, you will be responsible...
-
SW Automation Student, Annapurna Labs
1 week ago
Tel Aviv, Tel Aviv, Israel myGwork - LGBTQ+ Business Community Full time ₪50,000 - ₪80,000 per yearThis job is with Amazon, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.DescriptionAnnapurna Labs, an Amazon company, plays a significant role in AWS's success with developing Graviton servers.Graviton servers are ARM-based processors that provide...
-
Software Development Engineer, Annapurna
10 hours ago
Tel Aviv, Tel Aviv, Israel Amazon Web Services (AWS) Full time ₪90,000 - ₪120,000 per year*DESCRIPTION*AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the...
-
Embedded Software Engineer, Annapurna
1 week ago
Tel Aviv, Tel Aviv, Israel Amazon Full time $80,000 - $120,000 per yearDESCRIPTIONAWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the...
-
Senior Software Engineer
6 days ago
Tel Aviv, Tel Aviv, Israel Scopio Labs Full time ₪120,000 - ₪180,000 per yearAbout Scopio LabsScopio Labs is a fast-growing medical technology company transforming the fields of hematology and cell morphology analysis. As a team, we work to enable faster, earlier detection and diagnosis of blood-related diseases, so patients can start treatment sooner and have better outcomes. We are collaborative and global – and every individual...