
Principal Machine Learning Engineer – GenAI Benchmarking
2 hours ago
The Principal Machine Learning Engineer – GenAI
is responsible for
hands-on design, development, and operation
of large-scale systems and tools for AI model benchmarking, optimization, and validation.
Unlike traditional ML Engineers focused mainly on training models, this role centers on
building, running, and continuously improving the infrastructure, automation, and services
that enable rigorous, repeatable, and production-grade model evaluation at scale.
This is a
hands-on principal role
that combines strategic technical leadership with active engineering execution.
You will own the architecture, implementation, and optimization of benchmarking and validation capabilities across Red Hat's AI ecosystem. This includes architecting
Validation-as-a-Service platforms
, delivering high-performance benchmarking pipelines, integrating with leading GenAI frameworks, and setting industry standards for model evaluation quality and reproducibility.
The role demands
deep GenAI domain expertise, architectural foresight, and direct coding involvement
to ensure evaluation platforms are flexible, extensible, and optimized for real-world, large-scale use.
What You Will Do
- Architect and lead scalable benchmarking pipelines for LLM performance measurement (latency, throughput, accuracy, cost) across multiple serving backends and hardware types.
- Build optimization & profiling tools for inference performance, including GPU utilization, memory footprint, CUDA kernel efficiency, and parallelism strategies.
- Develop Validation-as-a-Service platforms with APIs and self-service tools for standardized, on-demand model evaluation.
- Integrate and optimize model serving frameworks (vLLM, TGI, LMDeploy, Triton) and API-based serving (OpenAI, Mistral, Anthropic) in production environments.
- Establish dataset & scenario management workflows for reproducible, comprehensive evaluation coverage.
- Implement observability & diagnostics systems (Prometheus, Grafana) for real-time benchmark and inference performance tracking.
- Deploy and manage workloads in Kubernetes (Helm, Argo CD, Argo Workflows) across AWS/GCP GPU clusters.
- Lead performance engineering efforts to identify bottlenecks, apply optimizations, and document best practices.
- Stay ahead of the GenAI ecosystem by tracking emerging frameworks, benchmarks, and optimization techniques, and integrating them into the platform.
What You Will Bring
- Advanced Python for ML/GenAI pipelines, backend development, and data processing.
- Kubernetes (Deployments, Services, Ingress) with Helm for large-scale distributed workloads.
- Deep expertise in LLM serving frameworks (vLLM, TGI, LMDeploy, Triton) and API-based serving (OpenAI, Mistral, Anthropic).
- GPU optimization mastery: CUDA, mixed precision, tensor/sequence parallelism, memory optimization, kernel-level profiling.
- Design and operation of benchmarking/evaluation pipelines with metrics for accuracy, latency, throughput, cost, and robustness.
- Experience with Hugging Face Hub for model/dataset management and integration.
- Familiarity with GenAI tools: OpenAI SDK, LangChain, LlamaIndex, Cursor, Copilot.
- Argo CD and Argo Workflows for reproducible ML orchestration.
- CI/CD (GitHub Actions, Jenkins) for ML workflows.
- Cloud expertise (AWS/GCP) for provisioning, running, and optimizing GPU workloads (A100, H100, etc.).
- Monitoring and observability (Prometheus, Grafana) and database experience (PostgreSQL, SQLAlchemy).
Nice to Have
- Distributed training across multi-node, multi-GPU environments.
- Advanced model evaluation: bias/fairness testing, robustness analysis, domain-specific benchmarks.
- Experience with OpenShift/RHOAI for enterprise AI workloads.
- Benchmarking frameworks: GuideLLM, HELM (Holistic Evaluation of Language Models), Eval Harness.
- Security scanning for ML artifacts and containers (Trivy, Grype).
- Design of tradeoff-analysis tools for model selection and deployment.
About Red Hat
Red Hat is the world's leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.
Inclusion at Red Hat
Red Hat's culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.
Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.
Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.
Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email application- General inquiries, such as those regarding the status of a job application, will not receive a reply.
-
Machine Learning Data Engineer
1 week ago
Petah Tikva, Center District, Israel Cellebrite Full time ₪104,000 - ₪130,878 per yearCompany Overview:Cellebrite's (Nasdaq: CLBT) mission is to enable its global customers to protect and save lives by enhancing digital investigations and intelligence gathering to accelerate justice in communities around the world. Cellebrite's AI-powered Digital Investigation Platform enables customers to lawfully access, collect, analyze and share digital...
-
Machine Learning Engineer
2 weeks ago
Be'er Sheva, South District, Israel NeuroBrave Full time ₪104,000 - ₪130,878 per yearJoin the Neuro-tech revolution todayNeuroBrave develops a revolutionary software platform for human brain and neural system digital connectivity, utilizing state-of-the-art proprietary signal processing, AI and infrastructure. Our bio-signal processing AI bridges the gap between neuroscience and industry. Our suite of mobile applications offers...
-
Machine Learning Engineer
2 weeks ago
Binyamina - Givat Ada, Haifa District, Israel Sunbit Full time $150,000 - $200,000 per yearSunbit builds financial technology for real life. Our technology eases the stress of paying for life's expenses by giving people more options on how and when they pay. Founded in 2016, Sunbit offers a next-generation, no-fee credit card that can be managed through a powerful mobile app, as well as a point-of-sale payment option available at more than 25,000...
-
AI Platform Principal
2 weeks ago
Yokneam Ilit, North District, Israel Quasar Medical | Medical Device Manufacturer Full time $104,000 - $130,878 per yearAI Platform PrincipalLocation - Quasar Israel Hayetzira 6 Yokneam Ilit IsraelRole OverviewAs Quasar's firstAI Platform Principal, you will establish Quasar's AI framework and drive enterprise‑wide implementation of the AI Work Intelligence Platform, fine‑tune its agents for Quasar‑specific processes and build internal capabilities so teams can iterate...
-
AI Transformation Principal
6 days ago
Petah Tikva, Center District, Israel CyberArk Full time ₪90,000 - ₪120,000 per yearCompany DescriptionAbout CyberArk:CyberArk (NASDAQ: CYBR), is the global leader in Identity Security. Centered on privileged access management, CyberArk provides the most comprehensive security offering for any identity – human or machine – across business applications, distributed workforces, hybrid cloud workloads and throughout the DevOps lifecycle....
-
Data Scientist
2 days ago
Center District, Israel G-STAT Full time ₪90,000 - ₪120,000 per yearWe're Hiring: Data Scientist for a Leading Financial OrganizationWhat you'll do:* Develop Machine Learning and NLP models* Get hands-on experience with GenAI and LLMs• Deploy models into Production environments* Collaborate closely with Product Managers, Analysts, and other Data ScientistsWhat we're looking for:* Relevant academic degree* 1–2 years of...
-
Digital Solution Engineer
2 hours ago
Raanana, Center District, Israel Keysight Technologies Full time ₪120,000 - ₪240,000 per yearAbout the jobKeysight is looking for an experiencedDigital Application/Solution Engineerthat will work closely with Keysight's Field Sales team and customers to understand measurement challenges, and propose solutions based on Keysight products while demonstrating Keysight's capabilities for customers' success.Every day you will have the opportunity to...
-
Junior Electrical/Systems Engineer – Design
2 weeks ago
Karmiel, North District, Israel Enertec Medical Full time ₪104,000 - ₪130,878 per yearWe're looking for a motivatedentry-level engineerto join our team and work directly with the Principal Engineer on projects in thedefense and medical sectors.Most of our work involvesautomated test and calibration systems– technology that ensures reliability in defense equipment and maintains high standards in medical devices. This is a hands-on role,...
-
Generative AI Engineer
2 hours ago
Raanana, Center District, Israel Integress, Inc. Full time ₪120,000 - ₪180,000 per yearAbout The PositionAbout AllCloudAllCloud is a global professional services company that provides organizations with cloud enablement and transformation tools. As an AWS Premier Consulting Partner and audited MSP, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front and back offices by building a new...
-
Data Engineer
4 days ago
Raanana, Center District, Israel AllCloud Full time ₪120,000 - ₪180,000 per yearAllCloud is a global professional services company that provides organizations with cloud enablement and transformation tools. As an AWS Premier Consulting Partner and audited MSP, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front and back offices by building a new operating model to harness the...