HPC Engineer - Research Infrastructure @ Luma AI

Remote, USA Full-time Posted 2026-06-21

Help Luma build some of the biggest and fastest AI supercomputing clusters in the world! As a High-Performance Computing (HPC) engineer, you’ll work at the intersection of hardware and software, designing systems that deliver the maximum possible performance for running large-scale AI models. We work at the very cutting edge of speed and scale, combining the traditions of HPC with a modern cloud environment.

For this role, it’s important that you understand how to combine CPUs, GPUs, and network devices into systems that are deployed at large scale and run at peak efficiency. You understand the lowest levels of the software platforms that sit on top of this hardware, including how to best optimize the Linux kernel and user-space code. You are capable of writing code to automate the monitoring and healing of these systems, so that a few people can command a large number of servers.

Responsibilities

In this role, you will work closely with and directly accelerate machine learning researchers, but you don't need to be a machine learning expert yourself. We value people who can quickly obtain a deep technical understanding of new domains and who enjoy being self-directed and identifying the most important problems to solve. You’ll be managing training HPC clusters at Luma, from provisioning to performance tuning.

Areas of work will include observability, distributed job tracing, GPU diagnostics, software environment management, and additional tooling, plus work on the actual code to enable necessary features.

We believe that increasing compute is a huge lever for AI progress. You will have a direct impact on our ability to grow to an unprecedented scale and likewise produce unprecedented results.

Experience

8+ years of experience as an infrastructure or DevOps engineer working on large and complex distributed systems.
Deep understanding of networking; bonus points for experience in HPC networking.
Experience developing high-quality software in a general-purpose programming language, preferably including Python.
Excellent problem-solving skills and…
