Back to Jobs

[Remote] Staff AI/ML Infrastructure Engineer

Remote, USA Full-time Posted 2026-06-24

Note: The job is a remote job and is open to candidates in USA. Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. The Staff AI/ML Infrastructure Engineer will drive the design, performance, and reliability of the AI infrastructure platform, requiring deep GPU systems knowledge and strong automation experience.

Responsibilities

  • Design and maintain GPU and bare metal infrastructure in containerized and physical environments
  • Build scalable GPU clusters in partnership with networking and provisioning teams
  • Ensure reliable, high-performance provisioning of GPU infrastructure
  • Develop automated testing systems for GPU-based platforms
  • Implement infrastructure solutions for diverse AI/ML workloads
  • Benchmark, test, and troubleshoot GPU performance at scale
  • Collaborate with hardware vendors on drivers, firmware, and support
  • Resolve hardware, software, and performance issues across environments
  • Optimize rail and cluster performance across architectures
  • Lead technical direction and mentor engineers on infrastructure best practices

Skills

  • 5+ years experience working with bare metal infrastructure and hardware automation
  • Hands-on experience with modern NVIDIA/AMD GPU platforms and high-performance networking (RoCE, InfiniBand)
  • Deep knowledge of BIOS, BMC, firmware, NICs, Redfish/IPMI, and PCIe systems
  • Strong Linux systems experience including device drivers and package management
  • Experience building infrastructure automation using Python and Bash
  • Familiarity with GPU drivers, firmware ecosystems, and vendor collaboration
  • Experience designing and delivering complex infrastructure products
  • Proven ability to lead projects and mentor engineers
  • Experience optimizing multi-cluster GPU environments
  • Exposure to Machine Learning software stacks and GPU workloads

Benefits

  • 100% company-paid insurance premiums for employee medical, dental and vision plans.
  • 401(k) plan that matches 100% up to 4%, with immediate vesting
  • Professional Development Reimbursement of $2,500 each year
  • 11 Holidays + Paid Time Off Accrual + Rollover Plan
  • Commitment matters to Vultr! Increased PTO at 3 year and 10 year anniversary + 1 month paid sabbatical every 5 years + Anniversary Bonus each year
  • $500 stipend for remote office setup in first year + $400 each following year
  • Internet reimbursement up to $75 per month
  • Gym membership reimbursement up to $50 per month
  • Company paid Wellable subscription

Company Overview

  • Vultr is an AI cloud infrastructure platform offering latest generation NVIDIA GPUs and AMD CPUs and GPUs across 32 worldwide regions It was founded in 2014, and is headquartered in West Palm Beach, Florida, USA, with a workforce of 201-500 employees. Its website is https://www.vultr.com.
  • Company H1B Sponsorship

  • Vultr has a track record of offering H1B sponsorships, with 1 in 2024. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Similar Jobs

    [Remote] Account Executive, MidMarket (LATAM)

    Remote, USA Full-time

    [Remote] EHV EPC Project Manager (Power Delivery)- Remote

    Remote, USA Full-time

    [Remote] Clinical Account Manager- Atlanta, GA area

    Remote, USA Full-time

    [Remote] Field Marketing Manager, East Coast

    Remote, USA Full-time

    [Remote] Business Development Manager, Fermentation

    Remote, USA Full-time

    [Remote] Software Engineer

    Remote, USA Full-time

    [Remote] Business Development Manager – Lender/Buyer Relationships (US)

    Remote, USA Full-time

    [Remote] Staff ML Application Engineer

    Remote, USA Full-time

    [Remote] Healthcare RCM Client Success Manager

    Remote, USA Full-time

    [Remote] Staff Software Engineer

    Remote, USA Full-time

    QA Lead

    Remote, USA Full-time

    Experienced Customer Service Representative – Tax Compliance and Support

    Remote, USA Full-time

    Experienced Full Stack Customer Support Specialist – Remote Apple Product Support

    Remote, USA Full-time

    Remote Live Chat Support Specialist – Real‑Time Customer Engagement for arenaflex Creators (No Experience Required)

    Remote, USA Full-time

    Tech Lead, Web Core Product & Chrome Extension - Melbourne, Australia

    Remote, USA Full-time

    Market Development Manager - Biopharma

    Remote, USA Full-time

    Market Intelligence Lead, Commercial Real Estate

    Remote, USA Full-time

    [Remote] Recruiter | Turn Your Experience Into a Business

    Remote, USA Full-time

    Experienced Full Stack Customer Success Account Manager – Cloud Services and Strategic Partnership Development

    Remote, USA Full-time

    Software Engineer, iOS Core Product - Munich, Germany

    Remote, USA Full-time