Principal Infrastructure Engineer (Compute)
Glocomms is partnered with a technology company pioneering the future of high-performance computing, AI training and inference, and supercomputing through proprietary GPU cluster infrastructure. As they are expanding their AI labs and data center operations across the U.S., they're seeking a Principal Infrastructure Engineer to lead the design, automation, and lifecycle management of their compute infrastructure powering GPU clusters.
Responsibilities
- Architect and deploy compute infrastructure involving GPUs and custom silicon across server and rack environments.
- Diagnose and resolve intricate issues related to high-performance computing systems.
- Build and support services for managing hardware and firmware configurations.
- Streamline server operations through full lifecycle automation.
- Oversee the entire compute hardware lifecycle, including coordination with vendors for replacements and repairs.
- Act as the primary resource for resolving hardware-related escalations.
- Track and enhance system performance, proactively addressing inefficiencies.
- Automate provisioning and administrative workflows to boost operational productivity.
- Work closely with networking and storage teams to ensure integrated infrastructure performance.
Qualifications
- 5+ years of hands-on experience in engineering compute platforms.
- Deep expertise in Linux system internals and performance optimization.
- Skilled in bare-metal deployment frameworks such as MAAS, Metal3, or Tinkerbell.
- Strong understanding of GPU architecture and workload tuning, particularly at the kernel and driver level.
- Proficient in infrastructure-as-code and automation tools like Terraform and Ansible.
- Experience managing container orchestration and workload scheduling using Kubernetes and SLURM.
This will be a remote, flexible role. Ideal candidates will be local to Austin, TX; New York, NY; Seattle, WA; or San Francisco, CA. The client will not be able to provide sponsorship now or in the future. If you or someone you know is interested, please apply directly!
FAQs
Congratulations! We understand that taking the time to apply is a big step. When you apply, your details go directly to the consultant who is sourcing talent. Due to demand, we may not be able to get back to every applicant. However, we always keep your CV and details on file, so when we see similar roles or skillsets that drive growth in organisations, we will reach out to discuss opportunities.
Yes. Even if this role isn’t a perfect match, applying allows us to understand your expertise and ambitions, ensuring you're on our radar for the right opportunity when it arises.
We also work in several ways. First, we advertise available roles on our site; however, due to confidentiality, we often cannot post all of them. We also work with clients who are more focused on skills and on understanding what is required to future-proof their business.
That's why we recommend registering your CV so you can be considered for roles that have yet to be created.
Yes, we help with CV and interview preparation. From customised support on how to optimise your CV to interview preparation and compensation negotiations, we advocate for you throughout your next career move.