Staff Network Automation Engineer
Glocomms is partnered with a British technology company pioneering the future of high-performance computing, AI training and inference, and supercomputing through proprietary GPU cluster infrastructure. As they are expanding their AI labs and data center operations across the U.S., they're seeking a Staff Network Automation Engineer to lead the design, automation, and lifecycle management of their modular network infrastructure. This role is ideal for a seasoned engineer passionate about infrastructure-as-code, scalable automation, and cutting-edge networking technologies.
Responsibilities
- Design and implement network automation workflows using Python, Ansible, Terraform, and Salt.
- Develop infrastructure-as-code solutions for scalable and repeatable deployments.
- Build and maintain configuration management systems and software push platforms.
- Lead zero-touch provisioning (ZTP) and lifecycle controller implementations.
- Architect modular network infrastructure for high-performance GPU clusters and AI labs.
- Integrate network automation with container orchestration platforms like Kubernetes.
- Implement monitoring and remediation systems using Prometheus and Grafana.
- Maintain source-of-truth and simulation environments using NetBox and Containerlab.
- Collaborate with cross-functional teams to support AI training, inference, and supercomputing workloads.
- Ensure operational excellence and reliability across network infrastructure.
Qualifications
- Strong proficiency in Python and automation tools (Ansible, Terraform, Salt, Puppet).
- Experience with network infrastructure tools (NetBox, Containerlab).
- Deep understanding of networking protocols: TCP/IP, IPv4/IPv6, BGP, MPLS, DHCP, DNS.
- Expertise in overlay technologies: EVPN/VXLAN, Geneve, BGP+EVPN.
- Familiarity with networking platforms: Mellanox, Arista, Cumulus, SoNIC.
- Experience with Kubernetes and CNIs (Calico, Cilium); cloud-native and container technologies.
- Proven track record in infrastructure-as-code and configuration management.
- Strong communication, collaboration, and ownership mindset.
- Commitment to operational excellence and customer-first thinking.
- Growth mindset and ability to mentor and lead technical initiatives.
This will be a remote flexible role, ideal candidates will be local to Austin TX, New York NY, Seattle WA, or San Francisco CA. The client will not be able to sponsor now or in the future. If you or something you know is interested, please apply in directly!