Senior HPC & DevOps Engineer

Advanced Micro Devices, Inc.

Remote

GBP 65,000 - 85,000

Full time

30+ days ago

Job summary

A leading technology company in the UK is seeking a Senior HPC & DevOps Engineer to manage high-performance computing clusters and modern DevOps infrastructure. Responsibilities include deploying and maintaining HPC clusters using Slurm, designing CI/CD pipelines, and automating infrastructure provisioning with tools like Ansible and Terraform. Ideal candidates possess strong skills in Kubernetes, CI/CD, and scripting. This role offers a dynamic environment with a focus on innovation.

Qualifications

  • 5+ years of experience in HPC and DevOps.
  • Strong understanding of infrastructure automation frameworks.
  • Proficiency in scripting with Python or Bash.

Responsibilities

  • Deploy and maintain high-performance computing clusters using Slurm.
  • Automate infrastructure provisioning with Ansible and Terraform.
  • Design CI/CD pipelines with tools like Jenkins and GitHub Actions.

Skills

Slurm management
GPU compute environments
CI/CD pipelines
Kubernetes orchestration
Python scripting
Ansible
Docker

Education

Bachelor's or Master's degree in computer/software engineering or a related technical discipline

Tools

Grafana
Prometheus
Checkmk
Terraform

Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

THE ROLE:

We are seeking a highly skilled Senior HPC & DevOps Engineer with experience in managing both high-performance computing clusters and modern DevOps infrastructure. The ideal candidate combines expertise in Slurm-managed HPC clusters, GPU compute environments, CI/CD pipelines, and Kubernetes-based orchestration. This person thrives in collaborative, fast-paced environments, drives technical execution with minimal oversight, and has a passion for building reliable, scalable, and high-performance systems.

THE PERSON:

The ideal candidate is a skilled engineer with a strong background in DevOps, site reliability, or infrastructure engineering. They are proficient in Kubernetes, CI/CD tools, scripting (Python/Bash), and infrastructure automation frameworks such as Ansible. Experience working with GPU compute environments and integrating automated test workflows is highly valued. This person thrives in collaborative, fast-paced environments and can drive technical execution with minimal oversight. They bring a problem-solving mindset, strong communication skills, and a passion for building reliable, scalable systems.

KEY RESPONSIBILITIES:
  • Deploy, configure, and maintain HPC clusters using Slurm.
  • Manage GPU compute nodes, high-speed interconnects, and parallel storage systems.
  • Design and maintain CI/CD pipelines using Buildkite, GitHub Actions, and Jenkins.
  • Automate infrastructure provisioning and configuration with Ansible, Terraform, Python, and Bash.
  • Deploy containerized applications using Docker, Kubernetes, and Helm.
  • Monitor cluster health and performance; build dashboards with Grafana, Prometheus, and Checkmk.
  • Collaborate across teams to optimize workflows, troubleshoot issues, and document best practices.

PREFERRED EXPERIENCE:
  • Strong experience with Slurm or equivalent HPC schedulers.
  • CI/CD, DevOps tools, and automation expertise.
  • GPU compute and lifecycle management (CUDA/ROCm).
  • Linux administration, shell scripting, and distributed systems troubleshooting.
  • Containerization and orchestration (Docker, Kubernetes, Helm).
  • Agile, collaborative mindset with strong communication skills.

ACADEMIC CREDENTIALS:
  • Bachelor's or Master's degree in computer/software engineering, computer science, or a related technical discipline.

Benefits offered are described in "AMD benefits at a glance."

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.