Job Search and Career Advice Platform

Enable job alerts via email!

Platform Engineer

Catches Limited

Remote

GBP 60,000 - 75,000

Full time

25 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking tech company in the United Kingdom seeks a Platform Engineer to manage high-performance computing infrastructure. The successful candidate will automate GPU cluster provisioning and maintain Linux environments. This full-time, remote role offers a high-trust environment with emphasis on innovation. Join a team where your influence will shape the product and engineering culture.

Benefits

Remote-first working environment
Co-working allowances
Innovative product influence

Qualifications

  • Strong background in Linux Systems Administration.
  • Experience managing Bare Metal servers (on-premise or packet/equinix metal).
  • Proficiency in Infrastructure as Code (IaC) tools.

Responsibilities

  • Automate the provisioning and lifecycle of high-performance GPU clusters using Terraform and Ansible.
  • Maintain the stability and performance of large-scale Linux environments supporting AI/ML training workloads.
  • Collaborate with vendors and internal teams to troubleshoot hardware and networking bottlenecks.

Skills

Linux Systems Administration
Infrastructure as Code (IaC)
Terraform
Ansible
GPU Management

Tools

Terraform
Ansible
Kubernetes
Docker
Prometheus
Grafana
Job description

Direct message the job poster from CATCHES

Backed by some of the most influential names in luxury fashion globally. We blend advanced 3D rendering, AI and VFX techniques to deliver unparalleled shopping experiences for luxury fashion.

Role

We are hiring a Platform Engineer to manage and optimise our next-generation high-performance computing infrastructure. Move beyond standard cloud instances and manage the raw power of bare metal GPU clusters.

Responsibilities
  • Automate the provisioning and lifecycle of high-performance GPU clusters using Terraform and Ansible.
  • Maintain the stability and performance of large-scale Linux environments supporting AI/ML training workloads.
  • Collaborate with vendors and internal teams to troubleshoot hardware and networking bottlenecks (latency, throughput).
  • Implement monitoring solutions (Prometheus/Grafana) to visualise GPU health and cluster efficiency.
  • Assist in optimising the stack for containerised workloads (Kubernetes/Docker).
Requirements
  • Strong background in Linux Systems Administration.
  • Experience managing Bare Metal servers (on-premise or packet/equinix metal).
  • Proficiency in Infrastructure as Code (IaC) tools.
  • Nice to have: Exposure to GPUs, InfiniBand, or high-throughput networking (we will train the right candidate).
What working with CATCHES is like
  • Fully remote-first, async-friendly, with optional co-working allowances.
  • High-trust, low-bureaucracy environment that values experimentation and shipping.
  • Early influence on product, architecture and engineering culture.
  • Cutting-edge tech, luxury-fashion creativity, and games-industry scale challenges combined.
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Information Technology
  • Industries: Technology, Information and Internet and Retail Apparel and Fashion

Referrals increase your chances of interviewing at CATCHES by 2x

Get notified about new Platform Engineer jobs in United Kingdom.

London, England, United Kingdom

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.