Senior SRE - AI Infra, UK Visa Sponsorship, London

NewsNowGh

Cambridge

On-site

GBP 80,000 - GBP 100,000

Full time

24 days ago

Job summary

A pioneering AI company is seeking a highly experienced Site Reliability Engineer to enhance the reliability and scalability of its AI platform. This role, based in England, offers visa sponsorship for international professionals. You will design and maintain resilient infrastructure, collaborate with software engineers, and drive improvements in monitoring and operations. Ideal candidates have a Master's degree and 7+ years in SRE or DevOps, with strong skills in cloud platforms and infrastructure tools. This is an excellent opportunity to work in a leading AI firm with global impact.

Qualifications

  • 7+ years of experience in SRE, DevOps, or similar roles in distributed systems environments.
  • Hands-on experience with Docker, Kubernetes, CI/CD pipelines, and infrastructure-as-code tools.
  • Solid knowledge of observability stacks, networking, security, and system administration.

Responsibilities

  • Design, build, and maintain scalable, highly available, and fault-tolerant infrastructure.
  • Ensure high availability of inference and training environments across HPC clusters.
  • Implement and improve monitoring, alerting, logging, and incident management systems.
  • Drive infrastructure-as-code, deployment, and orchestration.
  • Work with security teams to ensure compliance with best practices.

Skills

  • Cloud platforms
  • Reliability engineering practices
  • Docker
  • Kubernetes
  • CI/CD pipelines
  • Scripting or programming (Python, Go, Bash)
  • Observability stacks
  • Networking
  • System administration

Education

Master’s degree in Computer Science, Engineering, or a related field

Tools

Terraform
