Job Search and Career Advice Platform

Enable job alerts via email!

Senior AI Reliability Engineer – Scale-Driven Systems

Applied Intuition Inc.

Greater London

Hybrid

GBP 70,000 - GBP 90,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading AI company in London is looking for a Senior Software Engineer focused on AI reliability engineering. This role involves developing Service Level Objectives and monitoring systems for large language model serving. The ideal candidate will have extensive experience with distributed systems and AI infrastructure challenges. Candidates should hold at least a Bachelor’s degree and possess strong communication skills. This position may require a hybrid work model, offering competitive compensation and benefits.

Benefits

Competitive compensation
Flexible working hours
Generous vacation and parental leave

Qualifications

  • Extensive experience with distributed systems observability and monitoring at scale.
  • Understand operating AI infrastructure including model serving.
  • Experience with chaos engineering and resilience testing.

Responsibilities

  • Develop Service Level Objectives for AI systems.
  • Design monitoring systems for availability and metrics.
  • Lead incident response for critical AI services.

Skills

Distributed systems observability
AI infrastructure operation
SLO/SLA frameworks implementation
Chaos engineering
Excellent communication skills

Education

Bachelor’s degree in a related field

Tools

AI-specific observability tools
ML hardware accelerators
Job description
A leading AI company in London is looking for a Senior Software Engineer focused on AI reliability engineering. This role involves developing Service Level Objectives and monitoring systems for large language model serving. The ideal candidate will have extensive experience with distributed systems and AI infrastructure challenges. Candidates should hold at least a Bachelor’s degree and possess strong communication skills. This position may require a hybrid work model, offering competitive compensation and benefits.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI Platform Engineer — Scale AI Infrastructure

Plain

City of London
Remote
GBP 100,000 - 125,000
Full time
30+ days ago
Senior AI Software Engineer — Cloud-Native & Secure APIs

AVEVA Group Limited

Cambridge
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago
Senior ML Engineer — Large-Scale AI Systems

Applied Intuition Inc.

Greater London
On-site
GBP 80,000 - 100,000
Full time
30+ days ago
Senior Backend Engineer - AI Platform & Scalable Systems

Methodfi

City of London
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago
Senior AI Data Infrastructure Engineer – Remote

Voice AI Space

Greater London
Hybrid
GBP 117,000 - 154,000
Full time
30+ days ago
Senior SRE - AI Infra, UK Visa Sponsorship, London

NewsNowGh

Cambridge
On-site
GBP 80,000 - 100,000
Full time
30+ days ago
Senior AI Deployment Architect: Scale AI Solutions

Celonis GmbH

City of London
On-site
GBP 70,000 - 90,000
Full time
30+ days ago
Senior Software Engineer & Team Lead — Scale an AI Platform

Methodfi

Greater London
On-site
GBP 80,000 - 100,000
Full time
30+ days ago
Senior Applied AI Engineer - Production-Scale AI Systems

ModelML Inc.

City of London
On-site
GBP 70,000 - 90,000
Full time
30+ days ago
Senior AI Engineer - Customer-Facing Feature Lead

Methodfi

Greater London
Hybrid
GBP 70,000 - 100,000
Full time
30+ days ago