Job Search and Career Advice Platform

Enable job alerts via email!

Site Reliability Engineer UK

AnaVation LLC

Remote

GBP 70,000 - GBP 90,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology company in the United Kingdom is seeking a highly skilled Site Reliability Engineer (SRE) to lead the transition from legacy systems to modern, scalable microservices. You will work closely with various teams to ensure uptime and reliability, emphasizing observability and automation strategies. Ideal candidates have 5+ years of experience in SRE, strong cloud platform skills (especially Azure), and a passion for cross-functional collaboration to enhance software delivery and system resilience.

Qualifications

  • 5+ years in SRE, DevOps, or Production Engineering roles.
  • Deep experience with cloud platforms (preferably Azure or AWS).
  • Hands-on experience with Azure DevOps is strongly preferred.
  • Proficiency with observability tools such as New Relic, Datadog.
  • Strong understanding of software deployment strategies.

Responsibilities

  • Collaborate with architects and engineers to build resilient systems.
  • Lead the charge on full-stack observability using modern APM tooling.
  • Develop and implement auto-scaling strategies and load testing plans.
  • Implement deployment strategies such as canary releases.
  • Create incident response playbooks and refine processes.

Skills

SRE, DevOps, or Production Engineering
Cloud platforms (Azure or AWS)
Infrastructure-as-Code (Terraform)
Azure DevOps
Observability tools (New Relic, Datadog)
Scripting languages (Python, PowerShell)
Multi-tenant environments
Cross-functional communication
Job description

About StarCompliance

StarCompliance is on a mission to make compliance simpleandeasy. Trusted globally by enterprise financial institutions, the user-friendly STAR platform empowers organizations to achieve regulatory compliance while safeguarding their integrityandbusiness reputations. Through a customizable, 360-degree view of employee activity, the STAR software enables firms to automate the detectionand resolution of potential areas of conflict while streamlining daily workflowsandincreasing efficiency.

Location: Candidates MUST be UK based and have right to work.

We are seeking a highly skilled and pragmatic Site Reliability Engineer (SRE) to help lead our evolution from legacy single-tenant monoliths to modern, scalable, multi-tenant microservices. This is a pivotal role for our business, enabling faster delivery, improved reliability, and real scalability across our SaaS portfolio.

While we’ve got a solid handle on infrastructure monitoring, we’re still in the early innings when it comes to application-level observability, autoscaling, and progressive delivery strategies (e.g., canary releases, blue/green deployments). That’s where you come in.

You’ll work closely with Infrastructure, Architecture, Engineering, and Support teams to design, build, and evangelize the next generation of SRE practices and tools that ensure uptime, resiliency, and customer trust.

Responsibilities
  • Champion Reliability by Design: Collaborate with architects and engineers to build resilient, fault-tolerant systems across our evolving cloud-native stack.
  • Observability Overhaul: Lead the charge on full-stack observability, leveraging modern APM tooling, meaningful SLOs/SLIs, and actionable alerts.
  • Scaling Systems: Develop and implement auto-scaling strategies, load testing plans, and capacity forecasting for multi-tenant environments.
  • Progressive Delivery: Help implement and automate deployment strategies such as canary releases, feature flags, and blue/green rollouts.
  • Incident Response: Create and refine on-call processes, incident response playbooks, and blameless post-mortem routines.
  • Monitoring & Tooling: Own and evolve our monitoring infrastructure, integrating metrics, logs, and traces into a cohesive ecosystem.
  • Developer Empowerment: Build reusable templates, dashboards, and platform tooling to empower dev teams to “shift left” on reliability.
  • Cross-functional Collaboration: Work hand-in-hand with Infrastructure, Architecture, Support, and Engineering teams to drive shared accountability for uptime and performance.
Skills
  • 5+ years in SRE, DevOps, or Production Engineering roles, ideally within a SaaS or cloud-native environment.
  • Deep experience with cloud platforms (preferably Azure or AWS), and Infrastructure-as-Code tools (e.g. Terraform).
  • Hands-on experience with Azure DevOps is strongly preferred, as our CI/CD and project workflows are fully built around it.
  • Proficiency with observability tools such as New Relic, Datadog, Prometheus, or similar.
  • Strong understanding of software deployment strategies, CI/CD pipelines, and release engineering.
  • Ability to code in at least one modern scripting or systems language (e.g., Python,PowerShell, Go, Bash).
  • Experience operating multi-tenant environments with an emphasis on security, performance, and cost optimization.
  • Excellent communicator who thrives in cross-functional settings and can influence engineering culture around reliability.
Desirable Skills
  • Experience in regulated industries (e.g., financial services, healthcare).
  • Background with service mesh architectures, distributed tracing, and gRPC/GraphQL.
  • Familiarity with incident management platforms (e.g., PagerDuty, OpsGenie).
  • Contributions to open-source SRE tooling or frameworks.
StarCompliance Background Checks

All positions require pre-employment screening due to employees potentially having access to highly sensitive and confidential information involving finance and compliance; candidates must be trustworthy and have a heightened sensitivity to protecting confidential financial, professional information. To be eligible for employment with StarCompliance, candidates must undergo a rigorous background investigation with checks including, but not limited to, criminal record history, consumer credit, employment history, qualifications, and education checks.

Equal Opportunity Employer Statement

We prohibit discrimination and harassment of any kind based on race, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, gender identity or expression, marital/civil union/domestic partnership status, veteran status or any other protected characteristic as outlined by country, state, or local laws.

This policy applies to all employment practices within our organisation, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. StarCompliance makes hiring decisions based solely on qualifications, merit, and business needs at the time. For more information, please request a copy of our Equal Opportunities Policy.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Data Engineer

AnaVation LLC

United Kingdom
Remote
GBP 60,000 - 80,000
Full time
30+ days ago
.Net Software Engineer

AnaVation LLC

York and North Yorkshire
On-site
GBP 40,000 - 60,000
Full time
30+ days ago
Relationship Manager - UK

StarCompliance, Inc.

United Kingdom
Remote
GBP 40,000 - 60,000
Full time
30+ days ago
Relationship Manager - UK

AnaVation LLC

United Kingdom
Remote
GBP 45,000 - 60,000
Full time
30+ days ago
Site Reliability Engineering (SRE) Manager

SS&C

London
Hybrid
GBP 80,000 - 100,000
Full time
30+ days ago
Software Engineer

StarCompliance, Inc.

York and North Yorkshire
On-site
GBP 30,000 - 40,000
Full time
30+ days ago
Site Reliability Engineer

Xceptor

Greater London
On-site
GBP 55,000 - 75,000
Full time
30+ days ago
Sales Engineer

AnaVation LLC

London
Remote
GBP 50,000 - 70,000
Full time
30+ days ago
Senior Software Engineer

AnaVation LLC

York and North Yorkshire
On-site
GBP 45,000 - 60,000
Full time
30+ days ago
Systems Reliability Engineer (SRE), Edge

CloudFlare

City of London
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago