Lead Site Reliability Engineer

Methodfi

Greater London

Hybrid

GBP 70,000 - 90,000

Full time


Job summary

A leading fintech company in Greater London is looking for an experienced SRE/DevOps Engineer to join its team. The role focuses on designing, implementing, and maintaining cloud infrastructure, particularly on Google Cloud Platform. You will need strong expertise in Kubernetes and Docker, as well as experience with Terraform for infrastructure as code. The company offers a hybrid working model, valuing diverse perspectives and fostering an inclusive culture. Join us to help reshape property investing and contribute to a more equitable future.

Qualifications

  • 5+ years in SRE, DevOps, or platform engineering roles with production-grade infrastructure experience.
  • Strong hands-on experience with Google Cloud Platform (GCP).
  • Expert-level knowledge of Kubernetes and Docker.
  • Proficiency in Terraform for infrastructure as code.
  • Experience with Cloudflare services, including DNS and CDN.
  • Experience implementing and managing observability stacks with Prometheus and Grafana.
  • Strong understanding of CI/CD principles.
  • Knowledge of cloud networking and security practices.
  • Solid scripting skills (Shell, Python, or similar).

Responsibilities

  • Design, implement, and maintain cloud infrastructure on GCP.
  • Own Kubernetes clusters and containerization strategy.
  • Build and evolve Infrastructure as Code using Terraform.
  • Manage Cloudflare infrastructure for performance optimization.
  • Deploy AI-powered product features in secure serverless environments.
  • Implement monitoring and observability with Prometheus and Grafana.
  • Design and maintain CI/CD pipelines for fast, safe releases.
  • Ensure security best practices across infrastructure.
  • Work with teams to improve application reliability and performance.
  • Enable developer productivity through self-service tooling and automation.

Skills

  • SRE experience
  • Google Cloud Platform (GCP)
  • Kubernetes
  • Docker
  • Terraform
  • Cloudflare services
  • Prometheus
  • Grafana
  • CI/CD principles
  • Scripting skills

Job description

London, Waterloo (Hybrid, 4 days in-office - Wednesday is our set work from home day, though you can come in on Wednesday too if you wish)

We are disrupting one of the world's largest asset classes: property. With £2Bn+ in assets on our platform and 30,000+ users across 70 countries, we're building the future of asset ownership and, in doing so, are able to address wealth inequality.

Our product simplifies property investing from start to finish, making real estate investment accessible to everyone.

What you'll love doing
  • Working in cross-functional product teams, taking infrastructure and reliability initiatives from concept to production.

  • Navigating ambiguity in a fast-moving environment where ownership and freedom are core to how we operate.

  • Building and maintaining robust, scalable infrastructure across our GCP cloud environment. Working with Kubernetes, Terraform, Cloudflare, and modern observability tooling to ensure our platform runs smoothly.

  • Collaborating closely with engineering teams to design CI/CD pipelines, improve deployment practices, and champion reliability as a core engineering principle.

  • Helping to define SRE practices for a high-growth fintech platform. Mentoring other engineers as we scale our teams and impact.

What you'll be doing
  • Designing, implementing, and maintaining our cloud infrastructure on Google Cloud Platform (GCP), ensuring scalability, reliability, and security.

  • Owning our Kubernetes clusters and containerization strategy - from Docker image optimization to cluster management and deployment orchestration.

  • Building and evolving our Infrastructure as Code using Terraform, creating modular, testable, well-documented configurations that scale with our rapid growth.

  • Managing and optimizing our Cloudflare infrastructure, including Workers for edge computing, DNS, CDN, security policies, and performance optimization.

  • Deploying AI-powered product features in isolated and secure serverless environments.

  • Implementing comprehensive monitoring and observability using Prometheus and Grafana, defining SLIs/SLOs, and proactively identifying issues before they impact users.

  • Designing and maintaining CI/CD pipelines with appropriate quality gates, testing strategies, and deployment techniques (blue-green, canary) to enable fast, safe releases.

  • Ensuring security best practices across our infrastructure - from network design and access controls to secrets management and vulnerability scanning.

  • Working with engineering teams to improve application reliability, performance, and observability through instrumentation and architectural guidance.

  • Enabling developer productivity through self-service tooling, clear documentation, and automation of operational tasks.

What we're looking for

Essential

  • 5+ years in SRE, DevOps, or platform engineering roles with production-grade infrastructure experience

  • Strong hands-on experience with Google Cloud Platform (GCP)

  • Expert-level knowledge of Kubernetes and Docker - you've deployed, managed, and troubleshot production clusters

  • Proficiency in Terraform for infrastructure as code

  • Experience with Cloudflare services, including Workers, DNS, CDN, and security features

  • Experience implementing and managing observability stacks with Prometheus and Grafana

  • Strong understanding of CI/CD principles, pipeline design, and deployment strategies

  • Experience with cloud networking, security groups, VPCs, and network peering

  • Solid scripting skills (Shell, Python, or similar)

Desirable

  • Experience with blue-green or canary deployment techniques

  • Familiarity with programming languages like Go or TypeScript

  • Background in implementing security automation and quality gates

  • Experience with configuration management tools

  • Understanding of SRE principles: SLIs, SLOs, error budgets, and blameless postmortems

  • Experience with edge computing and serverless architectures

  • Track record of mentoring engineers and fostering a culture of reliability

What we are building

The first end-to-end real estate investment offering - making the dream of owning real estate more accessible to everyone globally.

Diversity & inclusion at GetGround

We encourage applications from all sections of society and we believe in the criticality of an inclusive culture. We are committed to equal employment opportunity regardless of race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity or any other basis as protected by law.

  • 42% of our employees identify as female or non-specified, 58% as male

  • 22 nationalities represented across offices in 5 countries

  • Design Accessibility

  • Inclusion is at the heart of our culture - we celebrate and reflect on key D&I and cultural events such as: Black History Month, International Women's Day and Pride
