Job Search and Career Advice Platform

Enable job alerts via email!

Manager Site Reliability Engineering (GCP)

Veson Nautical LLC

Greater London

Hybrid

GBP 80,000 - GBP 110,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A global software company is seeking a Manager for Site Reliability Engineering (GCP) in London. You will lead a talented team in managing and scaling Google Cloud infrastructure. The successful candidate will have extensive experience in a SaaS environment and strong leadership skills, with responsibilities including overseeing cloud operations and driving automation. Fluency in English and French is essential, reflecting the team's diverse culture and clients worldwide. This position offers a hybrid work model, emphasizing collaboration and innovation.

Qualifications

  • Experience leading a Site Reliability Engineering or Cloud Operations team.
  • Operational experience with Google Cloud Platform in a SaaS environment.
  • Familiarity with Kubernetes and Terraform for infrastructure management.

Responsibilities

  • Lead and mentor a team of Site Reliability Engineers.
  • Design and manage scalable cloud infrastructure on Google Cloud Platform.
  • Establish effective monitoring and alerting practices.

Skills

Leadership
Cloud infrastructure management
Containerization (Docker, Kubernetes)
Automation (Terraform, IaC)
Scripting (Python, Bash)
Bilingual English and French

Tools

Google Cloud Platform
Elasticsearch
Terraform
CI/CD (Gitlab Pipelines, ArgoCD)
Job description
Overview

Manager, Site Reliability Engineering (GCP)

Veson Nautical empowers the global maritime industry to navigate complexity on all sides of the trade. Veson's platform combines AI-driven workflows, trusted data, and seamless collaboration, to deliver the insight and context needed for confident, competitive decision-making.

The Opportunity

As the manager of the Site Reliability Engineering team for Google Cloud Platform (GCP) at Veson Nautical, you will be responsible for designing, building, monitoring and supporting the GCP infrastructure that underpins our rapidly growing SaaS platform (10Kubernetes clusters, over 1800 pods, more than 20 billion documents on Elasticsearch) and the services and products that depend upon it. This includes:

  • Oceanbolt – a dynamic data intelligence platform, tracking over 23,000 vessels in real time to deliver accurate, timely market intelligence to drive decision making
  • Shipfix – using proprietary AI-driven tools to infer cargo and vessel information, extracting, anonymizing, and aggregating billions of data points with near real-time processing of email exchanges in the shipping market

Our business and our platforms are experiencing rapid growth, which ensures we have no shortage of exciting and challenging projects to work on.

The Team

This is a hands-on technical leadership role where you'll manage a talented team of Site Reliability Engineers, while personally contributing to architecture and infrastructure initiatives. We are looking for a leader who can think systematically and manage complex systems at scale through automation. The successful candidate will be comfortable participating in architectural discussions with software engineers, and from a scalability perspective will ensure that the platform can double in size over the next 1-2 years. The team is committed to a DevOps culture, proactive monitoring and managing infrastructure at scale – they thrive on improving our cloud-native platforms and adopting new technologies.

This position will be in our London office in Southwark, in a hybrid model where we expect a minimum of two days/week attendance in person.

Our Stack
  • Google Cloud Platform – primarily PaaS services (Bigtable, Cloud SQL, Dataflow, Datastore, GKE, GCS, KMS, Pub/Sub)
  • Email – ingestion through Microsoft Graph API automation and IMAP integrations
  • ElasticSearch – hosted with Kubernetes Operator
  • CI/CD – Gitlab Pipelines and ArgoCD
  • Infrastructure-as-Code – Terraform, Terragrunt and Atlantis
  • Monitoring and Security – Cloud Armor Enterprise, Grafana / Grafana Tempo, OpenTelemetry, OpsGenie, Renovate, Sentry
  • AI Tools – Augment Code, GitHub Copilot, Claude
Key Responsibilities
  • Lead and mentor a team of Site Reliability Engineers, fostering a culture of excellence, collaboration, and continuous improvement
  • Design, implement, and manage scalable, reliable, and secure cloud infrastructure on Google Cloud Platform
  • Oversee the provisioning and management of containerized applications using Docker and Kubernetes
  • Drive automation initiatives for infrastructure provisioning and configuration management using Terraform and other IaC tools
  • Partner closely with development teams to ensure reliability, performance, and scalability of platforms
  • Establish and maintain comprehensive monitoring, alerting, and observability practices
  • Build processes and discipline to improve consistency, visibility, and documentation across infrastructure and operations
  • Lead incident response efforts and ensure service uptime
  • Develop automation, monitoring, and management solutions
  • Prepare infrastructure for integration and future growth
Skills/Experience Needed to Be Successful in This Role

Required:

  • Previous experience working on a large-scale Software-as-a-Service (SaaS) platform which supported thousands of global users in a 24x7x365 environment
  • Previous experience leading a Site Reliability Engineering / Cloud Ops / Platform / Infrastructure team
  • Operational experience with Google Cloud Platform, Kubernetes and Terraform
  • Programming or scripting experience in Python, bash, or a similar language
  • Experience with cloud cost management (budgeting, anomaly detection, cost analysis and reporting, etc)
  • Bilingual fluency in English and French is essential for this position

Highly Desirable:

  • Previous experience architecting large Google Cloud infrastructure
  • Previous experience deploying / operating / monitoring Elasticsearch clusters
  • Experience leading a geographically distributed team

Nice to have skills:

  • KEDA / ArgoCD
  • PostgreSQL and BigQuery database management experience

We are focused on building a diverse and inclusive workforce. If you’re excited about this role, but do not meet 100% of the qualifications listed above, we encourage you to apply. While we try to be thorough with our job descriptions, not everything about you as a candidate can be condensed into a list of bullet points.

More About Veson:

We are a team of multi-cultural, multi-disciplined professionals that are dedicated to making our clients successful and charting a new, innovative course for the commercial marine industry. Veson Nautical employs a staff of extremely capable creators and innovators all focused on meeting the goals of our clients. We invest extensively in employee development and experience to maintain focus and enthusiasm. The Veson Nautical team is made up of a dynamic blend of engineers, artists, sailors, teachers, brokers, bankers, traders, consultants, and customer service experts.

Veson Nautical is a successful, rapidly growing global software company. Our clients are the world’s leading commercial maritime owners, operators and commodity trading companies. Veson’s solutions enable our clients to identify new opportunities and proactively manage their business to make more profitable decisions. With offices in Singapore, Tokyo, London, Houston and headquarters in Boston, USA, Veson Nautical is a dynamic organization with a committed team of professionals. Dedicated to ensuring the highest levels of client satisfaction, Veson Nautical brings decades of experience, technical knowledge, enthusiasm and commitment to clients around the world. The combination of exceptional market growth and leading market position make this a superb opportunity for the right candidate.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Account Executive - Maritime, Freight, Banking, & Other

Veson Nautical LLC

Greater London
Hybrid
GBP 70,000 - 100,000
Full time
30+ days ago
Data Scientist

Veson Nautical LLC

Stoke-on-Trent
Hybrid
GBP 45,000 - 70,000
Full time
30+ days ago
Surveyor / Senior Surveyor

Lloyd's Register Applied Technology Group

Hull and East Yorkshire
Hybrid
GBP 30,000 - 50,000
Full time
30+ days ago
Senior Platform Engineer - Featurespace

Visaitalia

Cambridge
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago
Senior Software Engineer (Full-Stack), Strategy Platform

AnaVation LLC

City of London
Hybrid
GBP 60,000 - 80,000
Full time
30+ days ago
Job Vacancy: Software Engineer

Idwal Marine

Cardiff
On-site
GBP 45,000 - 70,000
Full time
30+ days ago
Operations Manager

Enshore Subsea Ltd

Blyth
On-site
GBP 60,000 - 80,000
Full time
30+ days ago
Principal Site Reliability Engineer

Dubizzle Limited

Greater London
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago
Systems Engineering Team Lead

SEA

Barnstaple
Hybrid
GBP 80,000 - 80,000
Full time
30+ days ago
Marine Engineer - Southampton

Svitzer Americas

Southampton
On-site
GBP 80,000 - 100,000
Full time
30+ days ago