AI Supercomputing Infrastructure Engineer

University of Bristol

Bristol

Hybrid

GBP 40,000 - GBP 60,000

Full time

Today

Job summary

A leading university in the UK is seeking an AI Supercomputing Infrastructure Engineer to join its Bristol Centre for Supercomputing. The role involves designing and operating large-scale supercomputing services and working closely with researchers. Candidates should have expertise in areas such as NetOps or DevOps and a degree (or equivalent practical experience) in a relevant field. The position supports hybrid working and is offered on an open-ended contract with fixed funding until August 2030. The university promotes a diverse and inclusive working environment.

Qualifications

  • Degree (or equivalent practical experience) in computer science or ML/AI research.
  • Good organizational skills to manage workloads of team members.

Responsibilities

  • Design and operate large, highly available supercomputing services.
  • Work closely with researchers to co-design solutions for research problems.

Skills

Domain expertise in NetOps
Domain expertise in DevOps
Organizational skills

Education

Degree in computer science or equivalent

Tools

Kubernetes
Terraform/OpenTofu

Job description

AI Supercomputing Infrastructure Engineer (up to x2 FTE)

The Bristol Centre for Supercomputing (BriCS) runs the Isambard-AI National Artificial Intelligence Research Resource, the recently announced AI Data facility and the Isambard3 Tier-2 Supercomputer. Isambard-AI is the most powerful supercomputer in the UK and amongst the most powerful in Europe.

The AI Supercomputing team owns the entire process of developing and operating the centre’s compute and software infrastructure, which includes:

  • The sourcing of hardware and system design.
  • The deployment of huge software-defined infrastructure using tools such as Kubernetes and Terraform/OpenTofu.
  • Building and operating platforms to enable researchers to conduct leading-edge research using the systems.
  • Optimising and refining software to ensure environmentally and economically efficient use.

As one of the largest Open AI Research Resources internationally, we are committed to catalysing an AI transformation in the research and development community.

In this role, you will work as part of the AI Supercomputing Team, primarily to build and operate the infrastructure and compute platforms that researchers use for their work. You do not need to be an AI or computational research domain expert to deliver world-class infrastructure, but you do need to quickly obtain a deep technical understanding of new domains. You should enjoy being self-directed and identifying the most important problems to solve as the team matures with standardized tools and processes around stability, robust service delivery and scaling.

This role supports hybrid working arrangements to provide flexibility. While the Centre's operational needs require successful candidates to start as soon as possible, we are able to accommodate individual notice periods to attract top talent.

What will you be doing?

As a member of the AI Supercomputing Team, you will:

  • Design and operate large, highly available supercomputing services managed as software-defined infrastructures and integrated as complete computational experiments.
  • Gain experience designing and operating massive-scale GPU and combined CPU/GPU workloads across these services.
  • Design and debug platforms, and work closely with researchers to co-design solutions that enable the development and operation of new algorithms and software to solve leading-edge research problems.

You should apply if you:

  • Want to help build, maintain and secure some of the largest, modern software-defined supercomputing systems.
  • Would enjoy working with world-class domain and AI researchers as your primary workload.
  • Have built small to large clusters or dabbled in building your own physical or software-defined systems, and have the motivation to scale up to something massive and nationally impactful.
  • Love building large distributed, highly available systems, and want to see them used for truly open national-scale research in a cybersecurity-compliant manner.

For our AI Supercomputing Infrastructure Engineer role focusing on storage and networking, you’ll need:

  • Domain expertise in one or more areas from NetOps and DevOps.
  • Degree (or equivalent practical experience) in computer science, computational or ML/AI research, or in a natural science with a high degree of competence in computer science or computational research.
  • Good organisational skills to manage not just your own workload but also that of less experienced members of the team.

The available job description provides a full view of the person specification.

Additional information

For any informal enquiries, please contact Emma Rose, Centre Manager - emma.rose@bristol.ac.uk.

Contract type: Open ended with fixed funding until August 2030.

Work pattern: Monday - Friday, 35 hours per week.

Grade: K

School/Unit: Bristol Centre for Supercomputing (BriCS)

This advert will close at 23:59 UK time on Sunday, 1st March.

The interview date will be confirmed shortly.

Our strategy and mission

We recently launched our strategy to 2030, tying together our mission, vision and values.

The University of Bristol aims to be a place where everyone feels able to be themselves and do their best in an inclusive working environment where all colleagues can thrive and reach their full potential. We want to attract, develop, and retain individuals with different experiences, backgrounds and perspectives - particularly people of colour, LGBT+ and disabled people - because diversity of people and ideas remains integral to our excellence as a global civic institution.
