Job Search and Career Advice Platform

Enable job alerts via email!

EDA Infrastructure Engineer New London

Mesh-AI Limited

Bristol

Hybrid

GBP 60,000 - 80,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading technology firm in the UK is looking for an exceptional Infrastructure Engineer to develop and maintain the foundations of silicon development. This role involves creating and supporting workflows, deploying infrastructure using IaC, and managing network services. The ideal candidate will have a strong background in EDA tooling, container orchestration, and grid computing systems. Join a collaborative and rapidly growing team dedicated to impactful AI solutions. Modern offices are located in London and Bristol.

Benefits

Collaborative culture
Opportunity for rapid career growth
Ownership over projects

Qualifications

  • Proficient in modern software development languages and infrastructure-as-code frameworks.
  • Experience in diagnosing and resolving network/storage/CPU/RAM bottlenecks.
  • Experience in deploying and managing grid compute systems.

Responsibilities

  • Create and support tooling and workflows for EDA tooling.
  • Deploy and maintain compute infrastructure using IaC.
  • Manage key network services, including VPN and central authentication.
  • Maintain a cluster compute solution for job scheduling.
  • Setup observation tooling for resource utilisation and machine failures.
  • Work with the engineering team to optimize workloads.

Skills

EDA tooling
Workload management tools (e.g., Slurm)
Container orchestration tools (Docker, Kubernetes)
Infrastructure as code (Ansible, Terraform)
Build systems tooling (Bazel)

Tools

Linux/Unix systems
Grid compute systems (Slurm, LSF, SGE)
Containerisation frameworks (Docker, Singularity)
Job description

Fractile is building silicon, systems and software which will redefine the frontier of AI: running the world’s most advanced models at radically higher speed and lower cost. We have an exceptional team across hardware and software capable of bringing about this change, and we are growing fast to meet demand and deliver our product at scale.

We are seeking an exceptional Infrastructure engineer to develop and maintain the foundations of our silicon development, supporting a wide range of computationally-intensive workloads while scaling up capacity by orders-of-magnitude. In this role you will work together with our build flow lead to provide efficient, scalable processes and compute to our front-end and back-end silicon engineering teams. You will need to work closely with engineers across the organisation to resolve bottlenecks and optimise workflows, to provide a environment that enables our team to execute at scale and with speed.

Key Responsibilities
  • Create and support tooling and workflows centred around EDA tooling, which will require coding and build-system knowledge to assist with tasks faced by different teams.
  • Deploy, and maintain compute infrastructure (in either the cloud or on-premise) using an infrastructure-as-code (IaC) framework (Ansible/Terraform).
  • Manage key network services such as a VPN, central authentication (LDAP), file/object storage, and license servers.
  • Maintain a cluster compute solution, capable of scheduling a wide array of types of job with large resource requirements.
  • Setup and monitor observation tooling for resource utilisation, machine failures, and more (e.g. Prometheus/Zabbix).
  • Work with the engineering team to build and optimise their workloads
It would be great if you have
  • Experience of EDA tooling
  • Experience working with workload management tools, such as Slurm.
  • Experience working with container orchestration tools, such as Docker, and Kubernetes.
  • Experience working with infrastructure as code, such as Ansible, or Terraform.
  • Experience working with build systems tooling, such as Bazel.
Preferred Qualifications
  • Proficient in modern software development language(s) and infrastructure-as-code frameworks.
  • Proficient in the use and administration of Linux/Unix systems, and ideally management of shared compute environments.
  • Past experience with diagnosing and resolving network/storage/CPU/RAM bottlenecks across complex workloads.
  • Experience deploying and managing a grid compute system (Slurm/LSF/SGE).
  • Proficiency with containerisation frameworks (Docker/Singularity).
How we work
  • Ownership and execution: you will have full agency to drive your work forward
  • Rapid iteration: we all work directly with top leadership to move from idea to hardware on ambitious timelines
  • Full-stack engagement: hardware, software, silicon, and modelling teams all work closely together to create a product with generational impact
  • Optimistic and pragmatic: we possess the will to win, and to do the hard work to get us there
  • Team player mentality: the mission is bigger than any of us, and we have the curiosity and technical focus to see the best idea shipped, no matter who’s it is
About us
  • Founded in 2022, team of 70+ which is expanding rapidly
  • Modern, open offices in London and Bristol
  • Collaborative, problem-solving culture built on deep curiosity, entrepreneurial initiative and technical fluency
Export control and security clearance

Certain roles may involve working on technologies subject to export restrictions. Applicants may be required to undergo additional eligibility checks to ensure compliance with applicable law.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.