Enable job alerts via email!

Neural Network Optimization Engineer

Methodfi

City of London

On-site

GBP 60,000 - 80,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking technology company based in London is looking for a Neural Network Optimization Engineer to enhance the efficiency of neural network inference workflows. The role requires expertise in TensorRT, Triton language, and CUDA programming, along with a deep understanding of GPU architectures. The company offers a competitive salary, professional growth opportunities, and support for Skilled Worker visa sponsorship in the UK.

Benefits

Competitive salary

Skilled Worker visa sponsorship

Opportunities for professional growth

Collaborative work environment

Impactful projects

Qualifications

Proven professional experience optimizing neural network inference workloads.
Strong expertise with TensorRT and Triton language.
Experience with neural network quantization techniques.

Responsibilities

Optimize neural network models for inference performance.
Implement model quantization methods for better efficiency.
Benchmark and analyze performance on targeted hardware.

Skills

TensorRT

Triton language

CUDA programming

Python

PyTorch

neural network inference optimization

Tools

CUDA Toolkit

Quantization tools

About Us

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.

We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we’re just getting started.

Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.

If you’re passionate about pushing the boundaries of AI, we want you on board!

Job Description

We are seeking an experienced Neural Network Optimization Engineer who will specialize in enhancing the performance, latency, and throughput of neural network inference workflows. The ideal candidate will have substantial hands‑on experience optimizing inference workloads using technologies such as TensorRT, Triton language, and model quantization techniques. You will collaborate closely with ML researchers to ensure that our machine learning models run at peak efficiency and reliability in production environments.

Key Responsibilities

Optimize neural network models for inference performance and latency reduction
Implement model quantization methods (e.g., INT8, FP8) to maximize computational efficiency.
Benchmark, analyze, and improve inference performance on targeted hardware platforms.
Collaborate with the ML researchers to deploy optimized models in production environments.
Stay updated with the latest developments in model optimization, inference engines, quantization methods, and related technologies.

Requirements

Proven professional experience optimizing neural network inference workloads.
Strong expertise with TensorRT, Triton language, CUDA programming.
Experience with neural network quantization techniques.
Proficiency in Python and PyTorch.
Deep understanding of GPU architectures and performance optimization.
Excellent problem‑solving skills and ability to analyze performance bottlenecks.

What We Offer

Competitive salary.
We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.
Opportunities for professional growth and development.
A collaborative and user‑focused work environment.
The chance to shape the future of AI‑powered creativity through research.
Exciting projects where your insights will directly impact product development.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top cities

Top companies

Popular jobs