Job Search and Career Advice Platform

Enable job alerts via email!

TensorRT/CUDA Inference Engineer — UK Visa Sponsorship

Methodfi

City of London

On-site

GBP 60,000 - 80,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A forward-thinking technology company based in London is looking for a Neural Network Optimization Engineer to enhance the efficiency of neural network inference workflows. The role requires expertise in TensorRT, Triton language, and CUDA programming, along with a deep understanding of GPU architectures. The company offers a competitive salary, professional growth opportunities, and support for Skilled Worker visa sponsorship in the UK.

Benefits

Competitive salary
Skilled Worker visa sponsorship
Opportunities for professional growth
Collaborative work environment
Impactful projects

Qualifications

  • Proven professional experience optimizing neural network inference workloads.
  • Strong expertise with TensorRT and Triton language.
  • Experience with neural network quantization techniques.

Responsibilities

  • Optimize neural network models for inference performance.
  • Implement model quantization methods for better efficiency.
  • Benchmark and analyze performance on targeted hardware.

Skills

TensorRT
Triton language
CUDA programming
Python
PyTorch
neural network inference optimization

Tools

CUDA Toolkit
Quantization tools
Job description
A forward-thinking technology company based in London is looking for a Neural Network Optimization Engineer to enhance the efficiency of neural network inference workflows. The role requires expertise in TensorRT, Triton language, and CUDA programming, along with a deep understanding of GPU architectures. The company offers a competitive salary, professional growth opportunities, and support for Skilled Worker visa sponsorship in the UK.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.