
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A forward-thinking technology company based in London is looking for a Neural Network Optimization Engineer to enhance the efficiency of neural network inference workflows. The role requires expertise in TensorRT, Triton language, and CUDA programming, along with a deep understanding of GPU architectures. The company offers a competitive salary, professional growth opportunities, and support for Skilled Worker visa sponsorship in the UK.
Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.
We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we’re just getting started.
Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.
If you’re passionate about pushing the boundaries of AI, we want you on board!
We are seeking an experienced Neural Network Optimization Engineer who will specialize in enhancing the performance, latency, and throughput of neural network inference workflows. The ideal candidate will have substantial hands‑on experience optimizing inference workloads using technologies such as TensorRT, Triton language, and model quantization techniques. You will collaborate closely with ML researchers to ensure that our machine learning models run at peak efficiency and reliability in production environments.