Job Search and Career Advice Platform

Enable job alerts via email!

Research Engineer - Large Language Models

Methodfi

Remote

GBP 60,000 - GBP 80,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A cutting-edge AI company is seeking a Research Engineer focusing on large language models. This role involves experimenting with novel architectures, optimizing AI models, and collaborating with the engineering team to deploy updates. Ideal candidates should possess a strong background in deep learning and have an advanced degree in relevant fields. The position is remote in the UK, requiring occasional trips to the Silicon Valley office.

Qualifications

  • Advanced degree in Computer Science, AI, or Machine Learning is preferred.
  • Demonstrated ability in independent research is a plus.
  • Experience in large-scale deep learning model training is advantageous.

Responsibilities

  • Experiment with language model architectures to drive the research roadmap.
  • Optimize multimodal models for quality and performance.
  • Architect data processing pipelines for training data quality.

Skills

Proficiency in deep learning frameworks
Experience with AI product development
Independent research capability

Education

Master's or PhD in Computer Science, AI, or related field

Tools

PyTorch
JAX
TensorFlow
Job description
Research Engineer- Large Language Models

Full-time | Remote (UK) with trips to Silicon Valley office | Reports to Founders

Introduction
  • Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.

  • Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb

  • Fastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.

What You’ll Work On
  • Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap

  • Optimize Fastino’s multimodal models to improve response quality, instruction adherence, and overall performance metrics

  • Architect data processing pipelines, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories

  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards

  • Build robust and real-world motivated evaluations

  • Partner with Fastino engineering team to ship model updates directly to customers

  • Establish best practices for code health and documentation on the team, to facilitate collaboration and reliable development

What We’re Looking For
  • Required - Great velocity for building and shipping agents / AI products.

  • Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies

  • Optional - Demonstrated ability to do independent research in Academic or Industry settings

  • Optional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures

  • Optional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Machine Learning Research Engineer (Foundational Research)

Refinitiv

City of London
On-site
GBP 70,000 - 90,000
Full time
30+ days ago
Remote Research Engineer — LLMs & Multimodal AI

Methodfi

Cambridge
Remote
GBP 60,000 - 80,000
Full time
30+ days ago
Senior Machine Learning Research Engineer - Speech

microTECH Global Limited

Greater London
On-site
GBP 70,000 - 90,000
Full time
30+ days ago
Research Engineer, Pretraining Scaling (London) London, UK

Applied Intuition Inc.

London
On-site
GBP 60,000 - 80,000
Full time
30+ days ago
Member of Technical Staff - ML Engineering

the Homebase

Greater London
Hybrid
GBP 60,000 - 80,000
Full time
30+ days ago
Senior ML Runtime Engineer Bristol

Mesh-AI Limited

Bristol
On-site
GBP 70,000 - 90,000
Full time
30+ days ago
Research Scientist - Diffusion

Methodfi

City of London
Hybrid
GBP 70,000 - 90,000
Full time
30+ days ago
Research Scientist – LTX Model Quality

Popular Pays, Inc.

Greater London
On-site
GBP 80,000 - 100,000
Full time
30+ days ago
AI Research Engineer - Pre training (100% Remote)

Tether

Greater London
Remote
GBP 70,000 - 90,000
Full time
30+ days ago
Applied AI Engineer London

ModelML Inc.

City of London
On-site
GBP 70,000 - 90,000
Full time
30+ days ago