Machine Learning Engineer (LLMs)

AgileRL Ltd
Feb 24, 2026

Full time I.T. & Communications

Job Description

Machine Learning Engineer (Large Language Models)

We are seeking a talented and experienced Machine Learning Engineer to join our team. This engineer will contribute to the further development of Arena, a web-based software platform for LLM- and RLOps, and our open source reinforcement learning library.

As a Machine Learning Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure, tools, and services that enable businesses to build and deploy reinforcement learning models efficiently and effectively.

Responsibilities

Collaborate with the team to understand requirements and design new features of the Arena platform and open source framework.
Develop scalable and reliable infrastructure to support LLM training, reinforcement fine tuning, model deployment, and management.
Integrate existing machine learning frameworks and libraries into the platform and open source framework, providing a range of algorithms, environments, and tools for reinforcement learning model development.
Stay up to date with the latest advancements in AI, MLOps, reinforcement learning algorithms, tools, and techniques, and incorporate them into the platform as appropriate.
Provide technical guidance and support to internal users and external customers using the Arena platform and open source framework.

Requirements

Master's or Ph.D. degree in Computer Science, Engineering, or a related field, or 3+ years of relevant industry experience.
Solid understanding of LLM training, reinforcement learning algorithms and concepts, with hands on experience in building and training AI models.
Strong programming skills, with experience using ML frameworks and libraries (e.g. PyTorch, TensorFlow, Ray, Gym, TRL, DeepSpeed, VLLM), and MLOps tools.
Experience in building machine learning platforms or tooling for industrial or enterprise settings.
Proficiency in data management techniques, including storage, retrieval, and pre processing of large scale datasets.
Familiarity with model deployment and management, including the development of APIs, deployment pipelines, and performance optimisation.
Experience in designing and developing cloud based infrastructure for distributed computing and scalable data processing.
Deep understanding of software engineering and machine learning principles and best practices.
Strong problem solving and communication skills, and the ability to work independently as well as in a team environment.

Compensation

Competitive salary + significant stock options.
30 days of holiday, plus bank holidays, per year.
Flexible working from home and 6 month remote working policies.
Enhanced parental leave.
Learning budget of £500 per calendar year for books, training courses and conferences.
Company pension scheme.
Regular team socials and quarterly all company parties.

Join the fast-growing AgileRL team and play a key role in the development of cutting edge reinforcement learning tooling and infrastructure.

Full name Email address LinkedIn Country of residence When can you start? Note: for the following longer form questions we have received an overwhelming number of applications with answers that are AI generated. Any application that uses AI generated answers will not be considered.

What motivates you to apply to this role, and what are you looking forward to in contributing towards the AgileRL mission? (200 words max) What are 3 reinforcement learning capabilities or algorithmic improvements you believe would be most valuable to add to the Arena platform or AgileRL open source framework, and what challenges would they help users solve? (200 words max) What unique experience do you have with developing, implementing, or researching reinforcement learning algorithms and systems that makes you the ideal candidate for this role? (200 words max) Upload your CV Upload File Max file size 10MB. I agree to the Privacy Policy

Build, tune and deploy RLmodels at lightning speed.

Machine Learning Engineer (LLMs)

Job Description

Modal Window