🍁 SearchCanadaJobs.com

Software Engineer, RL Data

Company: Jobleads-UK

Location: Greater London, England

Type: Full-time

About the role


This is a senior, foundational role on a new team: you’ll make architecture decisions that the rest of the team builds on and help shape what we build first. The work is hands‑on and varied. Some weeks you’ll be deep in pipeline or infrastructure engineering; others you’ll be tuning prompts until the output is good or sitting with a research team that depends on your systems and shipping the fixes they need. We are looking for experienced engineers who own outcomes end‑to‑end — down to reading transcripts, supporting users, and wrangling vendors.


The company's RL Data team builds the systems that produce high‑quality reinforcement learning data for Claude: data collection pipelines, human feedback tooling, the execution environments RL tasks run in, and the quality assurance that keeps training data trustworthy at scale. Our goal is to make Claude great at real work — especially the work that matters most, like AI safety research and benefic...
