DropCode is building the data engine for protein function. Starting with enzymes, we use our patented droplet microfluidics platform to capture exponentially more data on protein function than conventional methods, linking genotype to phenotype at per-droplet resolution, making every droplet a micro test tube. This data fuels machine learning models that learn in ever greater detail how sequence determines function. Our wedge is enzyme engineering for biocatalysis and industrial biotechnology, but our ambition is to make DropCode the definitive platform for protein function prediction.
We are Cambridge PhDs with deep expertise across microfluidics, biochemistry, machine learning, optics, and engineering. We believe the language of biology is machine learning, and that the fastest path to transformative models is not just better AI, it is better inputs.
The RoleWe are looking for an exceptional computational scientist to lead our machine learning and protein modelling efforts. You will own the sequence-function modelling stack end to end: from processing large-scale functional datasets generated in our microfluidic runs, to training and deploying generative and predictive models that drive the next round of experiments. You will work in a tight loop with the biology and engineering teams, turning quantitative phenotypic data into closed-loop active learning systems that continuously improve our models.
This is a foundational role. You will be building the ML infrastructure from the ground up, and your architectural choices will shape DropCode for years.
What You'll DoYou are frustrated by the slow, artisanal nature of current biological engineering and believe the field needs a step change in data scale and quality. You think quantitatively, treat every experiment as a data point for a model, and have strong opinions about what it takes to build the best protein design systems in the world. You thrive in collaborative, fast moving environments where the pace is set by scientific urgency, not process.