Snr Data Scientist, Computational Genomics/DNA Modelling

LondonFullTimePosted Jul 3, 2026
Apply

About Relation

Relation is a sector defining TechBio company developing transformational medicines, with technology at our core. Our ambition is to understand human biology in unprecedented ways, discovering therapies to treat some of life’s most devastating diseases. We leverage single-cell multi-omics from patient tissue, functional assays, and machine learning to drive disease understanding, from cause to cure.

We are scaling rapidly and building a team of exceptional individuals to push the boundaries of drug discovery. You will work in highly interdisciplinary teams where biology, computation, and engineering come together to solve complex problems that have not been solved before. Our state-of-the-art wet and dry labs in the heart of London are designed to accelerate this integration and translate insight into impact.

We are committed to building diverse and inclusive teams. Relation is an equal opportunities employer and does not discriminate on the basis of gender, sexual orientation, marital or civil partnership status, gender reassignment, race, colour, nationality, ethnic or national origin, religion or belief, disability, or age.

By joining Relation, you will help define how medicines are discovered and deliver meaningful impact for patients.

The opportunity

This is a unique opportunity for a senior data scientist to work on DNA sequence modelling, statistical machine learning, and functional genomics in support of target discovery and mechanism elucidation. You will develop and apply computational models that connect genetic variation and regulatory DNA sequence to downstream molecular phenotypes, cellular programmes, and disease biology. Embedded within the Rosalind team you will build and evaluate models that link sequence to function - including variant effect prediction,regulatory activity modelling and representation learning for genomic data.

Day to Day you will:

  • Develop and implement machine learning models for DNA sequence, regulatory elements, and genetic variation in disease-relevant contexts.

  • Build, evaluate and benchmark models for sequence-to-function tasks such as variant effect prediction, regulatory activity prediction, and the interpretation of non-coding disease signals.

  • Integrate sequence-derived representations with transcriptomic, epigenomic, and perturbational datasets to uncover disease mechanisms and support target prioritisation.

  • Partner closely with experimental and machine learning researchers to validate hypotheses, interpret results, and guide downstream studies.

  • Communicate findings clearly to internal stakeholders, including presenting methods, results, and recommendations.

  • Contribute to publications, scientific communications, and project documentation, supporting scientific excellence and external visibility.

Professionally, you will have

  • PhD in computational biology, machine learning, statistics, genomics, bioinformatics, or a related quantitative discipline.

  • Post-PhD experience, ideally including time in an industry, biotech, or pharmaceutical environment.

  • Experience with DNA language models, genomic foundation models, or transformer-based sequence models.

  • Understanding of statistical machine learning and probabilistic modelling, with experience selecting and evaluating appropriate modelling approaches for complex biological datasets.

  • Knowledge of statistical genetics methods (e.g fine-mapping, colocalisation,, or variant to gene approaches).

  • High proficiency in Python (preferred) and R, with experience working in high-performance computing environments.

  • Ability to operate independently, driving projects from concept through delivery.

  • Bonus experience:

    • Familiarity with single-cell transcriptomics or patient-derived datasets.

    • Experience working in interdisciplinary teams within biotech or pharma settings.

    • Experience integrating machine learning models with genomics, single-cell, or perturbation datasets.

    • Familiarity with 3D genome / chromatin interaction data where relevant (Hi-C, Capture-C, etc.).

Personally, you:

  • Are comfortable working in a matrixed environment, balancing multiple stakeholders and contributing effectively across teams.

  • Take ownership of your work, proactively seek opportunities to contribute, and enable others to do their best work.

  • Communicate openly and directly, give and receive feedback constructively, and handle challenging conversations with respect.

  • Actively seek out diverse perspectives, build strong working relationships, and contribute to shared goals across teams.

  • Embrace challenges with openness and resilience, set high standards for yourself, and strive to deliver meaningful outcomes.

Working Style & Culture at Relation

At Relation, we operate in a matrixed, interdisciplinary environment, where impact is driven through collaboration across scientific, technical, and operational domains. We collaborate, and you will partner with colleagues across multiple teams and projects, contributing your expertise while aligning to shared company priorities. We work together and win together!

The patient is waiting!

RECRUITMENT AGENCIES: Please note that Relation does not accept unsolicited resumes from agencies. Resumes should not be forwarded to our job aliases or employees. Relation will not be liable for any fees associated with unsolicited CVs.

Relation is a committed equal opportunities employer.

Want jobs like this matched to you?

Swoopd scores fresh postings against your résumé so you only see the matches that matter.

Get started free