Machine Learning in Life Science

The Center for Basic Machine Learning in Life Science (MLLS) was established on January 21, 2021 with the generous support of the Novo Nordisk Foundation. We bring together leading machine learning research groups in Denmark to establish a solid foundation for future data science progress in the life sciences.

Artificial intelligence and data science are rapidly changing how science is being conducted. Researchers increasingly rely on a data-centric view, where experimental data is used to discover patterns in data and phrase new scientific hypotheses.

To extract information from data, modern data science techniques often “convert” the raw sensory measurements into abstract data representations, but these representations are often ill-understood and difficult to interpret, which hinders their use for phrasing robust hypotheses. This is particularly true in the life sciences, where data typically is “noisy” and incomplete.

To provide solutions to these problems, we conduct basic research in machine learning that is motivated and informed by fundamental problems in biology and biomedicine.

Our Mission

We conduct the basic machine learning research needed to estimate representations of biomedical data that are

Robust
Interpretable
Data efficient
Reflective of inherent data uncertainty
Able to leverage existing knowledge

These representations are both predictive and knowledge discovery tasks.

Research

Our research focuses on four themes, and each theme advances different aspects of representation learning for life science and support each other:

Meaningful representation of data and computational and mathematical tools development to realize the answer.
Geometric constructions to incorporate existing knowledge into representations and ensure that the result is understandable by humans.
Representation of data often appearing within life science, such as trees, graphs, and sequences.
Inclusion of real data that is “noisy” and investigation of how associated uncertainty is best encoded.

MLLS Team

Professor Ole Winther, PI

Google Scholar | Personal Research Page

University of Copenhagen and Technical University of Denmark

Hierarchical generative models, approximative inference, NLP, gene regulation and biological sequence analysis and foundation models

Professor Anders Krogh, co-PI

Google Scholar | Personal Research Page

University of Copenhagen

Representation learning for gene expression data and DNA sequences. Machine learning in bioinformatics.

Professor Wouter Boomsma, co-PI

Google Scholar | Personal Research Page

University of Copenhagen

Sequence modelling, protein representation learning, Bayesian inference (Markov chain Monte Carlo).

Professor Aasa Feragen, co-PI

Google Scholar | Personal Research Page

Technical University of Denmark

Geometric modeling, machine learning for biomedical imaging, structured data (trees, networks, …)

Associate Professor Jes Frellsen, co-PI

Google Scholar | Personal Research Page

Technical University of Denmark

Deep generative models, missing values, Bayesian modeling, approximate inference.

Professor Søren Hauberg, co-PI

Google Scholar | Personal Research Page

Technical University of Denmark

Random geometry, uncertainty quantification, deep generative models.

Advisory Board

Oliver Stegle, PhD, EMBL-EBI
Deborah Marks, PhD, Harvard University
Corinna Cortes, PhD, Google Research

Coordination

Lisbeth Borbye, PhD, University of Copenhagen

Research Achievements

Coming soon...

MLLS Seminars

Seminar speaker	Topic	Year
Stefano Sarao Minnelli	AI Bias dynamics	2025
Fredrik Johansson	Interpretable prediction with missing values	2025
Filip Tronarp	Recursive Variational State Estimation: The Dynamic Programming Approach	2025
Line Sandvad Nielsen	NetStart 2.0: Prediction of Eukaryotic Translation Initiation Sites Using a Protein Language Model	2025
Peter Mørch Groth	Kermut: Composite kernel regression for protein variant effects.	2025
Hrittik Roy	Reparameterization invariance in approximate Bayesian inference.	2025
Melih Kandemir	Embodied Estimators for Full Autonomy.	2025
Louis Ohl	A tutorial on discriminative clustering and mutual information.	2025
Beatrix M. G. Nielsen	Challenges in explaining representational similarity.	2025
Søren Wengel Mogensen	Causal discovery and weak equivalence of graphs.	2025
Eli N. Weinstein	Causal Molecular Design.	2025
Johnny Xi	Causal Velocity Models: Counterfactual Transports via Score Estimation.	2025
Sussane Ditlevsen	Warning of a collapse of the Atlantic overturning circulation.	2024
Wessel Bruinsma	Autoregressive Conditional Neural Processes.	2024
Oliver Stegle	Enhancing human genetics with new computational tools and single-cell sequencing.	2024
Gabriel Niels Damsholt	Uncertainty Estimation for DNNs via SDEs.	2024
Lasse Blaabjerg	Protein variant effect prediction using ML.	2024
Sindy Loewe	Rotating Features for Object Discovery.	2024
Berian James	Scalable physical inference with hypernetworks.	2024
Matthias Bauer	High-performance low-complexity neural compression.	2024
Carl Hvarfner	Vanilla Bayesian Optimization in high dimensions.	2024
Frederikke Isa Marin & Felix Teufel	BEND: Benchmarking DNA Language Models.	2024
Joris Fournel	Medical image segmentation quality control.	2024
Michaela Areti Zervou	Protein Sequence Classification and Generation.	2024
Marcelo Hartmann	Warped Geometric Information in optimization.	2024
Joakim Edin	XAI, medical coding, and EHRs.	2024
Filip Tronarp	Robust Cholesky Discretization of Gauss-Markov models.	2024
Ira Ktena	Promises and perils of AI innovation in healthcare.	2024
Karthik Bharath	Rolled Gaussian process models for curves on manifolds.	2024
Christian Igel	Bayesian vs. PAC-Bayesian DNN Ensembles.	2024
Henry Moss	Return of the Latent Space Cowboys: rethinking VAEs in Bayesian Optimisation.	2024
Kristoffer Wickstrøm	Uncertainty estimation in representation learning explainability.	2024
Yogesh Verma	Modular Flows: Differential Molecular Generation.	2023
Andrew White	Explaining molecular properties with natural language. / Deep learning for molecular design with few data points.	2023
Marloes Arts	Diffusion Models and Force Fields for Coarse-Grained MD.	2023
Stefan Sommer	Stochastic morphometry and sampling of conditioned stochastic processes.	2023
Rocío Mercado	Deep generative models for biomolecular engineering.	2023
Ignacio Peis	Missing Data Imputation and HyperGenerators.	2023
Ola Rønning	Probabilistic mixture model approximation with Stein mixtures.	2023
Martin Jørgensen	Bézier Gaussian Processes.	2023
Pierre-Alexandre Mattei	Are ensembles getting better all the time?	2023
Frederikke Marin	I Can’t Believe It’s Not Better.	2023
Søren Hauberg	All Layers Marginal Likelihood Training with Fully Correlated Linearized Laplace Approximations.	2023
Mingyu Kim	Enhancing Neural Radiance Fields with Regularization.	2023
Antoine Wehenkel	Simulation-based Inference for Cardiovascular Models.	2023
Cong Geng	Bounds all around: training energy-based models with bidirectional bounds.	2022
Simon Bartels	How much data do we need?	2022
Viktoria Schuster	The deep generative decoder: a minimum viable model.	2022
Simon Olsson	Machine Learning for Molecular Dynamics of Proteins.	2022
Damien Garreau	What does LIME really see in images?	2022
Ragnhild Ørbæk Laursen	NMF for somatic mutations in cancer genomics.	2022
Agustinus Kristiadi	Low-Cost Bayesian Methods for Fixing Overconfidence.	2022
Jes Frellsen	How to deal with missing data in supervised deep learning?	2022
Ole Winther	DeepTMHMM: Deep Learning for Transmembrane Topology Prediction.	2022
Anders Krogh	Scaling issues in maximum likelihood estimation.	2022
Beau Coker	Wide Mean-Field Variational BNNs Ignore the Data.	2022
Nicholas Kramer	A probabilistic perspective on numerical solution of differential equations.	2022
Pascal Notin	Disease variant prediction with deep generative models.	2022
Nikolaj Thams	Robustness to dataset shift.	2022
Siavash Bigdeli	Deep Statistical Image Modeling.	2022
Mark van der Wilk	Meaningful Metrics for Probabilistic Predictions.	2022
Marco Miani	Laplacian Autoencoders for Stochastic Representations.	2022
Theo Karaletsos	Black-box coreset variational inference.	2022
Raghavendra Selvan	On the Carbon Footprint of Deep Learning.	2022
Sebastian Weichwald	Causal Models on the Brink.	2022
Victor García Satorras	Equivariant Diffusion for Molecule Generation in 3D.	2022
Jean Feydy	Modelling protein surfaces	2021
Gustav Lindved	Predicting protein thermostability using language models.	2021
Jakob Havtorn	Out-of-distribution testing for Hierarchical VAE.	2021
Felix Teufel	Improved signal peptide prediction using protein language models.	2021
Niels Bruun Ipsen	Not-MIWAE: Deep Generative Modelling with Missing not at Random Data.	2021
Pola Schwöbel	Last Layer Marginal Likelihood for Invariance Learning.	2021
Didrik Nielsen	SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows.	2021
Kasra Arnavaz	Semi-supervised, Topology-Aware Segmentation of Tubular Structures from Live Imaging 3D Microscopy.	2021
Roshan Rao	MSA Transformers for protein sequences.	2021
Lars Kai Hansen	Values in AI.	2021
Jonas Busk	Calibrated Uncertainty for Molecular Property Prediction.	2021
Frederik Warburg	Bayesian Triplet loss.	2021
Veronika Cheplygina	How our publication traditions hinder efficient discovery.	2021
Eli Weinstein	Latent alignment as a replacement for multiple sequence alignment.	2021
Giorgio Giannone	Few-Shot Generative Models.	2021
Eli Weinstein	A structured observation distribution for biological sequence prediction.	2021
Arnor Sigurdsson	Deep integrative models for large-scale human genomics.	2021
Pablo Moreno-Muñoz	Gaussian processes.	2021
Chris Sander	Machine learning for hard biological problems — three examples.	2021
Kresten Lindorff-Larsen	Perspective on Alphafold 2 and remaining problems.	2021
Albert Jelke Kooistra	Understanding drug selectivity and individual responses.	2021
Kristoffer Stensbo-Smidt	Flow-transformed Gaussian processes.	2021
Eike Petersen	Responsible and Regulatory Conform ML for Medicine.	2021
Guan Wang	Graph2Graph Learning with Conditional Autoregressive Models.	2021
Enzo Ferrante	Gender bias in X-ray classifiers.	2021
Germans Savcisens	Life2Vec: Transformers for behavior representation.	2021
George Papamakarios	Normalizing flows for atomic solids.	2021
Markus Heinonen	Low-rank Bayesian neural networks.	2021

Events

EurIPS Copenhagen 2025 Conference

03.12.2025 – 05.12.2025

EurIPS is a community-driven conference that makes world-class AI research accessible to Europe by showcasing papers accepted at NeurIPS, the most prestigious AI conference globally. By bringing together researchers, industry leaders, and innovators, EurIPS creates a unique networking hub that connects academic breakthroughs with real-world applications.

GenU Conference 2025

17.09.2025 – 18.09.2025

Generative models and uncertainty quantification lie at the heart of Bayesian modelling and inference. At this small meeting, we discuss recent developments within the field. The meeting is deliberately kept small in order to ensure that discussion remains honest, lively and interesting.

MLLS PI Research Retreat

25.08.2025

MLLS PI gathering to discuss future research.

MLLS Retreat 2025

15.06.2025 – 18.06.2025

Annual retreat at Tjärö for MLLS professors and students with hackathons and social activities.

GenU Conference 2024

18.09.2024 – 19.09.2024

MLLS Retreat 2024

25.04.2024 – 28.04.2024

Annual retreat at Isaberg for MLLS professors and students with hackathons and social activities.

GenLife Conference 2024

15.04.2024 – 16.04.2024

This conference is an attempt to highlight some of the important current developments in the interface between generative models and life science—with a particular focus on the area of biomolecular modelling and relevant machine learning tools.

GenU Conference 2023

20.09.2023 – 21.09.2023

MLLS Retreat 2023

18.04.2023 – 19.04.2023

Annual retreat at Sonnerupgaard for MLLS professors and students with hackathons and social activities.

MLLS PI Vision Workshop

01.12.2022

MLLS PI gathering to discuss future research.

GenU Conference 2022

14.09.2022 – 15.09.2022

MLLS Opening Symposium: The Royal Danish Academy of Sciences and Letters

15.08.2022 – 16.08.2022

Formal celebration of the start of MLLS.

AI Meets Life Science and Biotech Symposium

21.06.2022

The symposium will explore modern AI and machine learning as drivers for scientific discovery and far-reaching applications within life science, health science and biotech. Crucial in this transformation is the ability to extract meaningful associations and causalities from messy real-world data – a paradigm shift from more traditional data analysis. In this workshop we take a snapshot look at how modern machine learning and AI currently transform areas like medical research and clinical practice.

MLLS Retreat 2022

19.04.2022 – 20.04.2022

Annual retreat at Sonnerupgaard for MLLS professors and students with hackathons and social activities.

NeurIPS 2021 Meetup

07.12.2021 – 11.12.2021

This NeurIPS 2021 meetup takes place in Copenhagen aiming to act as the central meetup for this area. The meetup will feature a collection of focused reading groups. PhD students within ELLIS Copenhagen may get ECTS credit for joining these reading groups.

GenU Conference 2021

12.10.2021 – 13.10.2021