Name: PhD Thesis Defence: Sören Stahlschmidt
Start: 6/4/2025 1:00:00 PM
End: 6/4/2025 4:00:00 PM
Location: University of Skövde Building G, Room G110

It seems you are not using Javascript, which may cause parts of the website to not work as intended.

4 June

13:00 - 16:00

University of Skövde Building G, Room G110

Add to calendar

Sören Stahlschmidt defends his PhD thesis "Machine Learning for Predicting Cancer Endpoints from Bulk Omics Data".

Sören Stahlschmidt i Spanien

The PhD thesis defence will be held in Room G110 at the University of Skövde and will also be streamed online. Join the livestream here.

Abstract

Cancer remains one of the leading causes of death and is a major burden on patients and healthcare systems. One difficulty for finding effective treatment and matching patients to the right treatment strategy is the complexity of tumor biology. Machine learning holds the potential to learn patterns from data generated by high-throughput technologies, such as RNA-sequencing, that can elucidate the mechanisms underlying cancers and make clinically relevant predictions. In this thesis, we investigate the modeling of cancer with machine learning approaches from different molecular perspectives. First, we review the literature on the fusion of biomedical modalities with multimodal deep neural networks. In this review, we provide a descriptive overview, propose a novel taxonomy, and identify relevant research gaps. Moreover, for models to be applicable to clinical practice, they must be robust to shifts in the distribution patients are sampled from. Such shifts can stem from differences in the underlying biology or technical variation introduced during the processing of the biological material. Therefore, in two studies, we investigate domain generalization of machine learning models trained with bulk RNA-sequencing data to predict cancer survival endpoints. First, we show that deep learning-based domain generalization methods developed on non-molecular data improve robustness to distributional shifts on molecular data. We test these methods by predicting overall and recurrence free survival of breast cancer patients with subgroup shifts between source and target domains. Next, we show that relative representations of normalized count values, such as binning or ranking of expression values within a single sample, can increase domain generalization. We test these approaches in three experiments on breast, brain, and ovarian cancer. In a final study, we show that cancer stage can be predicted from circulating microRNA data with machine learning models, providing a proof of concept for this application. Overall, the work in this thesis supports making machine learning models more applicable to clinical practice by providing empirical evidence of methods improving the modeling of cancer biology. Continuing to study domain generalization of models in clinical practice and to develop methods for robustness are highlighted as future work.

Read the full thesis in DiVA

Supervisors:

Jane Synnergren, Professor, University of Skövde and University of Gothenburg
Göran Falkman, Associate Professor, University of Skövde
Benjamin Ulfenborg, Senior Lecturer, University of Skövde

Opponent:

Ole Christian Lingjærde, Professor, University of Oslo

Committee:

Francesca Buffa, Professor, Bocconi University and University of Oxford
Dirk Repsilber, Professor, University of Örebro
Marija Cvijovic, Professor, University of Gothenburg and Chalmers University of Technology

Contact

PhD Student Bioinformatics

Sören Stahlschmidt

School of Bioscience

soren.richard.stahlschmidt@his.se

0500-448029

Published: 5/8/2025

Edited: 5/8/2025

Responsible: webmaster@his.se

Search results

Search tips

Shortcuts

How can we help?

Search results

Search tips

Shortcuts

How can we help?

PhD Thesis Defence: Sören Stahlschmidt

Abstract

Supervisors:

Opponent:

Committee:

Contact

PhD Student Bioinformatics

Sören Stahlschmidt

About us

Shortcuts

About the website

About the University