Possibilities for student projects (BSc, MSc, Projektarbeit, lab rotation, ...)
We mainly offer topics from the area of algorithmic bioinformatics. These can range from more theoretical algorithmic questions to workflow development for specific bioinformatics applications.
Please feel free to propose and discuss your own topic with us.
You find most referenced student theses here: https://www.cs.hhu.de/lehrstuehle-und-arbeitsgruppen/algorithmische-bioinformatik/lehre-und-abschlussarbeiten/abschlussarbeiten/abgeschlossene-arbeiten. Otherwise ask.
BSc level
- develop HOG-based ILP to compute shortest common superstring of given k-mer set and also minimize the run length encoding of the mask. See https://www.biorxiv.org/content/10.1101/2023.02.01.526717v1.abstract Problem: not all SCSs possible with HOG? miss important ones? Contact: Gunnar, Sara?
- Using SAT to model first species counterpoint and compare to ILP implementation (Tanaka) Contact: Philipp
- Integrate/Implement CP ILP Formulations as a stand-alone application that processes MIDI or as a plugin for the scorewrite software MuseScore (Tanaka) Contact: Philipp
- Paper Dominik Heider https://www.sciencedirect.com/science/article/pii/S0010482524002695 do that with LOBICO (categorical classification, not binary)
- Maximizing diversity for anticlustering (while keeping optimal dispersion). Keywords: ILP, R, C++, maybe preprocessing, maybe optimization with constraint programming. Contact: Khoa
- “Continuous logical” formulas for regression to predict drug responses. Keywords: ILP, Python. Contact: Khoa
- 3D-time-dose-response surface fitting and extrapolation, with focus on different time-response models. Keywords: Integral, Gompertz-Laird/exponential functions. Contact: My Ky
- Implementierung und Vergleichen von Metaheuristiken für Maximum Diversity bei Anticlustering https://doi.org/10.1016/j.ejor.2022.02.003. Contact: Nguyen Khoa Tran
MSc level
- Vehicle routing problem for leaf optimization. Get Martin's code to run. Produce better pictures for Daan. Make modifications to speed up. Check if the model is good enough, adapt, … Contact: Gunnar
- Data from Olga. compare molecular graphs. Contact: Gunnar
- Further stuff Nan (new stuff on image analysis)
Other topics / No fixed level
- Benchmarking and Testing of Clustering-based, Manual, and ML-based Cell Type Prediction in High-Plex Imaging Data (Bioinformatic M.Sc. project)
This project aims to evaluate different approaches for cell type prediction in high-plex imaging data, including clustering-based methods, manual annotation, and machine learning (ML) models. The study will compare the accuracy, reproducibility, and computational efficiency of these techniques across diverse datasets. The findings will provide insights into the optimal strategies for cell identification in spatial omics studies, supporting more robust downstream biological interpretations.
- Application and Optimisation of Automated Cell Type Prediction Tools for Distinct Hematologic and Solid Malignancies to Identify Distinct Changes in Patient Subgroups (Quantitative Biology M.Sc. project)
This project focuses on optimizing automated cell type prediction tools for specific hematologic and solid malignancies in multiplexed immunofluorescence imaging. By applying advanced computational methods taking into account the known composition and characterisation of the disease entity, the study will identify unique cellular alterations within specific patient subgroups, potentially revealing novel biomarkers or therapeutic targets. The work involves testing and refining prediction models to improve their accuracy in detecting disease-specific cellular patterns, contributing to precision oncology research. While this project focuses on the computational side of multiplexed immunofluorescence, interested students can also get hands-on experience in data generation and wet-lab techniques.
- Efficient heuristic or exact algorithms for circular sequence comparison in sub-quadratic time: Aligning bacterial genomes vs one another or reads of a circular genome vs a circular genome
- SWGTS-Expansions: 1.) Add encrypted package transfer and key sharing for increased throughput/speed, 2.) Alternative Filtering Methods e.g. k-mer based filtering 3.) Metadata Managment System 4.) Division of long reads into segments ; Skills: Docker-Compose, REST, Encryption, Client-Server Architecture
- An edge-based ILP for modeling leaf venation patterns (builds on BSc thesis Mario Surlemont). Contact: Gunnar Klau
- SAT Formulation for MERIDA https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btab546/6342406. Contact: Nguyen Khoa Tran
- ILP for Matrix Reordering Problem https://www.vldb.org/conf/2004/RS1P1.PDF. Contact: Nguyen Khoa Tran
- Implement an algorithm to find active modules (average Heinz) by maximizing a fractional objective function https://www.biorxiv.org/content/10.1101/2021.05.11.443638v2 Contact: Gunnar Klau
- Spa-Typing and Flight Data: Collect Flight Data and Samples from Spa Database and look for Correlations (Check if a high volume of flights correlates to similar detected Spa-Types / Develop a framework to check correlation between flight volume and genetic information), Build on Ninas work, Contact: Philipp
- Backport Sven's 0-Edge-CE-Heuristic into Yoshiko Main (and thus Cytoscape App), Contact: Sven, (Philipp?)
- Projektarbeit, very unclear outcome: https://newatlas.com/massive-attack-dna-spray-paint/56881/ Decode Sequence Massive Attack