Skip to content

Heidi Putkuri

Bioinformatician expanding my knowledge to data engineering and machine learning.

📍 Helsinki, Finland | 🔗 GitHub | 💼 LinkedIn | 💻projects

mail: heidi.putkuri@hotmail.com

Professional Profile

Bioinformatician with hands-on experience in next-generation sequencing (NGS) data analysis spanning human genetics and microbial research. Skilled in developing automated, reproducible analysis pipelines and applying machine learning to biological data. Experienced in high-performance and cloud computing environments, variant calling workflows, and large dataset management. Currently working in public health automating viral sequencing data analysis and database migration. Passionate about leveraging data science and machine learning to drive insights in molecular biology and bioinformatics.

Education

I have a BSc in Molecular Biosciences with a minor in Computer Science and an MSc in Life Science Informatics (Bioinformatics) both from the University of Helsinki. I am currently part time studying for a BSc in Data Engineering and AI at the University of Applied Sciences of Kajaani.

gantt
    title Education Timeline
    dateFormat  YYYY
    axisFormat  %Y

    section Degrees
    BSc – Molecular Biosciences + CS :2019, 2022
    MSc – Life Science Informatics       :2022, 2025
    BSc – Data Engineering & AI          :2024, 2028

Experience

Bioinformatician

Finnish Institute for Health and Welfare | July 2025 - Present

  • Automated sequencing data analysis workflows, reducing manual processing time, by developing Python-based pipeline orchestration for viral sequencing data.
  • Part of LIMS database migration affecting thousands of records and hundreds of users, creating data views using sql to ensure data integrity and consistency during the migration process.

Tools: Python · Git · Nextflow · Oracle · Docker · Linux · SQL


Master's Thesis Researcher

University of Helsinki | June 2024 - October 2024

Thesis: Classifying Bacterial Species Using Methylation Patterns in Complex Microbial Communities

I worked full-time on my master’s thesis project, which involved developing a machine learning model to classify bacterial species based on NGS methylation data. This work is part of a scientific publication (Markkanen et al., 2026). The project demonstrated my ability to independently take a bioinformatics project from raw data to a publishable method. Random Forest model achieved 94% classification accuracy. My work was part of a larger scientific publication soon to be published. Current preprint is available here. I created a reproducible analysis workflow documented in a version-controlled Python codebase that is part of the publication repository found here.

Tools: Python · Scikit-learn · Pandas · Git · Jupyter Notebook ·


Research Assistant

Folkhälsan Research Institute - Eye Genetics Group | January 2023 - December 2024

I worked part-time during the academic year and full-time during the summer. My role involved analyzing next-generation sequencing (NGS) data to help the genetics team identify genetic variants associated with eye diseases. I developed and maintained bioinformatics scripts and pipelines for processing already aligned and raw sequencing data, including quality control, alignment, and variant calling. I also performed downstream analyses such as variant annotation to make the results interpretable for the research team. Additionally, I contributed to the migration of data storage and analysis pipelines to CSC’s high-performance computing environment, ensuring that our workflows were optimized for large datasets and could be executed efficiently on the cluster.

Tools: Linux · Python · SLURM · CSC Puhti · Bash · Git · Cloud storage (AWS in CSC Allas) · Sequence alignment · Variant calling


Software Testing Intern

FISION OY | September 2021 - February 2022

I worked part time as a software testing intern at FISION, where I was responsible for testing the graphical user interface (GUI) of their software products. My role involved executing test cases, identifying and documenting bugs, and collaborating with the development team to ensure a high-quality user experience. Got to also take a look under the hood of software development and gained exposure to the full software development lifecycle.

Tools: GUI testing · documentation · Software QA

Tech Stack

Currently using VSCode for development, GitHub for version control, and Linux and macOS for operating systems with additional experience in Windows.

Programming languages

Python Java
Bash R Ruby
HTML CSS JavaScript SQL

Data & Machine Learning

Tensorflow PyTorch
scikit-learn NumPy
Pandas Tidyverse
Plotly Optuna MLflow

DevOps & Infrastructure

Docker GitLab GitHub Actions
Azure Terraform Great Expectations

Databases & Cloud

Oracle PostgreSQL MongoDB AWS S3


Download PDF