Day 1: Foundations of Health Data Analysis
Fundamentals of healthcare data
Intro to Python and health data analysis
Data analysis using dataframes
Data cleaning, grouping, transforming
Merging multiple datasets
Handling date and time data types
Day 2: Health Data Visualisation
Overview of health data sources
Visualisation theory
Intro to data visualisation in Python
Visual exploration of data
Visualising patient-group differences
Geographical visualisation, maps
Overview of plot types
Publication-ready plots
Day 3: Epidemiology and Biostatistics
Age standardisation
Intro to statistical epidemiology
Foundations of medical statistics
Hypothesis testing
Family-wise error, p-value adjustment
Logistic regression
Overview of data types
Clinical trials vs observational data
Day 4: Survival Models and AI
Visualising Kaplan-Meier curves
Cox proportional hazard models
AI from the ground up: theory and uses
Supervised learning
Unsupervised learning
Sensitivity, specificity, ROC-AUC
Interpretation of model decision-making
Neural networks and deep learning
Integration of LLMs into Python
Day 5: Data Hackathon!
In cooperation with our partners, we have prepared challenging data tasks using data from the healthcare and education sectors. The goal of the hackathon is for every participant to utilise their new data and programming skills directly in practice and at the same time, learn something new about important social topics. Several teams already came up with interesting findings in both tasks, which provided new insights for stakeholders.