Australian Hospital Separations ETL Pipeline ๐Ÿ“Š

Automated extraction, transformation & visual analytics for AIHW Admitted-Patient-Care tables.

Streamlit dashboard showing hospital-separation analytics

Project Overview

This pipeline fetches the latest Excel workbooks published by the Australian Institute of Health & Welfare (AIHW), cleans and normalises every sheet to a tidy relational schema in PostgreSQL, then spins up an interactive Streamlit dashboard (7 charts + optional profiling) for rapid exploratory analysis and auto-generated insights.

Key Features

Tech Stack โš™๏ธ

  • Python ยท requests, BeautifulSoup, pandas, openpyxl
  • SQLAlchemy โžœ PostgreSQL
  • Streamlit + Plotly Express
  • Automated via single main.py (CI / cron friendly)
  • Env-configured DB_URL for easy cloud deploy

Try It Yourself

Clone the repo, create a PostgreSQL database, set DB_URL in .env, then run:

pip install -r requirements.txt
python main.py      # runs full ETL
streamlit run streamlit_app.py  # runs Streamlit server

๐Ÿ”— View Source Code on GitHub

๐Ÿ” Blog Post