Automated extraction, transformation & visual analytics for AIHW Admitted-Patient-Care tables.
This pipeline fetches the latest Excel workbooks published by the Australian Institute of Health and Welfare (AIHW), cleans and normalises every sheet into a tidy relational schema in PostgreSQL, then launches an interactive Streamlit dashboard (7 charts plus optional profiling) for rapid exploratory analysis and auto-generated insights.
Tables: staging_admissions and clean_admissions
Profiling: ydata-profiling
Libraries: requests, BeautifulSoup, pandas, openpyxl
Entry point: main.py (CI / cron friendly)
Configuration: DB_URL for easy cloud deploy
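
As a rough illustration of what "normalising a sheet to a tidy schema" can look like, here is a minimal sketch using pandas. It is not the project's actual cleaning code: the function name `to_tidy`, the column names (`state`, `year`, `separations`), the sample values, and the handling of the "n.p." placeholder are all assumptions made for the example.

```python
import pandas as pd


def to_tidy(wide: pd.DataFrame, id_cols: list[str]) -> pd.DataFrame:
    """Reshape a wide sheet (one column per year) into one row per observation.

    Hypothetical helper: the real workbooks may need different id columns.
    """
    # Every column not listed in id_cols becomes a (year, separations) pair.
    tidy = wide.melt(id_vars=id_cols, var_name="year", value_name="separations")
    # Assumed placeholder handling: non-numeric cells such as "n.p." become NaN.
    tidy["separations"] = pd.to_numeric(tidy["separations"], errors="coerce")
    # Drop rows with no usable value and renumber the index.
    return tidy.dropna(subset=["separations"]).reset_index(drop=True)


# Made-up sample data, only to show the shape of the transformation.
wide = pd.DataFrame(
    {
        "state": ["NSW", "VIC"],
        "2020-21": [100, "n.p."],
        "2021-22": [110, 95],
    }
)
tidy = to_tidy(wide, ["state"])
```

The long format has one measurement per row, which maps directly onto a relational table and is the shape most charting libraries expect.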
Clone the repo, create a PostgreSQL database, set DB_URL in .env, then run:
pip install -r requirements.txt
python main.py # runs full ETL
streamlit run streamlit_app.py # runs Streamlit server
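
For reference, a minimal sketch of how DB_URL might be resolved at startup, using only the standard library. This is an assumption about the mechanism, not the project's confirmed behaviour: the repository may rely on a package such as python-dotenv instead, and the helper name `load_db_url` is hypothetical. It assumes a .env line of the form `DB_URL=postgresql://user:password@localhost:5432/aihw` (placeholder credentials and database name).

```python
import os
from pathlib import Path


def load_db_url(env_path: str = ".env") -> str:
    """Return DB_URL from the process environment, else from a KEY=VALUE .env file."""
    # A real environment variable wins, which suits CI and cloud deploys.
    url = os.environ.get("DB_URL")
    if url:
        return url

    path = Path(env_path)
    if path.exists():
        for raw in path.read_text().splitlines():
            line = raw.strip()
            # Skip blanks, comments, and lines that are not KEY=VALUE.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            if key.strip() == "DB_URL":
                # Strip optional surrounding quotes.
                return value.strip().strip('"').strip("'")

    raise RuntimeError("DB_URL is not set in the environment or in .env")
```

Checking the environment before the file means the same code runs unchanged locally (with .env) and on a hosted service that injects DB_URL directly.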