Production-style churn prediction workflow with SHAP explainability and profit-driven evaluation.
Built on the IBM Telco dataset, this project delivers an end-to-end churn prediction system: from data acquisition & cleaning through feature engineering, model selection & evaluation, to a Streamlit app deployment. It incorporates leakage prevention, profit-optimized decision thresholds, and SHAP-based interpretability for actionable insights.
contract_length_months, is_electronic_check, and tenure-based bins.pandas, scikit-learn, xgboost, shapkagglehubmatplotlib, Altair, Plotlyjoblibvenv (dependency pins for reproducibility)python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
python -m src.models.train --csv "data/raw/Telco-Customer-Churn.csv"
streamlit run app/Home.py