This project uses machine learning to predict annual population estimates across Virginia using demographic data.
- Linear Regression (baseline)
- Random Forest Regressor
- LSTM Neural Network
- One-hot encoding, scaling, feature selection
- 5-fold CV with RMSE and R² evaluation
- Random Forest had best performance (RMSE ≈ 266)
- LSTM struggled due to limited temporal data
- Top features: age, year, locality, race
Virginia Single-Race Population Estimates
VA_Population_Modeling.ipynb: modeling notebooktemplate.tex: final report (LaTeX)figures/: plots