Skip to content

Applications of ML techniques including PCA for dimensionality reduction with K-Means and DBSCAN clustering to identify vulnerable communities across Virginia - CS 4774 ML @ UVA Spring '25

Notifications You must be signed in to change notification settings

huygnguyen04/Virginia-Social-Vulnerability-Identification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Team Name: Tensor Titans 😄

Members: Artie Humphreys, Santiago Perez Lugo, Huy Nguyen


🔗 Project Links

📄 Final Report (PDF): View Full Analysis

📊 Google Colab Notebook: View Cleaned Source Code and Full Version

📽️ Checkpoint Presentation Video: Watch on YouTube


Project Overview

This project explores how unsupervised machine learning can help uncover socially vulnerable communities across the state of Virginia using the CDC’s 2018 Social Vulnerability Index (SVI) dataset. Our workflow includes:

  • Dimensionality Reduction: Principal Component Analysis (PCA) reduces 128 socio-demographic indicators into a compact vulnerability space.

  • Clustering: K-Means is used for full-state coverage, while DBSCAN identifies dense clusters of high-risk communities (hotspots).

  • Scoring Metrics: Two composite scores proposed are PCA-Norm and Distance-Norm to validate against official SVI scores using Pearson correlation.


🗺️ Sample Visualizations

Map of All Tracts in Virginia

Virginia Census Tracts Map

Preliminary Results – Clustered SVI Score Map with KMeans and DBSCAN

Clustering SVI Score Map


This project was developed for CS 4774: Machine Learning at the University of Virginia (Spring 2025), and offers a data-driven framework for identifying and visualizing social vulnerability across Virginia's communities.

About

Applications of ML techniques including PCA for dimensionality reduction with K-Means and DBSCAN clustering to identify vulnerable communities across Virginia - CS 4774 ML @ UVA Spring '25

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published