This project implements a Support Vector Machine (SVM) model to classify individuals as Healthy or Parkinson’s patients based on biomedical voice measurements. The dataset is sourced from Kaggle and includes multiple acoustic features extracted from voice recordings.
- Source: Parkinson's Disease Dataset on Kaggle('https://www.kaggle.com/datasets/vikasukani/parkinsons-disease-data-set/data')
- Filename: parkinsons.data
- Attributes:
- name: Unique identifier for each patient
- status: (Target Variable)
- 1 → Parkinson’s Disease
- 0 → Healthy
- Other Features: Various voice-based metrics such as jitter, shimmer, and fundamental frequency.
git clone https://github.com/Bioinformatician-dev/parkinsons-svm-classifier.git
cd parkinsons-svm-classifier
pip install numpy pandas scikit-learn matplotlib seaborn kaggle
- Accuracy Score: Measures overall classification correctness.
- Confusion Matrix: Shows the number of correct and incorrect predictions.
- Classification Report: Provides precision, recall, and F1-score.
- Accuracy: ~85-90%
- Precision & Recall: Depend on feature scaling and hyperparameter tuning.
- The SVM model effectively classifies Parkinson’s patients.
- Feature scaling improves performance.
- Adding non-linear kernels (RBF, polynomial) might further enhance accuracy.