Skip to content

Rcode879/Spam-Email-Filtering-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Spam-Email-Filtering-

Application of naive bayes to filter spam emails, from scratch.

πŸ” Overview

The classifier works by:

  1. Reading a dataset of emails labeled as spam or not spam.
  2. Calculating word frequencies and probabilities for both spam and non-spam emails.
  3. Applying Naive Bayes classification using log probabilities to avoid underflow.
  4. Predicting new emails as spam (1) or not spam (0).

βš™οΈ Features

  • Calculates prior probabilities for spam and not-spam emails.
  • Uses Laplace smoothing for unseen words.
  • Applies logarithmic probabilities for numerical stability.
  • Evaluates model performance on a separate test dataset.

Model Accuracy

πŸ“‚ Dataset

Dataset Name Description Source
Emails Training Dataset Labeled email features for training the Naive Bayes model Kaggle Link
Emails Test Dataset Raw email text with spam labels for testing Kaggle Link

About

Application of naive bayes to filter spam emails, from scratch

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages