The Spam Email Classifier is a Machine Learning project that classifies emails as Spam or Ham (Not Spam).
It uses Natural Language Processing (NLP) and Machine Learning algorithms to achieve high accuracy in detecting spam emails.
- Preprocessing of raw email text (tokenization, stopword removal, TF-IDF vectorization).
- Trained ML model for binary classification (Spam / Ham).
- Interactive Hugging Face demo for real-time testing.
- Easy to extend with deep learning models in the future.
The model was trained on the popular Spam Email Dataset containing thousands of spam and ham messages.
It ensures balanced training for reliable predictions.
- Python 🐍
- Scikit-learn
- Pandas & NumPy
- NLTK (for text preprocessing)
- Hugging Face Spaces + Gradio (for live demo)
(Add your screenshots here: training plots, confusion matrix, Hugging Face demo UI)
👉 Try the model on Hugging Face: Spam Email Classifier Demo
- Achieved high accuracy on the test dataset.
- Correctly identifies spam vs ham messages.
- Reliable predictions on unseen emails.
- Add deep learning models (LSTM / Transformer-based).
- Support for multiple languages.
- Deploy as a browser extension for real-time email classification.
Muhammad Rayan Shahid
AI & ML Enthusiast
🌐 GitHub
💼 LinkedIn
📊 Kaggle
🤗 Hugging Face
🎥 YouTube - ByteBrilliance AI