Collection of links to various datasets for machine learning
Updated Dec 21, 2021
Outlier detection (z-score and IQR) and visualization on the Geolife dataset for the transport mode detection task
Advanced anomaly detection using topological data analysis and manifold learning.
Predicting news impressions on X (Twitter) using IndoBERT embeddings + XGBoost/LightGBM/CatBoost. Full NLP pipeline: Indonesian text preprocessing, DBSCAN outlier handling, 10-fold CV, Optuna tuning, and Streamlit deployment. Undergraduate thesis project.
Unsupervised outlier detection using custom GraphSAGE-based neural networks
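
The z-score and IQR rules named in the Geolife entry above can be sketched in a few lines. This is a minimal illustration using only the Python standard library, not code from any of the listed repositories; the function names, the thresholds (3.0 and 1.5), and the `speeds` values are assumptions chosen for the example.

```python
from statistics import mean, pstdev, quantiles


def zscore_outliers(values, threshold=3.0):
    """Flag values whose absolute z-score exceeds `threshold` (assumed default)."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        # All values identical: nothing can be an outlier.
        return [False] * len(values)
    return [abs(v - mu) / sigma > threshold for v in values]


def iqr_outliers(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (k=1.5 is an assumed default)."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v < lower or v > upper for v in values]


# Hypothetical speed readings with one implausible value at the end.
speeds = [4.8, 5.1, 5.0, 4.9, 5.2, 5.0, 4.7, 5.3, 5.1, 60.0]

# A lower z-score threshold is used here because a small sample
# limits how large any single z-score can get.
z_flags = zscore_outliers(speeds, threshold=2.5)
iqr_flags = iqr_outliers(speeds)
```

Both rules flag only the last reading in this sample. The IQR rule depends on quartiles rather than the mean and standard deviation, so a single extreme value distorts it less than it distorts the z-score.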