A professional Python-based tool that intelligently crawls, analyzes, and reports the 50 largest files on your system, with smart suggestions for cleanup and a stylish output report.
- 🔎 Interactive Mode – Choose between scanning the whole system or a specific drive
- 📊 Top 50 Largest Files – Sorted by size, displayed with full path
- 🧹 Smart Cleanup Suggestions – Flags cache/temp/log/junk files for removal
- 📄 Structured Report – Generates a clean, formatted text report saved locally
- ⏳ Progress Bar – Smooth, responsive progress tracking with
tqdm
- 💬 Human-Readable Sizes – File sizes shown in friendly units like
MB
,GB
, etc. - 🧠 Intelligent Junk Detection – Detects unnecessary files using names and extensions
- 🖥️ Works on Windows, requires only Python 3.x
Make sure you have Python 3 installed, then clone and install dependencies:
git clone https://github.com/xoxxel/smart-disk-analyzer.git
cd smart-disk-analyzer
pip install -r requirements.txt
Start the scan using:
npm run analyze
Or directly:
python smart_disk_analyzer.py
You'll be prompted with:
📦 Choose scan option:
1 - Full system scan (all drives)
2 - Scan a specific drive (e.g., D)
Enter choice [1 or 2]: 2
Enter drive letter (e.g., "D"): D
The script will start scanning and display a progress bar as it crawls your file system.
================================================================================
📄 File Size Report - 2025-07-10 18:20:03
🔍 Scanned paths: D:\
📌 Top 50 Largest Files
D:\Games\HugeFile.iso
Size: 4.2 GiB
D:\Temp\debug.log
Size: 1.3 GiB | ✅ Recommended for Deletion
--------------------------------------------------------------------------------
Files are marked as recommended for deletion if:
- Filename contains:
cache
,temp
,log
,bak
,old
, etc. - Extensions include:
.tmp
,.log
,.bak
,.dmp
,.swp
,.chk
The report is saved in the root folder of the project as:
file_scan_report.txt
You can use NPM for a cleaner command:
// package.json
"scripts": {
"analyze": "python smart_disk_analyzer.py"
}
Then run:
npm run analyze
Created by xoxxel with ❤️ using Python and a passion for disk health.
This project is open-source under the MIT License.