GitHub - DishaAggarwal31/Job-Market-Data-Analysis: An interactive job market analytics dashboard built with Python, Matplotlib, and ipywidgets. Explore job trends by industry, location, and experience with dynamic filters and visual insights.

Job Market Data Engineering Project -

This project aims to scrape, clean, standardize, and enrich job listing data from multiple sources such as Naukri, LinkedIn, Monster, and others. The goal is to build a structured dataset for exploratory analysis, dashboards, or feeding into ML pipelines.

✅ Key Features Implemented So Far 🔍 Web Scraping Scraped job listings from multiple job portals (Naukri, LinkedIn, Monster, etc.).

Parsed job title, company, location, experience, and other fields.

Handled edge cases like missing or malformed data, multiple locations, etc.

🧹 Data Cleaning & Preprocessing Cleaned and standardized job titles and experience strings.

Extracted structured experience ranges (min, max years).

Removed non-relevant scraped text artifacts.

Normalized company names (partial logic to handle typos, misformats).

🏙️ Location Extraction Built a robust logic to extract city, state, and country from messy location strings.

Managed entries with multiple cities or "remote" indicators.

Utilized a cities.json file with a structured city-to-state mapping.

Standardized city spellings (e.g., “bengluru” → “bengaluru”, “gurgaon” → “gurugram”).

🧾 Experience & Industry Categorization Parsed experience level from job title and/or metadata.

Categorized jobs into experience buckets (Fresher, Junior, Mid, Senior).

Inferred industry sector from job title or company sector.

🆔 Unique Job ID Generation Implemented a persistent job ID system per source.

Format: e.g., TAL0001, MON0002.

Maintained counters using a job_id_counter.json.

🧠 Flexible Structure Designed code to be modular and extensible.

Used external config/mapping files where needed (cities.json, job title keywords, etc.).

Plan to expand with deduplication, ML-based matching, and dashboarding.

🔧 Tech Stack Python 3.10+

BeautifulSoup, requests for scraping

pandas for data processing

json, os, re, sys for file and text handling

Job Market Insights Dashboard

An interactive, filter-based dashboard for analyzing job listings across various sectors, countries, states, and experience levels. This project focuses on data-driven insights using Python, enabling users to explore job trends and patterns through visualizations.

Features

General Overview: High-level summary of top industries, job locations, and experience categories.
Interactive Filters: Select Country, State, Industry, and Experience Level to customize your view.
Dynamic Visualizations: Automatically update graphs based on selected filters:
- Top industries and cities
- Jobs by country and state
- Stacked bar charts for state/city breakdown
- Pie charts and bar charts for experience category distribution
Built entirely in Jupyter Notebook using:
- Pandas for data manipulation
- Matplotlib for plotting
- ipywidgets for interactive controls

Skills Highlighted

Data Cleaning & Filtering
Interactive Data Exploration
Dashboard Design with Widgets
Real-world Data Analytics Use Case

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
data_preprocessing		data_preprocessing
documents		documents
notebooks		notebooks
output		output
scrapers		scrapers
utils		utils
visualizations		visualizations
README.md		README.md
city_state_scraper_main.py		city_state_scraper_main.py
data_preprocessing_main.py		data_preprocessing_main.py
job_scraper_main.py		job_scraper_main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Job Market Insights Dashboard

Features

Skills Highlighted

About

Releases

Packages

Languages

DishaAggarwal31/Job-Market-Data-Analysis

Folders and files

Latest commit

History

Repository files navigation

Job Market Insights Dashboard

Features

Skills Highlighted

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages