Skip to content

This project involves scraping quotes from the web, performing exploratory data analysis (EDA), and using SQL for deep data insights. The goal is to uncover patterns in author popularity, sentiment distribution, common themes, and keyword trends.

Notifications You must be signed in to change notification settings

J-TECH-bot/Quotes_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Quotes_Analysis

This project involves scraping quotes from the web, performing exploratory data analysis (EDA), and using SQL for deep data insights. The goal is to uncover patterns in author popularity, sentiment distribution, common themes, and keyword trends. Ideal for practicing web scraping, data cleaning, SQL querying, and data visualization.

Key Features: •⁠ ⁠Web Scraping: Collected quotes using Python libraries like ⁠ requests ⁠ and ⁠ BeautifulSoup ⁠. •⁠ ⁠Data Cleaning: Removed duplicates, cleaned text, and standardized author names. •⁠ ⁠EDA (Exploratory Data Analysis):

  • Top authors with the most quotes
  • Most frequent keywords
  • Quote length distribution
  • Common themes and topics •⁠ ⁠SQL Analysis:
  • Insights using advanced SQL queries (group by,substring, subquery,Order by,limit)
  • Frequency of words or phrases
  • Author-wise sentiment or theme analysis

Technologies Used

•⁠ ⁠Python (BeautifulSoup, Pandas, Matplotlib, Seaborn, Regex) •⁠ ⁠SQL (MySQL or SQLite) •⁠ ⁠Jupyter Notebook •⁠ •⁠ ⁠Web Scraping: Collected quotes using Python libraries like ⁠ requests ⁠ and ⁠ BeautifulSoup ⁠. •⁠ ⁠Data Cleaning: Removed duplicates, cleaned text, and standardized author names. •⁠ ⁠EDA (Exploratory Data Analysis):

  • Top authors with the most quotes
  • Most frequent keywords
  • Quote length distribution
  • Common themes and topics •⁠ ⁠SQL Analysis:
  • Insights using advanced SQL queries (group by,subquery, substring, order by, limit)
  • Frequency of words or phrases
  • Author-wise sentiment or theme analysis

About

This project involves scraping quotes from the web, performing exploratory data analysis (EDA), and using SQL for deep data insights. The goal is to uncover patterns in author popularity, sentiment distribution, common themes, and keyword trends.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published