Skip to content

To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

Notifications You must be signed in to change notification settings

bhuvaneshwarguttula/Student-Performance-Indicator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Student Performance Indicator

End to End Machine Learning Project


Project Overview

Education is a critical factor in shaping an individual's future. The primary objective of this project is to develop a predictive model that can predict the performance of students in their academics. This project aims to explore various factors that may affect students' performance, including demographic information, family background, and school-related attributes. By analyzing this data, we can gain insights into potential areas for improvement in educational systems.

Dataset Overview

The dataset consists of the following columns:

  • gender: Gender of the student (categorical: 'male', 'female')
  • race/ethnicity: Ethnic background of the student (categorical: five levels)
  • parental level of education: Highest level of education attained by the student's parents (categorical: 'high school', 'some college', 'associate's degree', 'bachelor's degree', 'master's degree')
  • lunch: Type of lunch received (categorical: 'standard', 'free/reduced')
  • test preparation course: Whether the student completed a test preparation course (categorical: 'none', 'completed')
  • math score: Score in mathematics (numerical: 0-100)
  • reading score: Score in reading (numerical: 0-100)
  • writing score: Score in writing (numerical: 0-100)

Key Features

  • Data Cleaning and Preprocessing: Handling missing values, correcting data types, and preparing the dataset for analysis.
  • Exploratory Data Analysis (EDA): Visualization of key variables to understand their distribution and relationships.
  • Statistical Analysis: Identifying significant factors that correlate with student performance using correlation analysis and regression models.
  • Visualization: Visualizing relationships between variables through various plots, such as bar charts, and scatter plots.
  • Model Training: Model training on several machine learning models and finding out which model performed the best.

Technologies Used

  • Python
    • pandas
    • NumPy
    • seaborn
    • matplotlib
    • scikit-learn
    • Flask
  • Jupyter Notebook

Results

  • The project provides insights into which factors have a significant impact on student performance. These findings can be utilized to tailor educational strategies and improve academic outcomes.

Installation Steps

Follow these steps to install and set up the project directly from the GitHub repository:

  1. Clone the Repository

    • Open your terminal or command prompt.
    • Navigate to the directory where you want to install the project.
    • Run the following command to clone the GitHub repository:
      git clone https://github.com/bhuvaneshwarguttula/Student-Performance-Indicator.git
      
  2. Create a Virtual Environment (Optional but recommended)

    • It's a good practice to create a virtual environment to manage project dependencies. Run the following command:
      conda create -p <Environment_Name> python==<python version> -y
      
  3. Activate the Virtual Environment (Optional)

    • Activate the virtual environment based on your operating system:
      conda activate <Environment_Name>/
      
  4. Install Dependencies

    • Navigate to the project directory:
      cd [project_directory]
      
    • Run the following command to install project dependencies:
      pip install -r requirements.txt
      
  5. Run the Project

    • Start the project by running the appropriate command.
      python app.py
      
  6. Access the Project

    • Open a web browser or the appropriate client to access the project.

About

To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages