Skip to content
#

datapreparation

Here are 62 public repositories matching this topic...

Learn data visualization through Tableau 2020 and create opportunities for you or key decision-makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. This Course on Udemy

  • Updated Jul 25, 2020

The project deals with determining and predicting the type of accident taking place in the city of Austin. The data would help in understanding what possible factors are leading to the accidents based on the severity of the incident that has occurred.

  • Updated Jul 5, 2022
  • Jupyter Notebook

Trying to predict survival rate of passengers using algorithms like Logistic Regression, Ada Boost, Gradient Boost , Decision Tree Classifiers , Extra Tree Classifiers , Random Forest Classifiers and XG Boost with appropriate data preprocessing techniques.

  • Updated Nov 19, 2020
  • Jupyter Notebook

In this project, I have used logistic regression, a supervised machine learning algorithm, to predict whether a person has diabetes or not based on various features such as age, blood pressure, glucose level, body mass index, etc. I have used Python and popular libraries such as Pandas, Scikit-Learn, and Matplotlib to perfom model building

  • Updated Jan 26, 2024
  • Jupyter Notebook

The Bikes Sales Analysis Excel Project is a practical exploration of sales data analysis using Microsoft Excel. This project showcases how Excel can be a powerful tool for data cleaning, preprocessing, visualization, and dashboard creation, all within a familiar spreadsheet environment.

  • Updated Aug 4, 2023

ScrapySub is a Python library designed to recursively scrape website content, including subpages. It fetches the visible text from web pages and stores it in a structured format for easy access and analysis. This library is particularly useful for NLP and AI developers who need to gather large amounts of web content for their projects.

  • Updated Jul 14, 2024
  • Python

This project involves the use of K-Means Clustering to find the best accommodation for students in Bangalore (or any other city of your choice) by classifying accommodation for incoming students on the basis of their preferences on amenities, budget and proximity to the location.

  • Updated Jan 9, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the datapreparation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datapreparation topic, visit your repo's landing page and select "manage topics."

Learn more