Skip to content

alangrosso/pySpark_tutorial

 
 

Repository files navigation

pySpark_tutorial

List of contents

  • RDDs and DataFrame
  • Exploratory data analysis
  • Handeling multiple dataframes
  • Visualization
  • Machine learning

About

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%