Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
-
Updated
May 19, 2021 - Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Apache Spark Course Material
Spark BigQuery Parallel
Lightweight type-safe operations for Spark
This repository is created by Dharshan Kumar K S and Siva Prakash as part of our semester project from 'Big Data Analysis' subject
Hadoop hdfs mapreduce hive spark使用案例
Techniques for analyzing and visualizing data at scale.
A Spark framework written in Scala with gradle as build tool.
Demonstration of basic data transformations using Spark RDD and Spark DataFrame in Scala
This is the repository for Youtube Project for the subject PBDA. We are implementing analysis for finding top videos in each and every category. Also, we are planning to find the top trending words in each category.
Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.
Demo of spark program with cucumber framework using scala
Repository to demonstrate sample data engineering assignment
Using Scala for big data computations for basic tasks
This Spark App analyses various covid cases data and enables you to create custom mathematical insights using a unified data structure and a trait method. After processing data it then writes to Cassandra which is then used as primary source for Data Visualization.
Learning Journey: Spark using Scala, Python, PySpark
Coursework from Functional Programming in Scala Coursera specialization.
Add a description, image, and links to the spark-scala topic page so that developers can more easily learn about it.
To associate your repository with the spark-scala topic, visit your repo's landing page and select "manage topics."