Spotify API ETL of Track Information to PostgreSQL

This project connects to the Spotify API to collect all useful track, album, and artist information about The Beatles. An ETL pipeline then loads all track information by The Beatles to PostgreSQL, in which the data is normalized utilizing a star schema.

While this project was originally focused on creating an ETL pipeline for The Beatles as a band, this project can be configured for any artist.

Methods Used

ETL
Data Modeling
Normalization
API Connection

Technologies Used

Python
PostgreSQL
pgAdmin

Packages Used

Psycopg2
Spotipy
Pandas

How To Run

Adjust Configurations

Configurations in the config.ini file will need to be adjusted per local PostGres settings (username, password, host, and database). The config.ini file is also where the artist name can be changed should that be desired.

Obtain Spotify API Tokens

In order to use the Spotipy package, API tokens will need to be obtained directly from Spotify. Click here for more information on this process.

Set Environment Variables

For this project to process, the Spotify API access key needs to be set as an environment variable called "spotify_id", and secret key needs to be set as an environment variable called "spotify_secret". These environment variables will need to be set on the operating system this project is to be run on.

Install Requirements and Run

On the command line of your operating system, navigate to the repository directory (ideally using a Python virtual environment).

Run the following code on the command line to install requirements:

pip install -r requirements.txt

Run the following code on the command line to run this project:

Python run.py

Featured Scripts or Deliverables

run.py

Sources

Spotipy

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
modules		modules
README.md		README.md
config.ini		config.ini
requirements.txt		requirements.txt
run.py		run.py
schema_design.jpg		schema_design.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spotify API ETL of Track Information to PostgreSQL

Methods Used

Technologies Used

Packages Used

How To Run

Adjust Configurations

Obtain Spotify API Tokens

Set Environment Variables

Install Requirements and Run

Featured Scripts or Deliverables

Other Repository Contents

Sources

About

Releases

Packages

Languages

ErikaJacobs/Beatles-Bops

Folders and files

Latest commit

History

Repository files navigation

Spotify API ETL of Track Information to PostgreSQL

Methods Used

Technologies Used

Packages Used

How To Run

Adjust Configurations

Obtain Spotify API Tokens

Set Environment Variables

Install Requirements and Run

Featured Scripts or Deliverables

Other Repository Contents

Sources

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages