This repository collects all available course (matakuliah) and lecturer (dosen) information from every study program (prodi) and faculty at ITB for the Dosen Rank Project. It is built with Python.
I do not own the data. All of the data here is owned by ITB.
- argparse
- bs4
- requests
- Clone this repository
- Create a Python virtual environment: `virtualenv venv`
- Install all requirements: `pip install -r requirements.txt`
- Provide your `nim` and `cookie` from the ITB SSO service
- To get your cookie, you need to be logged in to SIX
- Right click > `inspect` > `application`
- Grab the `khongguan` cookie from there
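
As a minimal sketch of how the `nim` and the `khongguan` cookie might be handed to a script, the snippet below parses both from the command line with `argparse` and attaches the cookie to a request object. The flag names `--nim` and `--cookie`, the helper names, and the URL are assumptions made for illustration and are not the actual interface of `scrape.py`. The standard library's `urllib.request` is used only to keep the sketch dependency-free; the project itself lists `requests`.

```python
import argparse
import urllib.request

# Assumed entry point for SIX; replace with the page you actually need.
SIX_URL = "https://six.itb.ac.id/"


def parse_credentials(argv=None):
    """Parse the NIM and cookie value (flag names are hypothetical)."""
    parser = argparse.ArgumentParser(description="Credentials for SIX")
    parser.add_argument("--nim", required=True, help="your student number (NIM)")
    parser.add_argument("--cookie", required=True, help="value of the khongguan cookie")
    return parser.parse_args(argv)


def build_request(url, cookie_value):
    """Build a request carrying the khongguan cookie (no network I/O happens here)."""
    headers = {"Cookie": f"khongguan={cookie_value}"}
    return urllib.request.Request(url, headers=headers)
```

Calling `parse_credentials(["--nim", "<your nim>", "--cookie", "<cookie value>"])` and passing the resulting `cookie` to `build_request` yields a request that can then be opened with `urllib.request.urlopen`. Treat the cookie like a password, since it represents your logged-in session.
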
- To get the cleaned data, first run `scrape.py` to fetch all available data from SIX ITB
- The data is then cleaned with `clean.py`
- The cleaned data is then converted to `json` or `sql` using `output.py`
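
The final conversion step can be illustrated roughly as follows. This is a sketch under stated assumptions, not the code in `output.py`: the record shape (`id`, `name`), the table name `dosen`, and all function names are hypothetical.

```python
import json


def to_json(records):
    """Serialize cleaned records to a JSON string."""
    return json.dumps(records, ensure_ascii=False, indent=2)


def sql_literal(value):
    """Render a value as a SQL literal: numbers as-is, everything else quoted."""
    if isinstance(value, (int, float)):
        return str(value)
    # Double any single quote so the string literal stays valid.
    return "'" + str(value).replace("'", "''") + "'"


def to_sql(table, records):
    """Render one INSERT statement per record (column order follows each dict)."""
    statements = []
    for record in records:
        columns = ", ".join(record.keys())
        values = ", ".join(sql_literal(v) for v in record.values())
        statements.append(f"INSERT INTO {table} ({columns}) VALUES ({values});")
    return "\n".join(statements)


# Hypothetical cleaned records; the real schema may differ.
sample = [{"id": 1, "name": "Dosen A"}, {"id": 2, "name": "O'Brien"}]
```

Building statements as text like this suits generating a dump file. When inserting directly into a live database, parameterized queries are the safer choice.
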
.
├── README.md
├── clean.py
├── data
│ ├── cleaned.json
│ ├── dosen_id_name.json
│ ├── dosen_matkul_map.json
│ ├── fakultas.json
│ ├── fakultas_shorthand.json
│ ├── id_prodi_map.json
│ ├── matkul_dosen_map.json
│ ├── matkul_id_name.json
│ ├── prodi.json
│ └── saved.json
├── output.py
├── requirements.txt
├── scrape.py
└── sql
    ├── dosen.sql
    ├── fakultas.sql
    ├── matkul.sql
    ├── matkul_dosen.sql
    └── prodi.sql
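
To inspect one of the generated files under `data/`, a small loader such as the one below may help. The internal structure of these files is not documented here, so the sketch only reports the top-level type and size; `load_data` and `describe` are hypothetical helper names.

```python
import json
from pathlib import Path


def load_data(path):
    """Load a JSON file and return the parsed content."""
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


def describe(content):
    """Return the top-level type name and the number of entries."""
    size = len(content) if isinstance(content, (dict, list)) else 1
    return type(content).__name__, size
```

For example, running `describe(load_data("data/dosen_matkul_map.json"))` from the repository root shows whether that file is a JSON object or array and how many entries it holds.
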