Six ITB Scrapper

This repository aims to get all of available matakuliah and dosen information from all of prodi and facutly in ITB for Dosen Rank Project. Built using python

DISCLAIMER

I do not own the data. All of the data here are owned by ITB

library used

argparse
bs4
requests

How to run

Clone this repository
create virtual environment from python virtualenv venv
install all requirements pip install -r requirements.txt
provide your nim and cookie from ITB sso service
1. To get your cookie, you need to logged in in SIX
2. Right click > inspect > application
3. Grab the khongguan cookie from there

How it works

To get all cleaned data, first is run the scrape.py to get all of available data in SIX ITB
The data then get cleaned by using clean.py
Cleaned data then get converted to json or sql using output.py

Folder structure

.
├── README.md
├── clean.py 
├── data
│   ├── cleaned.json
│   ├── dosen_id_name.json   
│   ├── dosen_matkul_map.json
│   ├── fakultas.json        
│   ├── fakultas_shorthand.json
│   ├── id_prodi_map.json
│   ├── matkul_dosen_map.json
│   ├── matkul_id_name.json
│   ├── prodi.json
│   └── saved.json
├── output.py
├── requirements.txt
├── scrape.py
└── sql
    ├── dosen.sql
    ├── fakultas.sql
    ├── matkul.sql
    ├── matkul_dosen.sql
    └── prodi.sql

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Six ITB Scrapper

DISCLAIMER

library used

How to run

How it works

Folder structure

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
sql		sql
.gitignore		.gitignore
README.md		README.md
clean.py		clean.py
output.py		output.py
requirements.txt		requirements.txt
scrape.py		scrape.py

IloveNooodles/SIX-ITB-Scrapper

Folders and files

Latest commit

History

Repository files navigation

Six ITB Scrapper

DISCLAIMER

library used

How to run

How it works

Folder structure

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages