Prediction of Air Quality in Seoul Using a Multi-Scale Convolutional Neural Network on Weather Data

Author: John W.S. Lee

1. Introduction

This project follows up on a previous study that examined the correlation between weather parameters and fine dust concentration. The aim of that study was not to predict fine dust concentration from weather parameters, but to explore how fine dust concentration and weather factors interact.

In this study, the objective is to predict the next day's air quality from 24 hours of weather data recorded on the current day. The model takes as input the eight features that proved useful in the earlier study: wind_direction, humidity(%), lowest_ceiling(100m), temp(°C), wind_speed(m/s), local_P(hPa), precipitation(mm), and PM10_Counts. The target variable, Air_is_bad?, is binary and is determined by the PM10 particle counts for the following day (True for `PM10 counts` exceeding 45, and False for counts below 45). The data preprocessing procedure is described in detail in the data_preprocessing_notebook.
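The labelling rule above can be sketched in a few lines. This is a minimal illustration, not the code from the data_preprocessing_notebook: the function name `label_next_day`, the sample values, and the use of a daily mean to reduce hourly readings to one value per day are all assumptions, as is treating a value of exactly 45 as False (the text only specifies "exceeding" and "below").

```python
from statistics import mean

PM10_THRESHOLD = 45  # threshold stated in the text


def label_next_day(daily_pm10, threshold=PM10_THRESHOLD):
    """Return one Air_is_bad? label per day, based on the NEXT day's PM10 value.

    daily_pm10: per-day PM10 values in chronological order.
    The final day has no following day, so it receives no label.
    """
    return [next_value > threshold for next_value in daily_pm10[1:]]


# Hypothetical hourly readings for three consecutive days.
# Aggregating with the daily mean is an assumption; the source does not
# state how hourly counts are reduced to a daily value.
hourly = {
    "day1": [30.0] * 24,
    "day2": [50.0] * 24,
    "day3": [40.0] * 24,
}
daily = [mean(values) for values in hourly.values()]
labels = label_next_day(daily)
print(labels)  # [True, False]
```

Here day1 is labelled True because day2 exceeds the threshold, and day2 is labelled False because day3 does not.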

2. Modelling

Because the input data is a time series, a 1-D convolutional neural network was used as the base architecture for the model. The 24-hour series for each of the eight input features was scaled and converted into a NumPy array. These arrays were then stacked to create an input with eight channels (for the full procedure, see the data_preprocessing_notebook). The following figure shows a representative example of the input features, with the corresponding target value as the title.
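The stacking step can be illustrated as follows. This is a rough sketch under stated assumptions rather than the project's preprocessing code: `min_max_scale` and `build_sample` are hypothetical helpers, the data is random, and scaling each series within a single day is a simplification (the source does not say which scaler was used or what it was fitted on).

```python
import numpy as np

# Feature names as listed in the introduction.
FEATURES = [
    "wind_direction", "humidity(%)", "lowest_ceiling(100m)", "temp(°C)",
    "wind_speed(m/s)", "local_P(hPa)", "precipitation(mm)", "PM10_Counts",
]


def min_max_scale(values):
    """Scale a 1-D series to [0, 1]; a constant series maps to zeros."""
    values = np.asarray(values, dtype=np.float32)
    span = values.max() - values.min()
    if span == 0:
        return np.zeros_like(values)
    return (values - values.min()) / span


def build_sample(day):
    """Stack the eight scaled 24-hour series into an array of shape (8, 24)."""
    return np.stack([min_max_scale(day[name]) for name in FEATURES], axis=0)


# Hypothetical day of data: random hourly values for each feature.
rng = np.random.default_rng(0)
day = {name: rng.normal(size=24) for name in FEATURES}
sample = build_sample(day)
print(sample.shape)  # (8, 24)
```

The resulting (channels, time) layout matches what 1-D convolution layers in common deep learning libraries expect once a batch dimension is added.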

The model architecture was tuned using the Optuna library. The following figures display the classification report, confusion matrix, and ROC curve of the model. The prediction accuracy on the test set was 83%.
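For readers unfamiliar with these metrics, the relationship between a confusion matrix and accuracy can be shown with a small pure-Python sketch. The labels and predictions below are made up for illustration and are not the project's test results; the helper names are hypothetical.

```python
def confusion_counts(y_true, y_pred):
    """Return (tn, fp, fn, tp) for two equal-length sequences of binary labels."""
    tn = fp = fn = tp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth and pred:
            tp += 1  # bad air correctly predicted
        elif truth:
            fn += 1  # bad air missed
        elif pred:
            fp += 1  # false alarm
        else:
            tn += 1  # good air correctly predicted
    return tn, fp, fn, tp


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    tn, fp, fn, tp = confusion_counts(y_true, y_pred)
    return (tn + tp) / (tn + fp + fn + tp)


# Made-up example labels (True means Air_is_bad? is True).
y_true = [True, False, True, False, False, True]
y_pred = [True, False, False, False, True, True]

print(confusion_counts(y_true, y_pred))  # (2, 1, 1, 2)
print(round(accuracy(y_true, y_pred), 3))  # 0.667
```

Accuracy alone can hide an imbalance between missed bad-air days and false alarms, which is why the confusion matrix and ROC curve are reported alongside it.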

Comprehensive procedures for model setup and hyperparameter tuning, as well as model evaluations, are outlined in the modeling notebook and evaluation notebook respectively. The subsequent illustration provides a preview of predictions alongside actual labels.
