Author: John W.S. Lee
This project is a sequel to the previous study that examined the correlation between weather parameters and fine dust concentration. The primary aim of the prior investigation was not to anticipate fine dust concentration based on weather parameters, but rather to explore the interplay between fine dust concentration and weather factors.
In this study, the objective is to predict the air quality for the upcoming day using a 24-hour dataset of weather parameters from the given day. Specifically, a total of eight features that demonstrated utility in the earlier study have been employed as inputs for the predictive model. These features encompass wind_direction
, humidity(%)
, lowest_ceiling(100m)
, temp(°C)
, wind_speed(m/s)
, local_P(hPa)
, precipitation(mm)
, and PM10_Counts
. The target variable, denoted as Air_is_bad?
, assumes a binary state and is determined based on the PM10 particle counts for the following day (True for `PM10 counts`` exceeding 45, and False for counts below 45). A detailed outline of the data preprocessing procedure can be found in the data_preprocessing_notebook.
Given that the input data for prediction exists in the form of a time series, a 1-D convolutional neural network was employed as the foundational architecture for the model. The 24-hour dataset of the eight input features underwent scaling and conversion into a NumPy
array. Subsequently, these arrays were stacked to create an input data structure with eight channels (For an in-depth process, refer to the data_preprocessing_notebook). The following figure shows a representative instance of input features, accompanied by the corresponding target value as the title.
The architecture of the models was optimized by implementing optuna
library, and the following figures show classification report, confusion matrix, and roc curve of the model.