-
SCOR Datathon in 2020. Acquired and processed open data, predicted glycohemoglobin and cholesterol levels and the probability of diabetes, then identified the probability change with Random Survival For…
-
Demonstration of an experimentation analysis approach, including A/B testing, A/A testing, and quasi-experimental testing, for data-driven product decision-making.
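As a rough illustration of the A/B testing part, the sketch below runs a two-sided two-proportion z-test on conversion counts. It is not taken from the repository: the function name `two_proportion_z_test` and the counts in the usage lines are hypothetical, and the choice of a pooled z-test is an assumption about what such an analysis might use.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for a difference between two conversion rates.

    conv_a, conv_b: number of conversions in variants A and B.
    n_a, n_b: number of users exposed to variants A and B.
    Returns the z statistic and the two-sided p-value.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled conversion rate under the null hypothesis of no difference.
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical counts, for illustration only.
z, p_value = two_proportion_z_test(conv_a=200, n_a=5000, conv_b=250, n_b=5000)
print(f"z = {z:.3f}, p = {p_value:.4f}")
```

The same function can serve as an A/A check: applied to two samples drawn from the same variant, it should report a significant difference only about as often as the chosen significance level.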
-
Python data pipeline to acquire and clean Sentinel-2 satellite imagery and calculate vegetation indices from it. The package is available only to our clients.
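Since the package itself is not public, the following is only a minimal sketch of one common vegetation index, NDVI, defined as (NIR - Red) / (NIR + Red). On Sentinel-2 the red and near-infrared bands are usually taken from B04 and B08. The function name `ndvi` and the reflectance values are hypothetical, and the flat-list representation of a band is an assumption made to keep the example dependency-free.

```python
def ndvi(red, nir):
    """Per-pixel NDVI = (NIR - Red) / (NIR + Red).

    red, nir: equally sized sequences of surface reflectance values
    (for Sentinel-2, typically bands B04 and B08).
    Pixels where NIR + Red is zero yield NaN instead of raising.
    """
    if len(red) != len(nir):
        raise ValueError("red and nir bands must have the same length")
    result = []
    for r, n in zip(red, nir):
        total = n + r
        result.append((n - r) / total if total != 0 else float("nan"))
    return result


# Hypothetical reflectance values: dense vegetation, sparse vegetation, bare soil.
red_band = [0.05, 0.10, 0.30]
nir_band = [0.45, 0.30, 0.30]
print(ndvi(red_band, nir_band))
```

Values close to 1 indicate dense green vegetation, while values near 0 or below indicate bare soil, water, or built-up surfaces.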
-
Prediction of market premiums for property damage and business interruption insurance products. Added natural hazard data and stacked the three best models as the final model.
-
Launchmetrics Datathon 2020. Identified the impact of Instagram image color properties on company revenue with OpenCV, using the HSV and RGB color spaces.
-
Datathon with a retargeting ad company. Churn date prediction (Normalized RMSE 38), clustering (98% Silhouette), and automated identification of the gaps between the best and the average client within a cluster.
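The entry does not say how the RMSE was normalized, and several conventions exist (dividing by the range, the mean, or the standard deviation of the observed values). The sketch below assumes normalization by the range; the function name `normalized_rmse` and the sample values are hypothetical.

```python
from math import sqrt


def normalized_rmse(y_true, y_pred):
    """RMSE divided by the range of the observed values.

    Assumption: range normalization. Other conventions divide by
    the mean or the standard deviation of y_true instead.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    value_range = max(y_true) - min(y_true)
    if value_range == 0:
        raise ValueError("range of y_true is zero; NRMSE is undefined")
    return sqrt(mse) / value_range


# Hypothetical days-until-churn values, for illustration only.
observed = [10, 20, 30, 40]
predicted = [12, 18, 33, 37]
print(f"NRMSE = {normalized_rmse(observed, predicted):.4f}")
```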
-
Forecast of advertising revenue for the coming months and prediction of the reserve price for ad bidding.
-
Extremely-Imbalanced Public
Extremely imbalanced binary classification for travel insurance claims. Still seeking improvements. Uses several models, including a sequential neural network with backpropagation, calibration cur…
-
NLP-QA-Forum Public
Prediction of the best answers to questions using scraped QA forum data, including the text. Neural network (linear stack of layers): accuracy 92%, MSE 0.08.
Jupyter Notebook Updated May 13, 2020