Oil Price Prediction Backend

title	Oil Price Prediction API
emoji	🛢️
colorFrom	blue
colorTo	green
sdk	docker
app_port	7860
pinned	false

Oil Price Prediction Backend

FastAPI backend for Brent oil price forecasting using a trained ensemble model.

Features

14-day price forecast using VMD-based ensemble model
Locked daily forecast: one prediction record per day, refreshed after market close
Automatic price fetching from Yahoo Finance (BZ=F ticker)
REST API with both auto and manual prediction modes
Sentiment Analysis using FinBERT

Quick Start (Local Development)

# Install dependencies
pip install -r requirements.txt

# Run the server
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Environment Variables

Configure the application using environment variables (create a .env file):

Variable	Default	Description
`NEWSAPI_KEY`	-	NewsAPI.org API key for news fetching
`NEWSDATA_KEY`	-	NewsData.io API key for news fetching
`SENTIMENT_MODE`	`finbert`	Sentiment analysis mode: `simple` or `finbert`
`SKIP_FINBERT_PRELOAD`	`false`	Skip FinBERT model preloading at startup (set to `true` for deployments without HuggingFace access)
`PREDICT_CACHE_TTL_SECONDS`	`45`	In-memory TTL (seconds) for the latest forecast returned by `/predict`
`PREDICTION_PRECOMPUTE_ENABLED`	`true`	Enables background forecast refresh so `/predict` can return warm cached responses
`PREDICTION_PRECOMPUTE_INTERVAL_SECONDS`	`900`	Background precompute interval in seconds (default 15 minutes)
`PREDICTION_LOCK_SCHEDULE_ENABLED`	`true`	Enables daily locked prediction scheduler
`PREDICTION_LOCK_SCHEDULE_HOUR`	`0`	Daily lock job hour (scheduler timezone)
`PREDICTION_LOCK_SCHEDULE_MINUTE`	`30`	Daily lock job minute (scheduler timezone)
`PREDICTION_LOCK_SCHEDULE_TIMEZONE`	`America/New_York`	Timezone used for daily lock schedule
`PREDICTION_CLOSE_LOCK_BUFFER_MINUTES`	`20`	Minutes to wait after Yahoo regular session end before treating close as stable
`HF_SPACE_TARGET`	-	Hugging Face Space host used by local Prometheus scraper (e.g., `your-space.hf.space`)
`GRAFANA_REMOTE_WRITE_URL`	-	Grafana Cloud Prometheus remote-write endpoint
`GRAFANA_METRICS_USER_ID`	-	Grafana Cloud metrics username
`GRAFANA_API_TOKEN`	-	Grafana Cloud API token with Metrics Publisher permissions

Example .env file:

NEWSAPI_KEY=your_api_key_here
NEWSDATA_KEY=your_api_key_here
SENTIMENT_MODE=finbert
SKIP_FINBERT_PRELOAD=false
PREDICTION_LOCK_SCHEDULE_ENABLED=true
PREDICTION_LOCK_SCHEDULE_TIMEZONE=America/New_York
PREDICTION_LOCK_SCHEDULE_HOUR=0
PREDICTION_LOCK_SCHEDULE_MINUTE=30
PREDICTION_CLOSE_LOCK_BUFFER_MINUTES=20

FinBERT Model Loading

The FinBERT sentiment model is loaded from Hugging Face on first use. For deployments in restricted network environments:

Set SKIP_FINBERT_PRELOAD=true to skip model preloading during startup
The model will be loaded on the first sentiment analysis request instead
Ensure the deployment environment can access huggingface.co for model downloads

API Endpoints

Endpoint	Method	Description
`/predict`	GET	Return locked daily forecast from database
`/predictions/lock/run`	POST	Manually trigger daily locked forecast generation
`/predictions/fan`	GET	Fan chart quantile bands for latest forecast
`/predictions/compare`	GET	Compare stored forecasts vs actual prices
`/prices`	GET	View fetched price data
`/historical/prices`	GET	Historical imported prices (daily/weekly/monthly)
`/historical/features/combined`	GET	Historical joined price + news features
`/health`	GET	Health check
`/sentiment/add`	POST	Add daily sentiment
`/sentiment/bulk`	POST	Bulk upload sentiment
`/docs`	GET	Swagger API Documentation

Example Usage

# Generate latest 14-day forecast
curl http://localhost:8000/predict

# Generate reasoning for forecasted prices
curl http://localhost:8000/explain

# Manually run today's locked daily forecast
curl -X POST http://localhost:8000/predictions/lock/run
 
# Fan chart bands for frontend visualization
curl "http://localhost:8000/predictions/fan?min_samples_per_horizon=20"

# Historical actual vs predicted comparison
curl "http://localhost:8000/predictions/compare?start_date=2025-01-01&end_date=2025-12-31"

Daily Locked Forecast Flow

The backend now uses a lock-once-per-day workflow:

At 12:30 AM ET (configurable), a scheduler job runs once.
It fetches historical prices and selects a stable close using Yahoo session timing. Before market close + buffer it uses the previous trading close; after close + buffer it can use the same-day close.
The model generates forecast output and stores one locked row for that prediction_date.
All daytime /predict requests read from the database record only (no model rerun).

This removes intraday drift and prevents prediction-table duplicates.

Model Artifacts

The trained model artifacts should be placed in model_artifacts/:

config.pkl - Model configuration
mid_gru.pt - Mid-frequency GRU model
sent_gru.pt - Sentiment GRU model
xgb_hf_models.pkl - XGBoost high-frequency models
meta_models.pkl - Ridge meta-ensemble models
meta_scalers.pkl - Meta-model scalers
scaler_mid.pkl, scaler_price.pkl, scaler_sent.pkl - Feature scalers

Architecture

Prices (30 days) → Feature Engineering → Component Models → Meta-Ensemble → 14-day Forecast
                                              ↓
                                    [ARIMA, Mid-GRU, Sent-GRU, XGBoost]

Testing

This project includes comprehensive test coverage with 200+ test cases covering API endpoints, services, models, and integrations.

Running Tests

# Run all tests
pytest

# Run with coverage report
pytest --cov=app --cov-report=html

# Run specific test category
pytest tests/test_api_endpoints.py
pytest tests/test_services.py
pytest tests/test_models.py

# Run in parallel (faster)
pytest -n auto

Using Test Scripts

Windows:

run_tests.bat          # Run all tests
run_tests.bat coverage # Run with coverage
run_tests.bat api      # Run API tests only
run_tests.bat clean    # Clean test artifacts

Linux/Mac:

./run_tests.sh          # Run all tests
./run_tests.sh coverage # Run with coverage
./run_tests.sh api      # Run API tests only
./run_tests.sh clean    # Clean test artifacts

Using Make:

make test              # Run all tests
make test-cov          # Run with coverage
make test-fast         # Run in parallel
make test-api          # Run API tests only

Test Structure

tests/
├── test_api_endpoints.py    # API endpoint tests (health, predict, prices, etc.)
├── test_services.py         # Service layer tests (prediction, sentiment, etc.)
├── test_schemas.py          # Data validation tests
├── test_models.py           # Model loading and inference tests
├── test_database.py         # Database operation tests
├── test_integration.py      # End-to-end integration tests
└── conftest.py             # Shared fixtures and configuration

Test Coverage

Current coverage: >80%

View detailed coverage report:

pytest --cov=app --cov-report=html
# Open htmlcov/index.html in browser

Development Dependencies

Install test dependencies:

pip install -r requirements-dev.txt

See tests/README.md for detailed testing documentation.

Code Quality & Analysis

SonarCloud

This project uses SonarCloud for continuous code quality and security analysis.

Key Metrics:

Coverage: Tracks test coverage across the codebase
Security: Identifies security vulnerabilities and hotspots
Maintainability: Measures code smells and technical debt
Reliability: Detects bugs and code issues
Duplications: Identifies duplicate code blocks

View Full Report:
https://sonarcloud.io/dashboard?id=PramudithaN_fyp_backend

Local Analysis

Run SonarScanner locally (requires SonarCloud token):

# Install SonarScanner (one-time setup)
# macOS: brew install sonar-scanner
# Windows: Download from https://docs.sonarcloud.io/advanced-setup/ci-based-analysis/sonarscanner-cli/

# Run analysis
sonar-scanner \
  -Dsonar.organization=pramudithan \
  -Dsonar.projectKey=PramudithaN_fyp_backend \
  -Dsonar.sources=app \
  -Dsonar.host.url=https://sonarcloud.io \
  -Dsonar.login=YOUR_SONAR_TOKEN

Setting Up SonarCloud (For Maintainers)

Go to SonarCloud
Import the GitHub repository
Add SONAR_TOKEN secret to GitHub repository settings
The workflow will automatically run on push and pull requests

Name		Name	Last commit message	Last commit date
Latest commit History 106 Commits
.github/workflows		.github/workflows
app		app
docs		docs
grafana		grafana
model_artifacts		model_artifacts
prometheus		prometheus
scripts		scripts
tests		tests
.coveragerc		.coveragerc
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
Makefile		Makefile
OPTIMIZATION_SUMMARY.md		OPTIMIZATION_SUMMARY.md
Procfile		Procfile
README.md		README.md
README_PROJECT_OVERVIEW.md		README_PROJECT_OVERVIEW.md
docker-compose.yml		docker-compose.yml
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run_tests.bat		run_tests.bat
run_tests.sh		run_tests.sh
runtime.txt		runtime.txt
sonar-project.properties		sonar-project.properties

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Oil Price Prediction Backend

Features

Quick Start (Local Development)

Environment Variables

FinBERT Model Loading

API Endpoints

Example Usage

Daily Locked Forecast Flow

Model Artifacts

Architecture

Testing

Running Tests

Using Test Scripts

Test Structure

Test Coverage

Development Dependencies

Code Quality & Analysis

SonarCloud

Local Analysis

Setting Up SonarCloud (For Maintainers)

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Oil Price Prediction Backend

Features

Quick Start (Local Development)

Environment Variables

FinBERT Model Loading

API Endpoints

Example Usage

Daily Locked Forecast Flow

Model Artifacts

Architecture

Testing

Running Tests

Using Test Scripts

Test Structure

Test Coverage

Development Dependencies

Code Quality & Analysis

SonarCloud

Local Analysis

Setting Up SonarCloud (For Maintainers)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages