Skip to content

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.

License

Notifications You must be signed in to change notification settings

GaryShen2008/spark-rapids-examples

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spark-rapids-examples

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.

It includes docs and example applications that demonstrate the RAPIDS.ai GPU-accelerated XGBoost-Spark project. It now supports Spark 3.0.0+

Examples

1. Xgboost examples

Getting Started Guides

1. Xgboost examples guide

Try one of the "Getting Started Guides" below. Please note that they target the Mortgage dataset as written, but with a few changes to EXAMPLE_CLASS and dataPath, they can be easily adapted to the Taxi or Agaricus datasets.

You can get a small size datasets for each example in the datasets folder. These datasets are only provided for convenience. In order to test for performance, please prepare a larger dataset by following Preparing Datasets via Notebook. We also provide a larger dataset: Morgage Dataset (1 GB uncompressed), which is used in the guides below.

These examples use default parameters for demo purposes. For a full list please see "Supported Parameters" for Scala or Python

API

1. Xgboost examples API

Troubleshooting

Contributing

See the Contributing guide.

Contact Us

Please see the RAPIDS website for contact information.

License

This content is licensed under the Apache License 2.0

About

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 68.1%
  • Shell 16.3%
  • Dockerfile 15.6%