This project demonstrates the use of TPOT (Tree-based Pipeline Optimization Tool) for automated machine learning on a synthetic classification dataset.
The script performs the following tasks:
- Generates a synthetic classification dataset
- Prepares the data for machine learning
- Uses TPOT to automatically find the best machine learning pipeline
- Evaluates the model's performance
- Exports the best pipeline for future use
To run this project, you need the following Python libraries:
- pandas
- numpy
- scikit-learn
- tpot
You can install these dependencies using pip: