PastryVision is a computer vision project designed to help small pastry shops modernize their operations. By automatically detecting and tallying items in a customer's cart, the system enables a self-serve workflow that enhances efficiency while reducing costs. It eliminates the need for plastic packaging and barcode printing, promotes pastry freshness, and operates locally without requiring web hosting. With its simple and user-friendly setup, PastryVision empowers pastry shops to deliver a modern customer experience effortlessly.
- The frontend camera scans the pastry items on customer's tray.
- The customer takes a snapshot of their tray.
- The snapshot is sent to the backend to detect and identify the pastry items on the tray.
- The backend automatically tallies the quantities and costs of the tray and sends this information to the frontend.
- The customer can confirm the cost breakdown provided from the backend and proceed to pay or cancel their transaction.
- MediaPipe
- React & ML Object Detection Example
- Vite Configuration Guide
- TailwindCSS Documentation
- MediaPipe Object Detection Example
- Training your own model using mediapipe
- Bread dataset on roboflow
- FastAPI - to build our backend
- OpenCV - computer vision operations library
- Python multipart parser
- Mediapipe Examples GitHub Repo
- BB Brain (example solution in Japan)
- Viscovery
PastryVision was developed for the CVI620 NZA class at Seneca Polytechnic (Fall 2024), under Professor Savita Seharawat's guidance.