🛍️ Google Shopping Scraper with Python (Crawlbase API)

🛍️ Google Shopping Scraper with Python (Crawlbase API)

📝 Description

This project includes two Python-based scrapers that use the Crawlbase Crawling API to extract product data from Google Shopping:

A SERP scraper to collect multiple products from the shopping search results.
A product page scraper to extract detailed info from individual product listings.

📖 Read the full blog post here: How to Scrape Google Shopping Data

⚙️ Tech Stack

Crawlbase Crawling API
requests handled internally by the SDK
BeautifulSoup for HTML parsing
json for structured data output

📦 Installation

Install the required dependencies:

pip install crawlbase beautifulsoup4

🔑 Setup

Update the script(s) with your Crawlbase token:

crawling_api = CrawlingAPI({'token': 'YOUR_CRAWLBASE_TOKEN'})

🛒 Scraper 1: Google Shopping SERP Scraper (`google_shopping_serp_scraper.py`)

✅ What It Does

Scrapes multiple pages of Google Shopping search results.
Extracts:
- Product Title
- Price
- Image URL
- Retailer
- Product URL

🧠 Pagination Strategy

Uses the start parameter to paginate (20 products per page).

▶️ How to Run

python google_shopping_serp_scraper.py

📁 Output

Saves results to:

products.json

🧪 Sample Output

[
  {
    "title": "Louis Vuitton Neverfull MM",
    "price": "$2,030.00",
    "image": "https://example.com/image.jpg",
    "retailer": "Louis Vuitton",
    "product_url": "https://www.google.com/shopping/product/123456789"
  },
  ...
]

📄 Scraper 2: Product Page Scraper (`google_shopping_product_scraper.py`)

✅ What It Does

Extracts detailed info from a single Google Shopping product page:

Title
Price
Description
Image URLs

▶️ How to Run

Update the product_url in the script, then:

python google_shopping_product_scraper.py

📁 Output

Saves details to:

product_details.json

🧪 Sample Output

{
	"title": "Louis Vuitton Neverfull MM",
	"price": "$2,030.00",
	"description": "Iconic Louis Vuitton tote with a timeless design...",
	"images": ["https://example.com/image1.jpg", "https://example.com/image2.jpg"]
}

🔒 Note on Anti-Bot Measures

Google Shopping employs strict bot protection. Using Crawlbase Crawling API ensures:

IP rotation
JavaScript rendering
User-Agent spoofing
Geo-targeting support

✅ To-Do

Add CLI support for dynamic search terms and product links
Combine both scrapers into a single flow
Output data in CSV

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
google_shopping_product_scraper.py		google_shopping_product_scraper.py
google_shopping_serp_scraper.py		google_shopping_serp_scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🛍️ Google Shopping Scraper with Python (Crawlbase API)

📝 Description

⚙️ Tech Stack

📦 Installation

🔑 Setup

🛒 Scraper 1: Google Shopping SERP Scraper (`google_shopping_serp_scraper.py`)

✅ What It Does

🧠 Pagination Strategy

▶️ How to Run

📁 Output

🧪 Sample Output

📄 Scraper 2: Product Page Scraper (`google_shopping_product_scraper.py`)

✅ What It Does

▶️ How to Run

📁 Output

🧪 Sample Output

🔒 Note on Anti-Bot Measures

✅ To-Do

About

Uh oh!

Releases

Packages

Languages

ScraperHub/google-shopping-scrapers

Folders and files

Latest commit

History

Repository files navigation

🛍️ Google Shopping Scraper with Python (Crawlbase API)

📝 Description

⚙️ Tech Stack

📦 Installation

🔑 Setup

🛒 Scraper 1: Google Shopping SERP Scraper (google_shopping_serp_scraper.py)

✅ What It Does

🧠 Pagination Strategy

▶️ How to Run

📁 Output

🧪 Sample Output

📄 Scraper 2: Product Page Scraper (google_shopping_product_scraper.py)

✅ What It Does

▶️ How to Run

📁 Output

🧪 Sample Output

🔒 Note on Anti-Bot Measures

✅ To-Do

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

🛒 Scraper 1: Google Shopping SERP Scraper (`google_shopping_serp_scraper.py`)

📄 Scraper 2: Product Page Scraper (`google_shopping_product_scraper.py`)

Packages