Example 1: Right-click a product title; titles of the same type are matched automatically. Click the 'Select All' option, then the 'Collect Data' option, to collect the titles of all products.
Example 2: Right-click a large product block, click the 'Select All' option, then the 'Select Child Elements' option, then the 'Collect Data' option. This collects the information of all products, with each child element saved as a separate field.
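The two examples above can be pictured with a small conceptual sketch. This is not EasySpider's own code: the page fragment, the class names, and the helper functions `collect_titles` and `collect_products` are all hypothetical, and Python's standard `xml.etree.ElementTree` stands in for a real browser. It only illustrates the idea of matching every element of the same type and then splitting each block into sub-fields.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment standing in for a product listing.
PAGE = """
<div id="products">
  <div class="product"><h2 class="title">Red Mug</h2><span class="price">4.50</span></div>
  <div class="product"><h2 class="title">Blue Mug</h2><span class="price">5.00</span></div>
  <div class="product"><h2 class="title">Green Mug</h2><span class="price">5.25</span></div>
</div>
"""

root = ET.fromstring(PAGE)


def collect_titles(root):
    """Example 1: collect the text of every element matching the title pattern."""
    pattern = ".//div[@class='product']/h2[@class='title']"
    return [element.text for element in root.findall(pattern)]


def collect_products(root):
    """Example 2: select every product block and save each child as a sub-field."""
    rows = []
    for block in root.findall(".//div[@class='product']"):
        # Use each child's class attribute as the field name (an assumption
        # made for this sketch; a real task names its fields in the designer).
        rows.append({child.get("class"): child.text for child in block})
    return rows


titles = collect_titles(root)
products = collect_products(root)
```

Here `titles` holds one string per product, while `products` holds one dictionary per product block with a `title` and a `price` field, which mirrors the "saved by sub-field" behavior described in Example 2.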
Design and run crawler and browser-automation test tasks visually, much like working in Excel, with no coding experience required.
You can configure a cloud server address so that task information is stored and managed in one place.
Supports unlimited levels of loop nesting and conditional judgments, and can execute custom JavaScript instructions and system instructions. A Break statement that exits the loop can be inserted at any position in the flowchart.
The code is open source, and all features of the software are free (except for commercial use). There are no pop-ups or external advertisements.
The software runs on Windows, macOS, and Linux.
Graphical interface for setting up and executing web scraping tasks. Typically, designing a web scraping task takes only 2-5 minutes.
No registration required. All tasks and data are saved locally and do not pass through any third-party servers.
Can freely add browser extensions, execute JavaScript instructions, and directly manipulate the browser using Selenium statements.
Can run multiple instances of the program simultaneously to achieve parallel data collection.
Supports various captcha recognition methods, such as image captchas and reCAPTCHA.
Supports element screenshot, OCR recognition, and image downloading.
Supports switching tunnel IPs and private IPs.
Supports usage within local area networks without internet access.
Can invoke external programs to fulfill complex requirements.
Can execute tasks through API calls, enabling advanced automation.
Supports scheduled task execution, which makes it useful for routine, recurring jobs.
Pause task execution anytime for manual debugging, page inspection, or entering captchas.
Can modify the Python environment at any time, for example by defining custom variables, and insert those variable values into any code statement.
Supports emulation of mobile devices for scraping mobile web pages.
Supports reading Excel files for importing large batches of input parameters.
Can export data as Excel/CSV/TXT files and write to MySQL databases.
Tasks can be copied and migrated to other machines, regardless of the operating system environment.
Can set up page scrolling to retrieve content that loads dynamically.
Supports scraping data within iframe tags, including multi-layer nested iframes.
Can retrieve and modify page cookies.
Execute tasks directly from the command line, seamlessly integrating with other programs.
Can collect JSON files.
Can download files such as images and PDFs.
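As a rough illustration of the CSV export listed among the features above, the sketch below writes collected rows to CSV text with Python's standard `csv` module. It is not taken from EasySpider: the function `rows_to_csv`, the field names, and the sample rows are assumptions made for the example.

```python
import csv
import io


def rows_to_csv(rows, fieldnames):
    """Serialize a list of dictionaries to CSV text with a header row."""
    buffer = io.StringIO()
    # lineterminator="\n" keeps the output identical across operating systems.
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical rows, shaped like the sub-field records a task might collect.
rows = [
    {"title": "Red Mug", "price": "4.50"},
    {"title": "Blue Mug", "price": "5.00"},
]
csv_text = rows_to_csv(rows, ["title", "price"])
```

Building the text in memory keeps the sketch free of file paths; writing to disk would only require passing an open file object to `csv.DictWriter` instead of the `StringIO` buffer.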
This software is intended for educational and communication purposes only. Using it for any illegal activity is strictly prohibited, for example crawling government or military websites that do not permit crawling. The user bears all consequences resulting from the use of this software, and the author shall not be held responsible or liable in any way. Furthermore, the software is protected by patent rights. If you intend to use it for commercial or profit-making purposes, such as fulfilling client orders with the software or selling the collected data, please contact the author ([email protected]) for patent authorization and payment: https://www.patentguru.com/cn/search?q=一种自定义提取流程的服务封装系统
For individual users, EasySpider is completely free, ad-free, open-source software. Its development and maintenance rely entirely on the author's voluntary contributions, so you may choose to support the author and help sustain the passion and energy needed to maintain it. If you have benefited or profited from this software, you are also welcome to support the author through the following methods:
1. Alipay account: [email protected]. You can also scan the QR code below.
2. WeChat Pay: Scan the QR code below.
3. PayPal account: naibowang. You can also scan the QR code below.