[Self-Host] crawl job can't stop #874

Closed
@TChengZ

Description

Describe the Issue
Provide a clear and concise description of the self-hosting issue you're experiencing.

To Reproduce
Steps to reproduce the issue:
1. I self-hosted Firecrawl and crawled the website 'https://www.16888.com' with the following crawl params:
image
but the job never seems to finish: it keeps running until Redis runs out of memory (OOM) and the app crashes.
image
image

2. I then switched to using Firecrawl with an API key and crawled the same website 'https://www.16888.com' with the same crawl params.
This time everything went well; it crawled only 10 pages.
image

I think there must be a bug in the open-source Firecrawl.
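
For readers who cannot see the screenshots, here is a minimal sketch of how the two runs could be set up. It is an illustration only, not the exact request from this report: the `/v1/crawl` path, the `limit` field, the `http://localhost:3002` address for the self-hosted instance, and the `fc-YOUR-KEY` placeholder are all assumptions, and `limit=10` is inferred from the 10 pages crawled in step 2. The helper names `build_crawl_request` and `start_crawl` are hypothetical.

```python
import json
import urllib.request

# Assumptions (not confirmed by the screenshots in this issue):
# - the self-hosted API listens on http://localhost:3002
# - the crawl endpoint is POST /v1/crawl and accepts a "limit" field
SELF_HOSTED_BASE = "http://localhost:3002"
CLOUD_BASE = "https://api.firecrawl.dev"


def build_crawl_request(base_url, target_url, limit, api_key=None):
    """Build (without sending) a crawl request for one Firecrawl instance."""
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Only the API-key run from step 2 needs an Authorization header.
        headers["Authorization"] = f"Bearer {api_key}"
    body = json.dumps({"url": target_url, "limit": limit}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/crawl",
        data=body,
        headers=headers,
        method="POST",
    )


def start_crawl(request, timeout=30):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Same target and same limit for both runs; only the instance differs.
self_hosted = build_crawl_request(SELF_HOSTED_BASE, "https://www.16888.com", limit=10)
cloud = build_crawl_request(
    CLOUD_BASE, "https://www.16888.com", limit=10, api_key="fc-YOUR-KEY"
)
```

Passing `self_hosted` or `cloud` to `start_crawl` would submit the job. Keeping the payload identical makes it easier to tell whether the difference in behavior comes from the instance rather than from the params.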

Expected Behavior
A clear and concise description of what you expected to happen when self-hosting.

Screenshots
If applicable, add screenshots or copies of the command line output to help explain the self-hosting issue.

Environment (please complete the following information):

  • OS: linux
  • Firecrawl Version: main branch, last commit time: Mon Oct 28 20:28:30 2024 -0300
  • Node.js Version: 20.11.1

Logs
If applicable, include detailed logs to help understand the self-hosting problem.

Configuration
Provide relevant parts of your configuration files (with sensitive information redacted).

Additional Context
Add any other context about the self-hosting issue here, such as specific infrastructure details, network setup, or any modifications made to the original Firecrawl setup.
