Skip to content

Website hosted on CDN | status not 200 #1601

Open
@LeonemZhang

Description

What type of suggestion are you making?

Modification of existing behavior

What is the problem that your feature request solves?

Some websites are hosted on CDN. Attempting to obtain the page PDF may result in unexpected outcomes.
Sometimes when accessing these URLs, the HTTP status is not even 200.
The archiving results of these websites have been determined as successful, so they can be successfully obtained when using success status filtering

Image

Image

Image

What is your proposed solution?

If the website's access result is not 200, the archive result should also be a failure

What hacks or alternative solutions have you tried to solve the problem?

non

What version of ArchiveBox are you currently using?

0.8.5rc51

How badly do you want this new feature?

  • It's an urgent deal-breaker, I can't live without it
  • It's important to add it in the near-mid term future
  • It would be nice to have eventually
  • I'm willing to work on a PR to develop this myself
  • I have donated money to go towards fixing this issue

Mini Survey

  • I like ArchiveBox so far / would recommend it to a friend
  • I've had a lot of difficulty getting ArchiveBox set up
  • I would pay $10/mo for a hosted version of ArchiveBox if it had this feature

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions