Skip to content
Discussion options

You must be logged in to vote

Ok, makes sense, just wanted to make sure you're aware of the limitations for this use case.

As for your specific concerns:

ArchiveBox will need to be taught to commit to the repository after each job runs. I can probably do this myself.

Yeah I would do this out-of-band with a periodic cronjob or inotify watcher.

If a file isn't present locally (to a given checkout), it shows up as a broken symlink. If ArchiveBox itself checks for the existence of such files, it might get confused.

ArchiveBox exclusively uses os.path.isfile(...) for file existence, which returns False for broken symlinks. However, ArchiveBox wont try to re-download files once a success is recorded in the DB, even if i…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@hatzka-nezumi
Comment options

@pirate
Comment options

Answer selected by hatzka-nezumi
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants