One of the most likely problems we see is failed transfers leading to truncated WARC.GZ files. We can spot this with `gunzip -t`, but it would be good if `warcio check` also raised this as a validation error. My tests so far indicate that warcio, cdxj-indexer and related tools all skip over these errors silently.
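For anyone who wants the same check from Python, the test that `gunzip -t` performs can be roughly approximated with the standard library alone. This is only a sketch of the kind of check being requested, not a description of how warcio works internally; `gzip_is_intact` is a hypothetical helper name, and the set of exceptions caught is my assumption about how truncation and corruption surface in Python's `gzip` module.

```python
import gzip
import zlib


def gzip_is_intact(path, chunk_size=1024 * 1024):
    """Return True if every gzip member in `path` decompresses to the end.

    Hypothetical helper, not part of warcio. It reads the whole file,
    discarding the output, so that any truncation is actually reached.
    """
    try:
        with gzip.open(path, "rb") as stream:
            # gzip.open reads through concatenated members, which is how
            # WARC.GZ files are normally written (one member per record).
            while stream.read(chunk_size):
                pass
    except (EOFError, gzip.BadGzipFile, zlib.error):
        # EOFError: the file ended before the end-of-stream marker.
        # BadGzipFile / zlib.error: bad header, bad checksum or corrupt data.
        return False
    return True
```

Like `gunzip -t`, this cannot notice a file that was cut off exactly on a boundary between two gzip members, since what remains is still a valid gzip stream.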