-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HTML manuscript not updating #1177
Comments
Oh no!!! If that's the case, I wonder if it's time to move to true book-like HTML formatting, with a TOC and "read next section" links at the bottom. It is so hard to load anyways... in terms of how to do that, I can play around with see how the suggestion from this post generates locally! |
I don't think the problem is the size of a single manuscript file. Rather, the branch also tracks archived versions of every historical version of the manuscript we've deployed before so we can maintain permalinks. Check out the contents of https://github.com/greenelab/covid19-review/tree/gh-pages/v We have about 600 past copies of the manuscript archived there. Bringing in @dhimmel in case the permalink and archiving is a possible cause. |
They really need a 😱 react for posts like this... that makes sense and would definitely explain the time outs! |
Looking at the raw CI logs for this build:
So the |
https://github.com/orgs/community/discussions/35197 provides more details. The artifacts grew to 10 GB in size, which leads to the 10 min timeout @dhimmel detected. We can monitor the artifacts size from the actions pages such as https://github.com/greenelab/covid19-review/actions/runs/3129872205 Archiving the old versions of the manuscript files isn't too hard. We already have a Zenodo repository linked to releases of this GitHub repository, so we could create a release from the That would destroy our old permalinks, which is unfortunate. We could manually try to preserve the old versions that correspond to releases (e.g. the arXiv preprints), but would miss some. I don't see a general solution though if we are going to continue hosting the manuscript on GitHub pages. Maybe we set the old permalinks to redirect to the Zenodo DOI? That would be better than a 404. @dhimmel do you think this is a general issue for large Manubot projects worth discussing in the rootstock repo, or does this review just push the Manubot workflow to the extreme? |
Slightly unfortunate, but you could do something in between like just delete the images directory.
Yeah, I don't think Manubot creates permalinks for git tags, but that would be a nice feature if it did.
possibly, I think it's a reason to recommend embedding images by link if you plan to have large images and many commits. The insights from the discussions#35197 might be valuable in USAGE. As well as what you end up deciding in terms of pruning things. |
I'm documenting my process to prune the Checkout
Our repository also had Zenodo archiving enabled, so make a tag and release to archive the
Zenodo created an archive of the release that is 8.2 GB compressed. (I also noticed at https://help.zenodo.org/ that Zenodo now supports metadata in a I start by checking the size of the contents and iterative delete until it is back to a reasonable size.
It's still huge even after removing images and pdfs. Time to remove entire manuscripts arbitrarily.
Removing those HTML files is a reminder the disk usage must be elsewhere. It's in the
Let's blast a few more HTML manuscripts. My favorite number is "5" so it stays.
I'm stopping here. If we address the problem below we have a reasonable artifact size and many past version of manuscripts left (HTML only though). I can restore complete archives from my local copy, and anyone could do this by downloading the zip from Zenodo. I'm only restoring the two versions we refer to in the manual references for now and the latest version. I could restore more later that correspond to releases or other special versions.
I noticed I broke the symbolic links for git status excerpt
I restored those and then commit the other changes. Had to do the taboo
If you browse We still should do this before closing the issue or merging too many more changes to the manuscript:
Every time we push to |
The last version of the HTML manuscript is from August 17, 2022.
gh-pages
branch commits 471a614 and c269dc0 failed to update. Both have build logs that end with the messagesThe last successful HTML deployment had a similar failure but without the cancellation message
I'm not sure how to debug this, so I asked for advice in the GitHub community discussions. My initial guess is that the content on our
gh-pages
branch has gotten too large.The text was updated successfully, but these errors were encountered: