PYWB stripping out part of URLs on timeline page <url>#/<something> #863

Open
@ChrisDoyleMW

Describe the bug

PYWB seems to be stripping out part of the URL when a timeline page is requested. For
example:
https://webarchive.nationalarchives.gov.uk/*/https://www.arcgis.com/apps/opsdashboard/index.html#/f94c3c90da5b4e9f9a0b19484dd4bb14
loads a timeline for
https://www.arcgis.com/apps/opsdashboard/index.html
Each instance shown is for index.html and not index.html#/f94c3c90da5b4e9f9a0b19484dd4bb14
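
To make clear which part of the URL is being lost, here is a minimal sketch that splits the requested URL with Python's standard `urllib.parse`. The variable names are only illustrative. It also rests on an assumption I have not checked against the PYWB code: browsers normally do not send the fragment (everything after `#`) in the HTTP request, so the server may never see it.

```python
from urllib.parse import urlsplit

# The wildcard (timeline) URL from this report.
requested = (
    "https://webarchive.nationalarchives.gov.uk/*/"
    "https://www.arcgis.com/apps/opsdashboard/index.html"
    "#/f94c3c90da5b4e9f9a0b19484dd4bb14"
)

parts = urlsplit(requested)

# Everything after '#' is the fragment.
print("fragment:", parts.fragment)

# Assumption: this is roughly what reaches the server once the
# fragment is dropped, which matches the timeline that is shown.
sent_to_server = parts._replace(fragment="").geturl()
print("without fragment:", sent_to_server)
```

If that assumption holds, the timeline lookup would only ever receive the `index.html` URL, which is consistent with what is displayed.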

Steps to reproduce the bug

  1. Open this URL in your browser:
    https://webarchive.nationalarchives.gov.uk/*/https://www.arcgis.com/apps/opsdashboard/index.html#/f94c3c90da5b4e9f9a0b19484dd4bb14
  2. Click on the link dated 02 April 2020.
  3. Initially a page with the URL:
    https://webarchive.nationalarchives.gov.uk/ukgwa/20200402132156/https://www.arcgis.com/apps/opsdashboard/index.html
    starts to load. Note that the string after the # symbol has been stripped out.
  4. The page does not load but redirects to this URL:
    https://webarchive.nationalarchives.gov.uk/ukgwa/20200328185042/https://www.arcgis.com/sharing/rest/oauth2/authorize?client_id=opsdashboard&display=default&response_type=token&expiration=20160&redirect_uri=https%3A%2F%2Fwww.arcgis.com%2Fapps%2Fopsdashboard%2FpostSignIn.html&locale=en-gb&state=%7B%22redirect%22%3A%22https%3A%2F%2Fwww.arcgis.com%2Fapps%2Fopsdashboard%2Findex.html%22%2C%22portalUrl%22%3A%22https%3A%2F%2Fwww.arcgis.com%2Fsharing%2Frest%2F%22%7D
    which displays as a blank page.
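
The `state` parameter in the redirect URL from step 4 is percent-encoded JSON. Here is a small sketch, using only the Python standard library, that decodes it to show where the sign-in page would send the visitor back to. The value is copied from the URL above and the variable names are only illustrative.

```python
import json
from urllib.parse import unquote

# Value of the `state` query parameter from the redirect URL in step 4.
state_param = (
    "%7B%22redirect%22%3A%22https%3A%2F%2Fwww.arcgis.com"
    "%2Fapps%2Fopsdashboard%2Findex.html%22"
    "%2C%22portalUrl%22%3A%22https%3A%2F%2Fwww.arcgis.com"
    "%2Fsharing%2Frest%2F%22%7D"
)

# Percent-decode, then parse the resulting JSON object.
state = json.loads(unquote(state_param))

print(state["redirect"])
print(state["portalUrl"])
```

The decoded `redirect` value is the bare `index.html` URL with no `#/...` part, so the dashboard ID already seems to be missing by the time the archived page redirects to the sign-in URL.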

Expected behavior

I'd expect the timeline page to show the timeline for the full URL, including the part after the # symbol, so that visitors can view the capture history for this specific URL.

Screenshots

(Four screenshots attached, taken 2023-09-04 at 12 52 44, 12 51 21, 12 50 44 and 12 55 20.)

Environment

• OS: Linux
• Browser: Any
• Version: PYWB 2.7
