‼ web.archive.org mirrors do not work with new media url substitution #47
Labels
bug
Something isn't working
help wanted
Extra attention is needed
HIGH PRIORITY
PRIORITY ISSUE - breaking
refactor
Feature requires a refactor/rework
Milestone
The wayback machine (WBM) rewrites all the media URLs even on a javascript and json entries level.
This means all media URLs are being rewritten to their web.archive.org archived counterparts.
However, when trying to retrieve a URL stored in an attribute in the html, the WBM rewrites that URL as well, removing the web.archive.org prefix from it at some point between the value in the html and the function in js.
This is an issue because we are storing media file replacements with the keys set to the (cleaned up) original media file URL.
Because of WBM messing with the URLs, all keys (and all property values) have the prefix, but the value we retrieved does not. We can not use this value to retrieve its associated media_replacements object. Rewriting the URL to include the prefix again might also not be an issue as the timestamp of when the URL was archived is part of that prefix, and there is probably no way to tell what it might be.
Potential fixes:
The text was updated successfully, but these errors were encountered: