You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
timcosgrove
changed the title
Node.js script that crawls va.gov based on Sitemap and checks for broken links
Node.js script that crawls a Sitemap and checks for broken links
Jan 3, 2024
Requirements
We need a script that checks broken links on VA.gov, so that we can verify that our content works correctly for Veterans.Acceptance criteria
Background & implementation details
There are any number of broken link checking packages in the Node ecosystem: https://www.npmjs.com/search?q=broken%20link
Linkinator is a likely candidate. Note however that Linkinator does not currently crawl via Sitemap: JustinBeckwith/linkinator#346
A separate Sitemap reading package would likely be necessary to find URLs to pass to Linkinator for checking.
Parallelizing the checks is ideal, though if it adds significant complication it should not block completion of the ticket.
The text was updated successfully, but these errors were encountered: