Node.js script that crawls a Sitemap and checks for broken links #16639

Closed
3 tasks done
Tracked by #16780 ...
timcosgrove opened this issue Jan 3, 2024 · 0 comments

timcosgrove commented Jan 3, 2024

Requirements

We need a script that checks for broken links on VA.gov, so that we can verify that our content works correctly for Veterans.

Acceptance criteria

Background & implementation details

The Node ecosystem has many broken-link-checking packages: https://www.npmjs.com/search?q=broken%20link

Linkinator is a likely candidate. Note, however, that Linkinator does not currently crawl via Sitemap: JustinBeckwith/linkinator#346

A separate Sitemap reading package would likely be necessary to find URLs to pass to Linkinator for checking.
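As a rough illustration of the Sitemap-reading step, here is a minimal sketch that pulls the `<loc>` entries out of a Sitemap XML string using only Node built-ins. The function names `extractSitemapUrls`, `decodeXmlEntities`, and `fetchSitemapUrls` are hypothetical, the regex approach is an assumption made to keep the example dependency-free, and the global `fetch` call assumes Node 18 or later. A dedicated Sitemap package would likely handle sitemap index files and edge cases more robustly.

```javascript
'use strict';

// Hypothetical helper: decode the five predefined XML entities.
// &amp; is decoded last so that "&amp;lt;" becomes "&lt;" rather than "<".
function decodeXmlEntities(text) {
  return text
    .replace(/&lt;/g, '<')
    .replace(/&gt;/g, '>')
    .replace(/&quot;/g, '"')
    .replace(/&apos;/g, "'")
    .replace(/&amp;/g, '&');
}

// Hypothetical helper: return every <loc> value found in a Sitemap XML string.
// This is a regex sketch, not a full XML parser.
function extractSitemapUrls(xml) {
  const urls = [];
  const locPattern = /<loc>\s*([^<]+?)\s*<\/loc>/g;
  for (const match of xml.matchAll(locPattern)) {
    urls.push(decodeXmlEntities(match[1]));
  }
  return urls;
}

// Hypothetical helper: download a Sitemap and return its URLs.
// Assumes the global fetch available in Node 18+.
async function fetchSitemapUrls(sitemapUrl) {
  const response = await fetch(sitemapUrl);
  if (!response.ok) {
    throw new Error(`Sitemap request failed with status ${response.status}`);
  }
  return extractSitemapUrls(await response.text());
}
```

The resulting array of URLs could then be handed to whichever link checker is chosen.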

Parallelizing the checks is ideal, though if it adds significant complication it should not block completion of the ticket.
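One low-complication way to parallelize might be a small worker pool that keeps a fixed number of checks in flight. The sketch below is an assumption, not a committed design: `checkAll` and `headCheck` are hypothetical names, the default limit of 5 is arbitrary, and the `checkUrl` callback stands in for whatever checker is eventually used (Linkinator or otherwise). `headCheck` assumes the global `fetch` in Node 18 or later.

```javascript
'use strict';

// Hypothetical helper: run `checkUrl` over `urls` with at most `limit`
// checks in flight at once. Results keep the same order as `urls`.
async function checkAll(urls, checkUrl, limit = 5) {
  const results = new Array(urls.length);
  let next = 0;

  // Each worker repeatedly claims the next unclaimed index until none remain.
  async function worker() {
    while (next < urls.length) {
      const index = next++;
      const url = urls[index];
      try {
        results[index] = { url, ok: await checkUrl(url) };
      } catch (error) {
        // A thrown error (DNS failure, timeout, etc.) counts as a broken link.
        results[index] = { url, ok: false, error: String(error) };
      }
    }
  }

  const workerCount = Math.min(limit, urls.length);
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results;
}

// Assumed default checker: a HEAD request that treats any 2xx as healthy.
// Some servers reject HEAD, so a real checker may need a GET fallback.
async function headCheck(url) {
  const response = await fetch(url, { method: 'HEAD', redirect: 'follow' });
  return response.ok;
}
```

Broken links could then be listed with something like `(await checkAll(urls, headCheck)).filter((r) => !r.ok)`. Setting `limit` to 1 falls back to sequential checking, so the parallel path adds little extra complexity.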

timcosgrove changed the title from "Node.js script that crawls va.gov based on Sitemap and checks for broken links" to "Node.js script that crawls a Sitemap and checks for broken links" on Jan 3, 2024