A command-line tool to fetch Blacklight scans for a list of URLs. It queries the open-source Blacklight Collector directly and runs entirely locally.
```sh
nvm use
npm install
./blacklight-query urls.txt
```

where `urls.txt` has newline-separated absolute URLs to scan.
Write all URLs you wish to scan as absolute URLs (including protocol, domain, and path). Separate each URL with a newline.
```
https://www.themarkup.org
https://www.calmatters.org
```
You can also pipe your list of URLs.
```sh
echo "https://themarkup.org/" | ./blacklight-query
./blacklight-query < urls.txt
```
All of the `blacklight-collector` options can be specified using this tool by editing the `config` object in `main.ts`.
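As an illustration, an edited `config` object might look like the sketch below. The option names follow the collector's documented options, but the values are placeholders, not the actual contents of `main.ts`:

```typescript
// Sketch of a collector config; option names come from
// blacklight-collector's documentation, values are placeholders.
const config = {
  headless: true,                   // run the browser without a visible window
  outDir: "./outputs/example.com",  // where results for this scan are written
  numPages: 2,                      // also scan two randomly chosen subpages
};
```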
Out of the box, this tool sets the following options:
- `headless: true`: sets the collector to use a headless, behind-the-scenes browser.
- `outDir: ./outputs/[URL]`: specifies which directory the collector should store its results in, making use of the URL being scanned.
- `numPages: 0`: tells the collector not to scan any additional pages. Setting this to `1`, `2`, or `3` scans that number of randomly chosen pages that are accessible from the homepage.
Some other options you may find useful are:
- `emulateDevice`: specifies which device the collector should scan as.
- `headers`: allows you to set custom headers on the headless browser.
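As a hedged sketch, both options could be set together in the `config` object. This assumes `emulateDevice` accepts a Puppeteer device descriptor and `headers` a plain key-value object; check the collector's README for the exact shapes:

```typescript
// Hypothetical config fragment. The device name and header value are
// placeholders; KnownDevices is Puppeteer's built-in device catalog.
import { KnownDevices } from "puppeteer";

const config = {
  emulateDevice: KnownDevices["iPhone 13"],  // scan as a mobile device
  headers: { "Accept-Language": "en-US" },   // sent with every request
};
```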
Read the `blacklight-collector` README for a full list of options and their defaults.
All scans will be saved in the `outputs` folder, in subdirectories named for the hostname of the URL being scanned.
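Because folders are named for the hostname only, a scan of a deep URL lands in a folder named for its host. The mapping can be sketched in shell (this only illustrates hostname extraction; the tool's exact naming logic may differ):

```sh
# Extract the hostname portion of a URL, the way output folders are named.
url="https://www.themarkup.org/series/blacklight"
hostname=$(echo "$url" | sed -E 's#^[a-z]+://([^/]+).*#\1#')
echo "outputs/$hostname"   # → outputs/www.themarkup.org
```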
Be aware that the Collector is fairly resource-heavy and may slow down your computer. We recommend scanning smaller lists if your hardware becomes overtaxed.
Run the test suite with:

```sh
npm run test
```