Ignoring URLs

You can provide an options to ignore one or more domains or paths, and you can do that with a help of the -d and -p options (--domain and --path if full version). Both options, are not required. Links that match domain or path are placed into “ignored” results group.

Both - domains and paths compared via simple string comparison, regular expressions or glob masks aren’t working.

# Setup used to check "opsdroid" project documentation
#
# `deadlinks` will check URL <https://docs.opsdroid.dev/en/latest/> and all pages
# found on it (including external URLs), but not ones with domains that match
# "redis.io" or "opencollective.com" or path matching "edit/master", after
# crawling - program will show a list of ignored URLs.

deadlinks https://docs.opsdroid.dev/en/latest/ -n4 -e \
    -d redis.io -d opencollective.com \
    -p edit/master -s ignored

Changelog:

  • Prior version 0.2.0 --domain was --domains

  • Prior version 0.2.0 --path was --pathes