Honour robots.txt #26

cesine · 2020-07-17T19:34:41Z

We noticed this bot was crawling/fetching app-ads.txt on one of our dev sites.

::ffff:127.0.0.1 - - [17/Jul/2020:19:29:54 +0000] "GET /app-ads.txt HTTP/1.0" 404 - "-" "AdsTxtCrawler/1.0; +https://github.com/InteractiveAdvertisingBureau/adstxtcrawler"

Add instructions to the README on how to disallow the bot by adding an entry to your robots.txt:

User-agent: AdsTxtCrawler
Disallow: /

Fetch the site's robots.txt before crawling and honour it if it contains a Disallow rule that applies to the bot
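A minimal sketch of what that check could look like, using Python's standard-library `urllib.robotparser`. This is not the crawler's actual code: the function names `allowed_by_robots` and `fetch_and_check`, the `CRAWLER_UA` constant, and the example hostname are all hypothetical, and the product token `AdsTxtCrawler` is taken from the user-agent string in the log line above.

```python
from urllib import robotparser
from urllib.parse import urljoin

# Assumed product token, taken from the User-Agent seen in the access log.
CRAWLER_UA = "AdsTxtCrawler"


def allowed_by_robots(robots_body, url, user_agent=CRAWLER_UA):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = robotparser.RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_body.splitlines())
    return parser.can_fetch(user_agent, url)


def fetch_and_check(site_root, path="/app-ads.txt", user_agent=CRAWLER_UA):
    """Download robots.txt from site_root and check whether path may be crawled.

    Hypothetical helper: a real crawler would also want timeouts and
    handling for network errors raised by read().
    """
    parser = robotparser.RobotFileParser()
    parser.set_url(urljoin(site_root, "/robots.txt"))
    parser.read()  # performs the HTTP request
    return parser.can_fetch(user_agent, urljoin(site_root, path))


if __name__ == "__main__":
    body = "User-agent: AdsTxtCrawler\nDisallow: /\n"
    print(allowed_by_robots(body, "https://dev.example.com/app-ads.txt"))
```

With the two-line robots.txt entry suggested above, the check returns `False` for any path on the site, so the crawler would skip the request instead of producing a 404 in the site's logs.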

berjayasompo mentioned this issue Oct 29, 2020

We noticed this bot was crawling/fetching app-ads.txt one of our dev sites. berjayasompo/stayatthome#1
