add custom robots.txt to block indexing /user and AI bots #55

thepsalmist · 2025-01-08T09:50:41Z

This PR creates a custom robots.txt overriding CKAN's default robots.txt
We'd like to block indexing &/ crawling on /user and block generic AI bots not in Cloudflare's Verified bot's list

kilemensi

👍🏽 ... based on the default robots.txt link you shared, shouldn't we just be overriding specific blocks (all_user_agents, additional_user_agents, etc.) rather than creating a completely new robots.txt file?

kilemensi · 2025-01-09T08:58:20Z

ckanext/openafrica/templates/robots.txt

+Disallow: /
+
+
+# Generatedy by RoboShield (https://roboshield.trustlab.africa)


RoboShield FTW!

kilemensi · 2025-01-09T09:00:08Z

ckanext/openafrica/templates/robots.txt

+Disallow: /user/
+
+# Amazonbot
+User-agent: Amazonbot


Comparing to other sites, we may be missing some crawlers
The Guardian, Washington Post, BBC, etc.

add custom robots.txt to block indexing /user and AI bots

eb89c44

thepsalmist added the enhancement label Jan 8, 2025

thepsalmist self-assigned this Jan 8, 2025

thepsalmist added 4 commits January 8, 2025 12:51

add robots.txt file

9c132d7

render robot txt as txt type

6de3779

render robot txt as txt type

90d6567

render robot txt as txt type

a6d4bd2

thepsalmist requested a review from a team January 9, 2025 06:14

kilemensi reviewed Jan 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add custom robots.txt to block indexing /user and AI bots #55

add custom robots.txt to block indexing /user and AI bots #55

thepsalmist commented Jan 8, 2025

kilemensi left a comment

kilemensi Jan 9, 2025

kilemensi Jan 9, 2025

		Disallow: /


		# Generatedy by RoboShield (https://roboshield.trustlab.africa)

add custom robots.txt to block indexing /user and AI bots #55

Are you sure you want to change the base?

add custom robots.txt to block indexing /user and AI bots #55

Conversation

thepsalmist commented Jan 8, 2025

kilemensi left a comment

Choose a reason for hiding this comment

kilemensi Jan 9, 2025

Choose a reason for hiding this comment

kilemensi Jan 9, 2025

Choose a reason for hiding this comment