author     Jason A. Donenfeld  2013-05-28 14:17:00 +0200
committer  Jason A. Donenfeld  2013-08-12 13:14:10 -0600
commit     23debef62104c70600be2b745ec3957538eeac6e
tree       3b68eab7624907836b0e614328a529e686224830
parent     use favicon by default
robots.txt: disallow access to snapshots
My dmesg is filled with the oom killer bringing down processes while the
Bingbot downloads every snapshot for every commit of the Linux kernel in
tar.xz format. Sure, I should be running with memory limits, and now I'm
using cgroups, but a more general solution is to prevent crawlers from
wasting resources like that in the first place.

Suggested-by: Natanael Copa <ncopa@alpinelinux.org>
Suggested-by: Julius Plenz <plenz@cis.fu-berlin.de>
Signed-off-by: Jason A. Donenfeld <Jason@zx2c4.com>
-rw-r--r--  Makefile    1
-rw-r--r--  robots.txt  3
2 files changed, 4 insertions, 0 deletions
diff --git a/Makefile b/Makefile
index 00b3269..f11b60f 100644
--- a/Makefile
+++ b/Makefile
@@ -78,6 +78,7 @@ install: all
 	$(INSTALL) -m 0644 cgit.css $(DESTDIR)$(CGIT_DATA_PATH)/cgit.css
 	$(INSTALL) -m 0644 cgit.png $(DESTDIR)$(CGIT_DATA_PATH)/cgit.png
 	$(INSTALL) -m 0644 favicon.ico $(DESTDIR)$(CGIT_DATA_PATH)/favicon.ico
+	$(INSTALL) -m 0644 robots.txt $(DESTDIR)$(CGIT_DATA_PATH)/robots.txt
 	$(INSTALL) -m 0755 -d $(DESTDIR)$(filterdir)
 	$(COPYTREE) filters/* $(DESTDIR)$(filterdir)
 
diff --git a/robots.txt b/robots.txt
new file mode 100644
index 0000000..4ce948f
--- /dev/null
+++ b/robots.txt
@@ -0,0 +1,3 @@
+User-agent: *
+Disallow: /*/snapshot/*
+Allow: /
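
To illustrate how a wildcard-aware crawler might evaluate the new rules, here is a minimal Python sketch. It is not part of this commit. It assumes the matching semantics described in RFC 9309 (`*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` and the example paths are hypothetical. Crawlers that do not implement wildcards may interpret the Disallow line differently, and Python's standard `urllib.robotparser` is not used here for that reason.

```python
import re

# The rules added by this commit, as (directive, path pattern) pairs.
RULES = [("disallow", "/*/snapshot/*"), ("allow", "/")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on a tie, allow wins;
    a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


if __name__ == "__main__":
    # Hypothetical request paths, chosen only for illustration.
    for path in ("/linux/snapshot/linux-3.10.tar.xz", "/linux/log/", "/"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

Under these assumptions, snapshot URLs beneath any repository path are disallowed because the longer `/*/snapshot/*` pattern outranks the catch-all `Allow: /`, while log, tree, and commit pages stay crawlable.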