lilypond-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: LilyPond.org down?


From: Jan Nieuwenhuizen
Subject: Re: LilyPond.org down?
Date: Thu, 08 Aug 2013 08:36:10 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/24.2 (gnu/linux)

Phil Holmes writes:

> I'm not aware of any changes being made, unless it's the regularity of
> the upgrades and people downloading the PDFs more often.

It turns out that it's the notorious AhrefsBot.  I fixed .htaccess but
we need this patch below.  Please apply.

Greetings,
Jan

>From 08c4b0e80428db285ba3865d4ea795fbdf2d17ff Mon Sep 17 00:00:00 2001
From: Jan Nieuwenhuizen <address@hidden>
Date: Thu, 8 Aug 2013 08:34:12 +0200
Subject: [PATCH] [Web]: Deny rogue crawler AhrefsBot.  Fixes web load.

The AhrefsBot is checking crawling files every second, including
all binaries.  This increases the load in an unacceptable way.
---
 Documentation/web/server/lilypond.org.htaccess | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/Documentation/web/server/lilypond.org.htaccess 
b/Documentation/web/server/lilypond.org.htaccess
index b4780a9..5e7dfae 100644
--- a/Documentation/web/server/lilypond.org.htaccess
+++ b/Documentation/web/server/lilypond.org.htaccess
@@ -23,6 +23,10 @@ RewriteEngine On
 RewriteCond %{HTTP_USER_AGENT} httrack [NC]
 RewriteRule ^.*/source/.*$ /please-respect-robots.txt.html [L]
 
+# Deny rogue crawler
+RewriteCond %{HTTP_USER_AGENT} ^(.*)AhrefsBot(.*) [NC]
+RewriteRule .* - [F,L]
+
 # Permanent top level entry points -- ./doc
 RedirectMatch ^/music-glossary /glossary
 RedirectMatch ^/tutorial /learning
-- 
1.8.1.2

-- 
Jan Nieuwenhuizen <address@hidden> | GNU LilyPond http://lilypond.org
Freelance IT http://JoyofSource.com | AvatarĀ®  http://AvatarAcademy.nl  

reply via email to

[Prev in Thread] Current Thread [Next in Thread]