[Evergreen-general] Dealing with significant traffic increase caused by AI bots

Linda Jansová linda.jansova at gmail.com
Fri Apr 19 07:18:11 EDT 2024


Thank you very much, Jane!

We will certainly give fail2ban a try, though - as we use Apache - some 
implementation details will probably be a bit different :-).

Linda

On 4/19/24 13:05, Jane Sandberg wrote:
> Hi Linda,
>
> It's not for Evergreen, but my colleague recently blocked claudebot 
> using fail2ban on our load balancer 
> <https://github.com/pulibrary/princeton_ansible/commit/6f9009249a168442391d90e2b75028d40a8a9e91>.  
> Essentially, fail2ban is configured to watch Nginx's access log, and 
> if more than 10 claudebot requests appear within the past minute from 
> a particular IP, it automatically blocks all requests from that IP for 
> the next 24 hours.  I would think that something similar could work 
> for Apache's access log.
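
The rule described above (more than 10 claudebot requests from one IP within a minute leads to a 24-hour block) can be sketched in a few lines of Python. This is only an illustration of the threshold logic, not fail2ban itself and not the configuration from the linked commit. It assumes Apache's "combined" log format; the `find_bans` function and the `LOG_LINE` pattern are hypothetical names made up for this sketch.

```python
import re
from collections import defaultdict, deque
from datetime import datetime, timedelta

# Thresholds taken from the description above.
MAX_REQUESTS = 10                   # ban once an IP exceeds this many hits...
FIND_TIME = timedelta(minutes=1)    # ...within this window
BAN_TIME = timedelta(hours=24)      # length of the ban

# Assumption: the access log uses Apache's "combined" format.
# Captures the client IP, the timestamp, and the user agent.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] "[^"]*" \d{3} \S+ '
    r'"[^"]*" "(?P<ua>[^"]*)"'
)


def find_bans(lines, bot_name="claudebot"):
    """Return {ip: ban_expiry} for IPs that exceed the request threshold."""
    recent = defaultdict(deque)  # ip -> timestamps of recent bot requests
    bans = {}
    for line in lines:
        m = LOG_LINE.match(line)
        if not m or bot_name not in m["ua"].lower():
            continue  # unparseable line, or not the bot we are watching
        ip = m["ip"]
        ts = datetime.strptime(m["ts"], "%d/%b/%Y:%H:%M:%S %z")
        if ip in bans and ts < bans[ip]:
            continue  # still banned; a real firewall would drop this request
        hits = recent[ip]
        hits.append(ts)
        # Discard hits that have fallen out of the one-minute window.
        while ts - hits[0] > FIND_TIME:
            hits.popleft()
        if len(hits) > MAX_REQUESTS:
            bans[ip] = ts + BAN_TIME
            hits.clear()
    return bans
```

Calling `find_bans(open("/var/log/apache2/access.log"))` (the path is an assumption and varies by distribution) would return the offending IPs with their ban expiry times. In practice fail2ban does this counting itself and applies the block at the firewall level, so the sketch is only meant to make the rule concrete.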
>
> Good luck with the bots!
>
>   -Jane
>
> On Fri, Apr 19, 2024 at 3:42 a.m., Linda Jansová via 
> Evergreen-general (evergreen-general at list.evergreen-ils.org) wrote:
>
>     Dear all,
>
>     Have any of you encountered extensive crawling by Bytespider and
>     Bytedance (see e.g.,
>     https://wordpress.org/support/topic/psa-bytedance-and-bytespider-bots-recommend-blocking/),
>     Claudebot or other AI bots?
>
>     If so, do you have a secret recipe for blocking these crawlers from
>     accessing the site?
>
>     Thank you very much for sharing your experience!
>
>     Linda
>