<div dir="ltr">Hi Linda,<div><br></div><div>It's not for Evergreen, but my colleague <a href="https://github.com/pulibrary/princeton_ansible/commit/6f9009249a168442391d90e2b75028d40a8a9e91">recently blocked claudebot using fail2ban on our load balancer</a>.  Essentially, fail2ban is configured to watch Nginx's access log, and if more than 10 claudebot requests appear within the past minute from a particular IP, it automatically blocks all requests from that IP for the next 24 hours.  I would think that something similar could work for Apache's access log.</div><div><br></div><div>Good luck with the bots!</div><div><br></div><div>  -Jane</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">El vie, 19 abr 2024 a la(s) 3:42 a.m., Linda Jansová via Evergreen-general (<a href="mailto:evergreen-general@list.evergreen-ils.org">evergreen-general@list.evergreen-ils.org</a>) escribió:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Dear all,<br>

<br>

Have any of you encountered an extensive crawling by Bytespider and <br>

Bytedance (see e.g., <br>

<a href="https://wordpress.org/support/topic/psa-bytedance-and-bytespider-bots-recommend-blocking/" rel="noreferrer" target="_blank">https://wordpress.org/support/topic/psa-bytedance-and-bytespider-bots-recommend-blocking/</a>), <br>

Claudebot or other AI bots?<br>

<br>

If so, do you have any secret recipe how to disable the crawler from <br>

accessing the site?<br>

<br>

Thank you very much for sharing your experience!<br>

<br>

Linda<br>

<br>

_______________________________________________<br>

Evergreen-general mailing list<br>

<a href="mailto:Evergreen-general@list.evergreen-ils.org" target="_blank">Evergreen-general@list.evergreen-ils.org</a><br>

<a href="http://list.evergreen-ils.org/cgi-bin/mailman/listinfo/evergreen-general" rel="noreferrer" target="_blank">http://list.evergreen-ils.org/cgi-bin/mailman/listinfo/evergreen-general</a><br>

</blockquote></div>