StatusNethttp://rainbowdash.net/api/statusnet/conversation/3169624.atomConversation2024-03-29T14:46:03+00:00http://activitystrea.ms/schema/1.0/commenthttp://oracle.skilledtests.com/notice/531861@mk the only halfway useful reference site I've found is http://www.botsvsbrowsers.com/ - practically everything else turned up by searching is regurgitated lists of deceased bots that everyone keeps copying from each other. Anyway, I now have a much reduced list of bots to ban...@<span class="vcard"><a href="http://oracle.skilledtests.com/user/2368" class="url" title="Temporary Marjolein"><span class="fn nickname mention">mk</span></a></span> the only halfway useful reference site I've found is <a href="http://www.botsvsbrowsers.com/" title="http://www.botsvsbrowsers.com/" rel="nofollow external">http://www.botsvsbrowsers.com/</a> - practically everything else turned up by searching is regurgitated lists of deceased bots that everyone keeps copying from each other. Anyway, I now have a much reduced list of bots to ban...http://activitystrea.ms/schema/1.0/post2014-04-19T08:39:24+00:002014-04-19T08:39:24+00:00http://activitystrea.ms/schema/1.0/personhttp://oracle.skilledtests.com/user/2368mkmkmkTemporarily lost photographer. Thanks to Erkan for providing me with a temporary home while I have not been able to set up my own SN instance yet. [note: I do use some 'self tags' and 'people tags' but I do *not* use any "lists" - those are a SN misinterpretation of those tags]Amsterdam, NL, Terrahomepagehttp://wiki.unsim.pl/truehttp://activitystrea.ms/schema/1.0/notehttp://oracle.skilledtests.com/notice/531766I am endlessly annoyed by all the lists of "bad bots" that should be "blocked" from your site. it seems everyone keeps copying everyone else, repeating in their list generic tools like wget, a python library or even 'pyhon' and ancient bad biot that haven't been used for years (often a decade or more) because everyone was blocking them anyway. I have not found a single page yet that actually validates each item on the list by stating what is "bad" about its behavior (if it even exists any more)... there used to be a page like that (but that was a decade ago, too). Of course there are a few "known" bad bots that ignore robots.txt or just hammer your site (Alexa's ia_archiver is one) - but endlessly growing lists of supposed bad bots will do nothing for your site except slow it down. #rantI am endlessly annoyed by all the lists of "bad bots" that should be "blocked" from your site. it seems everyone keeps copying everyone else, repeating in their list generic tools like wget, a python library or even 'pyhon' and ancient bad biot that haven't been used for years (often a decade or more) because everyone was blocking them anyway. I have not found a single page yet that actually validates each item on the list by stating what is "bad" about its behavior (if it even exists any more)... there used to be a page like that (but that was a decade ago, too). Of course there are a few "known" bad bots that ignore robots.txt or just hammer your site (Alexa's ia_archiver is one) - but endlessly growing lists of supposed bad bots will do nothing for your site except slow it down. #<span class="tag"><a href="http://oracle.skilledtests.com/tag/rant" rel="tag">rant</a></span>http://activitystrea.ms/schema/1.0/post2014-04-19T06:28:19+00:002014-04-19T06:28:19+00:00http://activitystrea.ms/schema/1.0/personhttp://oracle.skilledtests.com/user/2368mkmkmkTemporarily lost photographer. Thanks to Erkan for providing me with a temporary home while I have not been able to set up my own SN instance yet. [note: I do use some 'self tags' and 'people tags' but I do *not* use any "lists" - those are a SN misinterpretation of those tags]Amsterdam, NL, Terrahomepagehttp://wiki.unsim.pl/true