If anybody has experience with "effectively" blocking them please share.
In my efforts I have blocked user agents, first tried to emulate http.conf in htaccess with
SetEnvIfNoCase User-Agent "^Baiduspider" block_bot Order Allow,Deny Allow from All Deny from env=block_bot
In conjuction with pure .htaccess user agent block:
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} ^Baiduspider [NC]
RewriteRule .* - [F]
Finally resorting to baning IP's and host:
order allow,deny deny from *.baidu.com deny from 203.125.234. deny from 220.181.7. deny from 123.125.66. deny from 123.125.71. deny from 119.63.192. deny from 119.63.193. deny from 119.63.194. deny from 119.63.195. deny from 119.63.196. deny from 119.63.197. deny from 119.63.198 deny from 119.63.199. deny from 180.76.5. deny from 202.108.249.185 deny from 202.108.249.177 deny from 202.108.249.182 deny from 202.108.249.184 deny from 202.108.249.189 deny from 61.135.146.200 deny from 61.135.145.221 deny from 61.135.145.207 deny from 202.108.250.196 deny from 68.170.119.76 deny from 207.46.199.52 allow from all
Yet, Baidu appears to be masking itself under different ISPs. I've seen msn, kimsufi.com, and now wowrack.com as the ISP but the user agent is still baiduspider. No idea how they are getting around my user agent blocks but they are..
This is currently on my site:
208-115-111-72-reverse.wowrack.com
IP address: 208.115.111.72
User agent: Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
I can of course ban this IP but they seem to have limitless ISPs and IPs to draw from, and we don't really like banning IP's unless they are from another country in which we don't do business.
Edited by NodsDorf, 17 January 2012, 00:15.















