-
10th May 2010, 08:02 AM #1 OP
Block Spider
I want to block the Baidu spider. I googled it and found some results, but it seems the Baidu spider doesn't obey robots.txt. I also found an .htaccess block for it, but I'm still not sure whether it will actually block it or not...
Do you guys have any suggestions?
-
10th May 2010, 08:15 AM #2 Member
Three different methods here, with pros and cons:
Code: http://www.simplemachines.org/community/index.php?topic=350439.0
Is it really using so much bandwidth that you have to block it?
-
10th May 2010, 08:18 AM #3 OP
Yeah, I checked the SMF forums. No, it doesn't use a lot of bandwidth, but it's always on my site, and the worst thing is that it never indexes it...
-
10th May 2010, 08:24 AM #4 Member
Check carefully in that thread; it may not be Baidu but a fake bot using that user agent.
If so, you would need to check the IP in your logs and then ban it.
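To find which IPs to check, one option is to scan the access log for requests whose user agent claims to be Baiduspider. The following is only a rough sketch: it assumes the server writes the common "combined" log format (IP first, user agent as the last quoted field), and the sample lines, IP addresses, and the function name `baidu_claiming_ips` are made up for illustration.

```python
import re
from collections import Counter

# Assumption: "combined" log format, where the client IP is the first field
# and the user agent is the last quoted field on the line.
LOG_LINE = re.compile(r'^(?P<ip>\S+) .* "(?P<agent>[^"]*)"\s*$')


def baidu_claiming_ips(lines):
    """Count requests per IP whose user agent claims to be Baiduspider."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match and "baiduspider" in match.group("agent").lower():
            counts[match.group("ip")] += 1
    return counts


# Illustrative log lines only; the IPs come from documentation-reserved ranges.
sample = [
    '192.0.2.10 - - [10/May/2010:08:02:00 +0000] "GET / HTTP/1.1" 200 512 "-" '
    '"Baiduspider+(+http://www.baidu.com/search/spider.htm)"',
    '192.0.2.10 - - [10/May/2010:08:03:00 +0000] "GET /forum HTTP/1.1" 200 900 "-" '
    '"Baiduspider+(+http://www.baidu.com/search/spider.htm)"',
    '198.51.100.7 - - [10/May/2010:08:04:00 +0000] "GET / HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0"',
]

print(baidu_claiming_ips(sample).most_common())
```

The user agent string is trivial to fake, so the IPs this turns up are only candidates; each one still needs to be traced to its owner before deciding whether to ban it.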
-
10th May 2010, 08:29 AM #5 OP
Yeah, maybe that's the only way.
-
10th May 2010, 08:45 AM #6 Respected Developer
Website: wrzc.org
Baidu does follow robots.txt rules and will not index your site if you don't want it to, but remember that it can take several weeks for it to update its search results, so it can take a while for your site to be removed.
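As a quick sanity check before waiting weeks for the index to update, the robots.txt rules can be tested locally with Python's standard `urllib.robotparser`. This is just a sketch: the robots.txt content, the `example.com` URL, and the helper name `allowed` are assumptions for illustration, and it only shows how a well-behaved crawler should read the file, not what any particular bot actually does.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: shut out Baiduspider, allow everyone else.
ROBOTS_TXT = """\
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow:
"""


def allowed(robots_txt, agent, url):
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


print(allowed(ROBOTS_TXT, "Baiduspider", "http://example.com/index.php"))  # False
print(allowed(ROBOTS_TXT, "Googlebot", "http://example.com/index.php"))    # True
```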
Why exactly do you want to block Baidu? Putting an exact figure on its size is hard, as it's very secretive and based in a country where information is hard to get, but most believe it's the second-biggest search engine after Google. That would make it bigger than Yahoo and Bing, which means you're rejecting a huge potential market.
Tutorial How to SEO your Warez Site a guide to help you increase your organic traffic
Huge list of Warez Sites and free Multiposter Templates
-
11th May 2010, 08:32 AM #7 OP
^^ I would be glad if it got indexed, but it doesn't, and there are at least 5 Baidu spiders on the site all the time. So I just wanted to know whether they are really Baidu spiders or other bots.
-
12th May 2010, 12:13 AM #8 Respected Member
The easiest way is to get the IP and trace it back to its owner. Network-tools.com will do that for you.
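Tracing an IP back to its owner can also be scripted with a reverse DNS lookup followed by a forward lookup to confirm it. The sketch below assumes that genuine Baidu crawlers reverse-resolve to hostnames under `baidu.com` or `baidu.jp`; that suffix list, the function names, and the made-up hostname in the demo are assumptions, so check them against Baidu's own webmaster documentation before relying on them. The demo uses stub resolvers so it runs without network access; leaving the defaults in place uses real DNS.

```python
import socket

# Assumption: real Baidu crawlers reverse-resolve to one of these domains.
BAIDU_SUFFIXES = (".baidu.com", ".baidu.jp")


def hostname_is_baidu(hostname):
    """True if the hostname ends in one of the assumed Baidu domains."""
    return hostname.lower().rstrip(".").endswith(BAIDU_SUFFIXES)


def is_real_baiduspider(ip, reverse=socket.gethostbyaddr,
                        forward=socket.gethostbyname_ex):
    """Reverse-resolve the IP, check the domain, then forward-confirm it."""
    try:
        hostname = reverse(ip)[0]
        if not hostname_is_baidu(hostname):
            return False
        # The forward lookup must map the hostname back to the same IP;
        # otherwise the reverse record could simply be forged.
        return ip in forward(hostname)[2]
    except OSError:  # covers socket.herror and socket.gaierror
        return False


# Offline demo with stub resolvers (hostname and IPs are illustrative only).
def fake_reverse(ip):
    return ("baiduspider-192-0-2-10.crawl.baidu.com", [], [ip])


def fake_forward(hostname):
    return (hostname, [], ["192.0.2.10"])


print(is_real_baiduspider("192.0.2.10", fake_reverse, fake_forward))
print(is_real_baiduspider("198.51.100.7", fake_reverse, fake_forward))
```

The second call fails the forward confirmation, which is the case of a bot whose reverse record claims Baidu but whose hostname does not resolve back to it.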
-
12th May 2010, 12:20 AM #9 Respected Developer
Website: wrzc.org
Oh right. Yeah, I used to do that for Google: I had a script that scanned forums and gained access by pretending to be Google. It's possible that it's not a real Baidu spider. As Lock Down said, if you take any of the IPs in your logs, they should trace back to Baidu. If not, then you know someone is crawling your site pretending to be Baidu. This is fairly rare, though, and I've never heard of anyone pretending to be Baidu; it's obviously better to pretend to be Google. Generally Baidu doesn't index much, though, and it's easily possible for them to have 5 of their spiders on your site.