You are missing our premiere tool bar navigation system! Register and use it for FREE!

NukeCops  
•  Home •  Downloads •  Gallery •  Your Account •  Forums • 
Readme First
- Readme First! -

Read and follow the rules, otherwise your posts will be closed
Modules
· Home
· FAQ
· Buy a Theme
· Advertising
· AvantGo
· Bookmarks
· Columbia
· Community
· Donations
· Downloads
· Feedback
· Forums
· PHP-Nuke HOWTO
· Private Messages
· Search
· Statistics
· Stories Archive
· Submit News
· Surveys
· Theme Gallery
· Top
· Topics
· Your Account
Who's Online
There are currently, 59 guest(s) and 1 member(s) that are online.

You are Anonymous user. You can register for free by clicking here
Nuke Cops :: View topic - FAO: Freelinuxer - Module Exclusion Tip. [ ]
 Forum FAQ  •  Search  •   •  Memberlist  •  Usergroups   •  Register  •  Profile •    •  Log in to check your private messages  •  Log in

 
Post new topic  Reply to topicprinter-friendly view
View previous topic Log in to check your private messages View next topic
Author Message
GibsonXXI
Private
Private


Joined: Apr 25, 2004
Posts: 48

Location: United Kingdom

PostPosted: Fri Mar 11, 2005 9:00 am Reply with quoteBack to top

Seeing as the robots.txt file can only block by folder, you just need to specify the literal path to the folder you want to include.

Example:

Disallow: /modules/Gallery/

to disallow the Gallery module file folder from being indexed.

However, this won't stop bad bots that ignore the robots.txt file altogether, and there are quite a few. Including quite a few email harvester bots. The best way to stop these is by using mod rewrites to send them to another site/page altogether. or by placing custom environment handlers in your .htacess file to block them, but you will suffer a performance hit doing it like this.

A tutorial on this has been posted here before, do a search on the forum and see if you can find it.

Also do a search on the net and look up what ways you can block bad spider-bots from getting anywhere near your site, let alone certain modules.

_________________
"Sic vis pacem para bellum!"
RAF71_Hornet / GibsonXXI
Find all posts by GibsonXXIView user's profileSend private messageVisit poster's websiteYahoo MessengerMSN MessengerICQ Number
Imago
Captain
Captain


Joined: Jan 17, 2003
Posts: 629

Location: Europe

PostPosted: Sat Mar 19, 2005 9:01 am Reply with quoteBack to top

I am using robots.txt two years now and closely monitoring the sites. So far no problems with bad bots. Better run this risk than overloading the CPU with tons of rules to folow from .htaccess

_________________
www.vdsp.net | www.indopedia.org | www.orientalia.org | www.indology.net | www.yogadarsana.org | www.husserl.info | www.medicum.net
Find all posts by ImagoView user's profileSend private messageVisit poster's website
Display posts from previous:      
Post new topic  Reply to topicprinter-friendly view
View previous topic Log in to check your private messages View next topic
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum



Powered by phpBB © 2001, 2005 phpBB Group

Ported by Nuke Cops © 2003 www.nukecops.com
:: FI Theme :: PHP-Nuke theme by coldblooded (www.nukemods.com) ::
Powered by · TOGETHER TEAM srl ITALY http://www.togetherteam.it · DONDELEO E-COMMERCE http://www.DonDeLeo.com
Web site engine's code is Copyright © 2002 by PHP-Nuke. All Rights Reserved. PHP-Nuke is Free Software released under the GNU/GPL license.
Page Generation: 0.165 Seconds - 295 pages served in past 5 minutes. Nuke Cops Founded by Paul Laudanski (Zhen-Xjell)
:: FI Theme :: PHP-Nuke theme by coldblooded (www.nukemods.com) ::