Nuke Cops :: View topic - FAST-WebCrawler and/or ia_archiver
chris-au
Elite Nuker
Joined: Jan 31, 2003
Posts: 717

Posted: Sun Aug 10, 2003 10:34 pm

Can anybody answer this?

I get a lot of visitors like:

209.237.238.173 - - [11/Aug/2003:13:26:45 +1000] "GET /modules.php?name=My_eGallery&file=index&do=upload HTTP/1.0" 200 22660 "-" "ia_archiver"

and:

66.77.73.89 - - [11/Aug/2003:10:36:26 +1000] "GET /robots.txt HTTP/1.0" 200 297 "-" "FAST-WebCrawler/3.7/FirstPage (atw-crawler at fast dot no;http://fast.no/support/crawler.asp)"

Is this something to worry about?
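
For anyone who wants to see how often such bots show up, here is a minimal sketch that tallies user agents from log lines like the two above. It assumes the log is in Apache "combined" format, which is what those lines look like; the names `LOG_PATTERN` and `count_user_agents` are made up for illustration.

```python
import re
from collections import Counter

# Assumed layout (Apache "combined" format):
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)

def count_user_agents(lines):
    """Count hits per user-agent string, skipping lines that do not match."""
    counts = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line.strip())
        if match:
            counts[match.group("agent")] += 1
    return counts

# The two log lines quoted in this post.
sample = [
    '209.237.238.173 - - [11/Aug/2003:13:26:45 +1000] "GET /modules.php?name=My_eGallery&file=index&do=upload HTTP/1.0" 200 22660 "-" "ia_archiver"',
    '66.77.73.89 - - [11/Aug/2003:10:36:26 +1000] "GET /robots.txt HTTP/1.0" 200 297 "-" "FAST-WebCrawler/3.7/FirstPage (atw-crawler at fast dot no;http://fast.no/support/crawler.asp)"',
]

for agent, hits in count_user_agents(sample).most_common():
    print(hits, agent)
```

To run it over a real log, pass an open file object instead of `sample`; the file path depends on the server setup.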

_________________
Chris
Zhen-Xjell
Nuke Cops Founder
Joined: Nov 14, 2002
Posts: 5939

Posted: Mon Aug 11, 2003 5:07 am

http://nukecops.com/postt537.html

_________________
Paul Laudanski, Microsoft MVP Windows-Security
CastleCops: [de] [en] [wiki]
MikeMiles
Lieutenant
Joined: May 29, 2003
Posts: 231

Posted: Mon Aug 11, 2003 5:29 am

ia_archiver is Alexa's bot. Its primary mission is to build a huge archive of the net, and it feeds the Wayback Machine at http://www.archive.org/ . The results of its indexing usually do not show up for about six months. Alexa also runs a third-rate search engine at http://www.alexa.com/ , which is powered by Google, with some of the archive pages peppered into the results as "related searches." Alexa also takes the information it indexes from everyone's websites and sells it to other people.

IMO this bot is a bandwidth hog which doesn't return that much traffic for the amount it eats. They sell your webpages' content and images (copyrighted material) to others without your permission and without giving you a commission. This bot has always been on my ban list. If you're an Amazon affiliate, their TOS forbids you from banning it because they use it to do spot checks for their affiliate program. Some here like this bot, but I don't.

The second one, FAST-WebCrawler, crawls HTML documents, pictures, video, and audio. It's the bot used for FAST Web Search ( http://www.fastsearch.com/us/products/fast_web_search/crawler_faq ), which powers the AlltheWeb search engine and a bunch of small search engines primarily located outside the U.S.

Both of these bots respect robots.txt.
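
Since both bots honor robots.txt, banning one of them is a matter of adding a rule for its user agent. Below is a rough sketch that uses Python's standard `urllib.robotparser` to check what a given set of rules would allow before you upload it. The rules themselves, the `/admin.php` path, and the `example.com` host are assumptions for illustration only, not anyone's actual configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: ban ia_archiver entirely and keep
# every other bot out of one (made-up) admin page.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /admin.php
"""

def can_fetch(agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt permits `agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)

# example.com stands in for your own site.
for agent in ("ia_archiver", "FAST-WebCrawler/3.7/FirstPage"):
    for path in ("/modules.php?name=Forums", "/admin.php"):
        allowed = can_fetch(agent, "http://example.com" + path)
        print(agent, path, "allowed" if allowed else "blocked")
```

This only tells you what a well-behaved bot should do with those rules. A bot that ignores robots.txt has to be blocked at the server instead.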