You are missing our premiere tool bar navigation system! Register and use it for FREE!

NukeCops  
•  Home •  Downloads •  Gallery •  Your Account •  Forums • 
Readme First
- Readme First! -

Read and follow the rules, otherwise your posts will be closed
Modules
· Home
· FAQ
· Buy a Theme
· Advertising
· AvantGo
· Bookmarks
· Columbia
· Community
· Donations
· Downloads
· Feedback
· Forums
· PHP-Nuke HOWTO
· Private Messages
· Search
· Statistics
· Stories Archive
· Submit News
· Surveys
· Theme Gallery
· Top
· Topics
· Your Account
Who's Online
There are currently, 78 guest(s) and 0 member(s) that are online.

You are Anonymous user. You can register for free by clicking here
Nuke Cops :: View topic - email when search engines spider [ ]
 Forum FAQ  •  Search  •   •  Memberlist  •  Usergroups   •  Register  •  Profile •    •  Log in to check your private messages  •  Log in

 
Post new topic  Reply to topicprinter-friendly view
View previous topic Log in to check your private messages View next topic
Author Message
antipas
Private
Private


Joined: Mar 10, 2003
Posts: 40


PostPosted: Sun Apr 06, 2003 8:14 pm Reply with quoteBack to top

Insert in a .php page that will email you when Google and other search engines are indexing that page
On my page I added this to the index.php file but it could be added anywhere you want.

Code:


bot.php (change the $to var with your email address)
Code / Sample:
<?php

   $botlist = array(   
               "Teoma",                   
               "alexa",
               "froogle",
               "inktomi",
               "looksmart",
               "URL_Spider_SQL",
               "Firefly",
               "NationalDirectory",
               "Ask Jeeves",
               "TECNOSEEK",
               "InfoSeek",
               "WebFindBot",
               "girafabot",
               "crawler",
               "www.galaxy.com",
               "Googlebot",
               "Scooter",
               "Slurp",
               "appie",
               "FAST",
               "WebBug",
               "Spade",
               "ZyBorg",
               "rabaz");


   foreach($botlist as $bot) {
       
      if(ereg($bot, $HTTP_USER_AGENT)) {
       
         if($bot == "Googlebot") {
            if (substr($REMOTE_HOST, 0, 11) == "216.239.46.") $bot = "Googlebot Deep Crawl";
            elseif (substr($REMOTE_HOST, 0,7) == "64.68.8") $bot = "Google Freshbot";
         }
         if ($QUERY_STRING != "") {
            $url = "http://" . $SERVER_NAME . $PHP_SELF . "?" . $QUERY_STRING . "";
         } else {
            $url = "http://" . $SERVER_NAME . $PHP_SELF . "";
         }

// settings
$to = "email@your-domain.com";
$subject = "Detected: $bot on $url";
$body = "$bot was deteched on $url\n\n
Date.............: " . date("F j, Y, g:i a") . "
Page.............: " . $url . "
Robot Name.......: " . $HTTP_USER_AGENT . "
Robot Address....: " . $REMOTE_ADDR . "
Robot Host.......: " . $REMOTE_HOST . "
";

mail($to, $subject, $body);
               
      }
   
   }

?>
 


“I did not code this just sharing it”

_________________
"I feel sorry for people who don't drink. When they wake up in the morning, that's as good as they're going to feel all day."

Last edited by antipas on Sun Apr 06, 2003 9:59 pm; edited 1 time in total
Find all posts by antipasView user's profileSend private messageVisit poster's website
antipas
Private
Private


Joined: Mar 10, 2003
Posts: 40


PostPosted: Sun Apr 06, 2003 8:27 pm Reply with quoteBack to top

Another good tool that I recently installed to watch Search engine spiders

Quote:

GoogleStats is a free Open Source PHP application analyzing the visits of Googlebot any many more robots on your site. The idea is quite simple: you just have to include a PHP script on each page of your site (you can choose the pages where you include it, only those ones will be analyzed). When Googlebot (or any other robot) comes to crawl one page, its visit is recorded in a MySQL database.

The second part of the application displays the statistics for a given period of time. Use the calendar on the left to select a day, a week or even a month. Three types of analysis are given:

Results:
nb of visits of each robot
nb of different pages viewed by each robot
average nb of visits per day
list of all the robots detected during the defined period
list of the IP addresses of each detected robot
Pages:
list of all pages viewed by each robot within the analyzed period, with URL, date, number of visits, IP address and DNS. You may sort these columns by clicking on its header.
Graph: displays in a bar diagram the statistics of the selected robot's visits.


http://www.googlestats.com/english/index.htm

_________________
"I feel sorry for people who don't drink. When they wake up in the morning, that's as good as they're going to feel all day."
Find all posts by antipasView user's profileSend private messageVisit poster's website
ulissesnelson
Lieutenant
Lieutenant


Joined: Apr 09, 2003
Posts: 188


PostPosted: Wed Apr 09, 2003 5:12 am Reply with quoteBack to top

Antipas can u help me? i tryed to insert the google stats but the code when i put in my nuke theme.php crashes my rigth blocks .

im using phpnuke 6.0
Find all posts by ulissesnelsonView user's profileSend private message
antipas
Private
Private


Joined: Mar 10, 2003
Posts: 40


PostPosted: Wed Apr 09, 2003 9:36 am Reply with quoteBack to top

Quote:
Uncompress the file 'googlestats.zip'. A directory named 'googlestats'
should be created. Once all the stuff listed above is done, transfer it via
FTP to the root of your web site.

Edit the file 'googlestats/admin/config.php' with a simple text editor
like Notepad and replace the 4 following elements by the real values
(if you need help you may ask your host provider):
- "HOST" : the MySQL hostname
- "LOGIN" : your login
- "PASSWORD" : your password (to access the database)
- "BASE" : the MySQL database name

Fill in your email address (this address will be used to send you the Full
Crawl alerts or to warn you about MySQL execution errors) : variable
ADRESSE_EMAIL

The options of GoogleStats are configured by the following variables.
If you want, you may modify some of them:
- URL_REWRITING : put "y" if you're using it, otherwise "n"
- GRAPH_SCALE : put "lin" (linear) or "log" (logarithmic) for the
scale of the graphs
- VALEURS_GRAPH : put "y" if you want to display the values on the
graph. Otherwise put "n"
- SET_EXEC_TIME : put "y" if you want to set a maximum execution time
for the graph script. Otherwise put "n"
- TEST_FULL_CRAWL : put "y" if you want to detect when the Full Crawl
begins and to be alerted by email (the email will be
sent to the address ADRESSE_EMAIL). Otherwise put "n"
- SEND_ERROR_MYSQL : put "y" to receive emails when an MySQL error occurs.
Otherwise put "n".

To customize the installation, you can also modify these variables:
- LANGUE : put "en" (english), "fr" (french) or "de" (deutsch)
- TABLE_LOG : name of the MySQL table used to store the visits
- TABLE_ROBOTS : name of the MySQL table used to store the robots
- NB_J_DET_FULL_CRAWL : for the Full Crawl detection: an alert is sent to
warn the webmaster that the Full Crawl has begun as
soon as a Full Crawl robot's visit is detected and no
visit of the Full Crawl robot has been recorded in
the last NB_J_DET_FULL_CRAWL days.


4- Transfer the whole directory 'googlestats' by FTP to your server.

5- Protect the directory 'admin' (that you should have renamed) with a file
'.htaccess' and a file '.htpassword'.

6- Installation:
* If this is the first time that you install GoogleStats, go to the URL
http://__your_site__/googlestats/install.php and follow the instructions.
Once the installation process is over, delete the file 'install.php' from
your server.

7- Now you must insert the following code in each page of your site (apart from
the pages you don't want to be analyzed). If you're using Dreamweaver (or
any other software supporting templates), you can insert this code in your
template. In this case, use the code display and insert this code in a
non-editable zone:

<?php
include($DOCUMENT_ROOT.'/googlestats/admin/config.php');
include($DOCUMENT_ROOT.'/googlestats/googlestats.inc.php');
?>



I added it to the bottom of my index.php
Code:

        include("footer.php");
    }
}
include($DOCUMENT_ROOT.'/googlestats/admin/config.php');
include($DOCUMENT_ROOT.'/googlestats/googlestats.inc.php');   
 
?>

_________________
"I feel sorry for people who don't drink. When they wake up in the morning, that's as good as they're going to feel all day."
Find all posts by antipasView user's profileSend private messageVisit poster's website
Display posts from previous:      
Post new topic  Reply to topicprinter-friendly view
View previous topic Log in to check your private messages View next topic
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum



Powered by phpBB © 2001, 2005 phpBB Group

Ported by Nuke Cops © 2003 www.nukecops.com
:: FI Theme :: PHP-Nuke theme by coldblooded (www.nukemods.com) ::
Powered by · TOGETHER TEAM srl ITALY http://www.togetherteam.it · DONDELEO E-COMMERCE http://www.DonDeLeo.com
Web site engine's code is Copyright © 2002 by PHP-Nuke. All Rights Reserved. PHP-Nuke is Free Software released under the GNU/GPL license.
Page Generation: 0.224 Seconds - 159 pages served in past 5 minutes. Nuke Cops Founded by Paul Laudanski (Zhen-Xjell)
:: FI Theme :: PHP-Nuke theme by coldblooded (www.nukemods.com) ::