Track Search Engine Spiders / Robots

You are here : Home > Tools > Track Search Engine Spiders / Robots

You have just placed your new site online.
How can you find out that a search engine is crawling your site ?
Some tool is needed to track search engine spiders or robots.
Or consult your logfiles. Boring and time consuming isn't it ?

Use TrackBot to track search engines, it sends you an e-mail when a search engine visits your site. This way you are informed, just seconds later.

What is TrackBot ?
TrackBot is a PHP script based on the script called RobotVisit from Steve Dawson.
I found his script difficult to update with new search engine spiders. So I modified it to my needs and gave it a new name.

This information will be mailed to you :
  • Name of the search engine (user Agent)
  • The page that is crawled
  • IP and hostname of the search engine
  • Exact date and time of the visit
The mail looks like this :
Hi,
Bot : Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)
Crawled : http://on2bbp.byze.be/contact.php
Used ip : 66.196.90.152
Host : lj1136.inktomisearch.com
Time of visit : 11-04-2005 at 23:03:27

Tracked Search Engines
It will track these search engines by default:
  • Google
  • Yahoo (Slurp)
  • MSN Search
  • Alexa
  • Alta Vista
  • GigaBlast
  • Ask Jeeves
You can add as many search engines as you wish. Add the correct botname in the script and it will also be tracked.
More information can be found on the Web Robots Database.

Installation
Installation is simple. Upload the script to your site and add the following code on the first line in the page that have to be tracked. Do this for every page.
<?php include("TrackBot.php") ?>

I assume that your pages are written in PHP and not static HTML.

Testing 1 2 3
Add 'Mozilla' to the array SearchBot$ or any other word that could identify your browser. The script should then be triggered the moment you or someone else visits the page.
Remove it after testing or you get a mail for every visitor.
And those visitors are not search engines :-)

Warning
If your site contains many pages and you include the code on every page, you may end up with tons of mails.

Download
Download TrackBot

Help
This script is free to use, but donations via PayPal are allways welcome.
I'll use them to keep the site running.
or place a link on your page to this page using the following HTML code :
<a href="http://on2bbp.byze.be/tools/track-search-engine-spiders-robots.php" alt="Track
Search Engine Spiders / Robots with TrackBot">Track
Search Engine Spiders / Robots with TrackBot</a>