www.mamboteam.com
European Network Dynamics
Home arrow Articles arrow Analytics arrow Testing crawlerbots indexing speed
Monday, 07 July 2008
 
 
Main Menu
Home
Forums
Articles
News
Directory
Chat
Search
FAQs
Sponsored Links
Testing crawlerbots indexing speed Print E-mail
User Rating: / 5
PoorBest 
Written by European network dynamics principal author   
According to serveral observations, see 1 and 2, many questions are to be asked concerning the incredible reactivity of crawlerbots such as Googlebot. First of all ameasurement is realized then results are discussed about in terms different means to acces newly published information by bots. According to several concerns, measurements, and observations, the time given by bots before indexing newly published articles has drastically decreased these latest days dropping as low as a few minutes when published on small to average audiences websites (1 and 2 ).
To our knowledge, no objective measurements have been objectively realized and publicly published concerning this indexing speed. Therefore we decided to do the following experiments using European network dynamics website and with the help of Webrankinfo.com, a well known french SEO website (3 ) .

Hypothesis

  • Are crawler bots warned of new articles, and how are they warned ?
  • What tools do bots use (rss, browsers helping bars, deep sites linking) ?
  • Are ads delivery systems involved (Googlebot <-> Adsense <-> Adwords ) ?




Methods

The time between the publication of 4 articles within
European network dynamics and the passage of an indexing crawling bot such as googlebot were cautionously measured

All articles published
within European network dynamics are submitted to a RSS feed taken into consideration by bots and feeds agregators.

The  articles discussed above were writted using 'clean' browser (Firefox2, fresh install)with no helping bar such as google bar, nor alexa bar.

A total of 4 articles are written for the study:

(URL are given for information purposes only, nothing of interest can be found there)

Article 1: totally new article with adsenses present in left column
http://www.netdynamics.eu/analytics/testing-googlebots-indexing-speed-1.html
 

Article 2: absolutely similar to Article 1, URL ends with ...-2.html, but the real URL (although easy to guess is not given here), Adsenses present as Article 1

Article 3: New article, with no adsense, using the following URL :
http://www.netdynamics.eu/Testing_Googlebots_indexing_speed_3.html

Article 4: exactly similar to Article 3 with no Adsenses, and no URL published, although ends with ..-4.html (see article 1 and 2)

All four articles were submitted to publication exactly at 11.30 AM on Friday Aug.31st  Western European time.

In addition, all these 4 articles were also presented and discussed about in (3 ) as a 'promotionnal' discussion (see 4 ).


Results

An intensive log analysis taking only these 4 articles into consideration, of visitors after 11:30 AM this days gives the following results:
Article 1
1st 'human' visitor : 11:34:02 AM
1st bot : Googlebot : 11:35:24 AM (5 minutes 24 seconds after unveiling the article)
2nd bot : Ask Jeeves / Teoma : 15:06:54
3rd bot : Baiduspider : 19:25:02
4 th bot : Yahoo slurp : 01/Sep/2007:21:38:33
Article 2
1st 'human' visitor : 15:14:48
1st bot : Googlebot : 15:14:48 (4 hours 14 minutes after unveil)
2nd bot : Ask Jeeves / Teoma : 17:26:56
3rd bot : Baiduspider : 20:10:04
4 th bot : Yahoo slurp : 02/Sep/2007:05:58:40

At the time these lines are being written, Articles 3 and 4 have not been indexed yet. No other bots have been indexing these 4 articles.

Discussion

The speed at which Googlebot indexes new article seems tremendous, About 5 minutes only after an articla has been unveiled it is already indexed. Furthermore, the 'promotionnal' discussion used in (3) was indexed in google as soon as 12:51 on Aug 31st, see (5

Results strongly suggest that adsenses are responsible of the short delay between unveiling and indexing, because Googlebot's first fisit on Article 1 has no referrer. Let us recall that One 'human' visitor came before the bot. Apache log analysis does not reveal this visitor used any helping bar equipped browser, but even if not the case, this visit invoked a new url for Adsense, therefore triggering bot's visit.

Main referrences








The authors of this study wish to thank webrankinfo.com for their help in this study.

End notes concerning this study:




This article was first published on
Tuesday, 04 September 2007 22:18 and its first visit by googlebot occured some 30' after






crawl-66-249-66-163.googlebot.com www.netdynamics.eu - [04/Sep/2007:22:55:31 +0200] "GET /analytics/testing-crawlerbots-indexing-speed.html HTTP/1.1" 200 10942 "-" "Mediapartners-Google"





Del.icio.us!Google!Live!Slashdot!Furl!Yahoo!Ma.gnolia!
 
< Prev   Next >
Login Form





Lost Password?
No account yet? Register
Latest links
Telethon 
Shugalmella 
Fietsvakantie Frankrijk 
Les restaus du coeur 
Organization Kikoulol 
Related Items
Who's Online
We have 1 guest online
Support this site

Enter Amount:



 
Top! Top!