vBulletin

Thank you for visiting. This is our website archive. Please visit our main website by clicking the banner above.
vBulletin FAQ is dedicated to helping the forum owner build, manage and profit from his vBulletin Forum
vBulletin Web Hosting - Free skins and styles for your vBulletin - Search Engine Optimization




Google Bot follows Adsense bot?

Big Dan
08-02-2006, 10:11 AM
Has anyone elsed noticed once you implement adsense, the regular indexing bot seems to follow with in a couple of days?

I started a forum a while back without adsense it took more than 2 months for my domain to get index. 2 days ago I installed vB on a clean domain implemented adsense and the GoogleBot has hit me several times.

Just wondering if anyone has noticed the same.

noppid
08-02-2006, 10:31 AM
Are you certain it's not the media bot?

Big Dan
08-02-2006, 10:33 AM
The adsense bot and google bot are listed seperately.. Take a look http://ourdjtalk.com/spiders.php

noppid
08-02-2006, 11:07 AM
The adsense bot and google bot are listed seperately.. Take a look http://ourdjtalk.com/spiders.php


Just making sure. In this case I'd bet a good backlink helped or it was a coincedence. Either way, good for you!

dchapman
08-05-2006, 08:37 PM
With the new technology Google is using, both the Adsense and normal Googlebot utilize the same cache. So what you're experiencing is not too surprising. I thought I read about this on MattCutts but a quick search didn't find it. The next time I come across the information i'll update this thread.

Damocles
08-28-2006, 05:29 PM
I submitted my site at Google. I had bots in 2 weeks and am indexed. I do not use Adsense at this time.

minstrel
08-28-2006, 06:19 PM
With the new technology Google is using, both the Adsense and normal Googlebot utilize the same cache.
That's correct.

Matt Cutts Q&A on Big Daddy and other issues (http://www.mattcutts.com/blog/q-a-thread-march-27-2006/)

Q: “What’s the story on the Mozilla Googlebot? Is that what Bigdaddy sends out?”

A: Yes, I believe so. You will probably see less crawling by the older Googlebot, which has a User-Agent of “Googlebot/2.1 (+http://www.google.com/bot.html)”. I believe crawling from the Bigdaddy infrastructure has a new User-Agent, which is “Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)”

Matt Cutts: Google Crawl Caching Proxy (http://www.mattcutts.com/blog/crawl-caching-proxy/)

Several people have noticed content from other Google bots showing up in our main web index, and are wondering… why/how does that happen? Last week I was at WebmasterWorld Boston and I talked about this issue there, but I’d like to do a blog post about Google’s crawl caching proxy, because some people have questions about it.

First off, let me mention what a caching proxy is just to make sure that everyone’s aware. I’ll use an example from a different context: Internet Service Providers (ISPs) and users. When you surf around the web, you fetch pages via your ISP. Some ISPs cache web pages and then can serve that page to other users visiting the same page. For example, if user A requests www.cnn.com, an ISP can deliver that page to user A and cache that page. If user B requests www.cnn.com a second later, the ISP can return the cached page. Lots of ISPs and companies do this to save bandwidth. For example, Squid is one web proxy cache that is free and common that a lot of people have heard of.

As part of the Bigdaddy infrastructure switchover, Google has been working on frameworks for smarter crawling, improved canonicalization, and better indexing. On the smarter crawling front, one of the things we’ve been working on is bandwidth reduction. For example, the pre-Bigdaddy webcrawl Googlebot with user-agent “Googlebot/2.1 (+http://www.google.com/bot.html)” would sometimes allow gzipped encoding. The newer Bigdaddy Googlebots with user-agent “Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)” are much more likely to support gzip encoding. That reduces Googlebot’s bandwidth usage for site owners and webmasters. From my conversations with the crawl/index team, it sounds like there’s a lot of head-room for webmasters to reduce their bandwith by turning on gzip encoding.

Another way that Bigdaddy saves bandwidth for webmasters is by using a crawl caching proxy.

.....

In this example, if the blogsearch crawl or AdSense wants to fetch a page that the web crawl already fetched, it can get it from the crawl caching proxy instead of fetching more pages. That could reduce the number of pages fetched down to as little as 11. In the same way, a page that was fetched for AdSense could be cached and then returned to if the web crawl requested it.

So the crawl caching proxy work like this: if service X fetches a page, and then later service Y would have fetched the exact same page, Google will sometimes use the page from the caching proxy. Joining service X (AdSense, blogsearch, News crawl, any Google service that uses a bot) doesn’t queue up pages to be include in our main web index. Also, note that robots.txt rules still apply to each crawl service appropriately. If service X was allowed to fetch a page, but a robots.txt file prevents service Y from fetching the page, service Y wouldn’t get the page from the caching proxy. Finally, note that the crawl caching proxy is not the same thing as the cached page that you see when clicking on the “Cached” link by web results. Those cached pages are only updated when a new page is added to our index. It’s more accurate to think of the crawl caching proxy as a system that sits outside of webcrawl, and which can sometimes return pages without putting extra load on external sites.

Just as always, participating in AdSense or being in our blogsearch doesn’t get you any “extra” crawling (or ranking) in our web index whatsoever. You don’t get any extra representation in our index, you don’t get crawled/indexed any faster by our webcrawl, and you don’t get any boost in ranking.

This crawl caching proxy was deployed with Bigdaddy, but it was working so smoothly that I didn’t know it was live. That should tell you that this isn’t some sort of webspam cloak-check; the goal here is to reduce crawl bandwidth.

developer-gspp
01-17-2007, 10:46 PM
I have also noticed more traffic from the googlebot after installing adsense. I don't know if they are related or just coincidence though.


vBulletin

seo book

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35

EZ Archive Ads Plugin for vBulletin Copyright 2006 Computer Help Forum