Googlebot Meta Tags
Besides the keyword and description meta tags being the most important ones, there are some meta tags that are proprietary to Google’s
own Googlebot. According to Google
, the tags that would apply specifically to Google’s
Googlebot are the ones that have name="googlebot" in the meta tag.
There are many reasons not to have a Search Engine cache a page. If it is copyrighted, if it contains private information that you do not want displayed from a search engine, or for any other reason you do not want the page crawled by a search engine, you can add the following code to the meta tags of a page and it will keep the page from being indexed:
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
If you would like specifically to keep Google
from indexing the page, but allow other Search Engines to index it, you can place the following meta tags in your headers of the page:
<META NAME="GOOGLEBOT" CONTENT="NOINDEX, NOFOLLOW">
On the other hand, if you would like the robots to index the file, but not follow the links on the page, you can use this code.
<META NAME="ROBOTS" CONTENT="NOFOLLOW">
When the Search Engines spiders, bots or robots crawl and index a page, they take a snapshot of a page. This is called a cached copy or a cache of the page. If for some reason, the site is unavailable when you visit it, a Search Engine can show the cached copy of the page. Or, if the cached copy of the page has a different version, then for various reasons, someone might want to use this cached version. It will be what the page looked like at the time of the crawl. So, if a robot crawled the site at 12:00pm on Saturday, and Monday more articles were added and changed on the page that was cached, you could still view what the cached version of that page looked like if the last time the bot crawled it was before it was changed on Monday. If you do not want Search Engines to cache a copy of this page or retain an older version of the page, you can add the following meta tags to the header of the page:
<META NAME="ROBOTS" CONTENT="NOARCHIVE">
If it is specifically Google’s
Search Engine that you do not want to have a copy of that page, but you want the other Search Engines to have a copy, then you would use this meta tag in the headers:
<META NAME="GOOGLEBOT" CONTENT="NOARCHIVE">
Using the above meta tag for Google
will result only in the cached copy not being stored. Google’s Googlebot will still continue to index the page with what is known as a snippet for that link on the Search Engine. A snippet is a preview of the webpage that is indexed. This is the preview that appears below the title of the page when you search on Google
. If you set it to use no snippet, it will display what is in the Description Meta tag of the page instead of a snippet.
.
Note: The snippet will also remove the cached page:
Aleeya dotNet ![]()
Daily life information for the family and people of all ages. Household Tips,
DVDs, Books, Reviews, Search Engine Optimization, News, Amateur Radio and more …
www.aleeya.net/ - 34k - Oct 13, 2005 - Similar pages ![]()
Without a snippet using the Description Meta tag:
Aleeya dotNet
Daily life information for the family and people of all ages. Household Tips,
DVDs, Books, Reviews, Search Engine Optimization, News, Amateur Radio and more …
www.aleeya.net/ - 34k - Oct 12, 2005 - Cached - Similar pages
Below is a result for a search for Aleeya (in Google) and an example of what the results look like when it is not using a snippet or a Description Meta tag.
Aleeya.net: October 2005
… and kept the non technical site as http://www.aleeya.net . But, from previous,
Aleeya.net only had a 4/10 on Google’s page rank when I used PHPnuke. …
aleeya-net.blogspot.com/ 2005_10_01_aleeya-net_archive.html - 21k - Cached - Similar pages
[ More results from aleeya-net.blogspot.com ]
So, to set it so the Google
Bots will index a page, but not use a snippet - but use the Description Meta Tag, use the code below:
This will remove the cache of the page:
<META NAME="GOOGLEBOT" CONTENT="NOSNIPPET">
There are many other meta tags available to use for HTML. The ones above are just a sample and also demonstrate the ones that are proprietary to Google
.
![]()
For further reading:
Google Information for Webmasters
The Robots META tag
W3C Recommendations - Meta Data
Related Articles on Aleeya.net:
Recent Entries:
- 07/05/2008: Windows Does not Report All Memory
- 01/01/2008: Firefox 3 beta - Minefield
- 01/01/2008: Happy New Year!
- 01/01/2008: Photoshop Thumbnails (.psd)
- 01/01/2008: Adobe Photoshop CS3 error
Search: Cosmos | BlogPulse
Bookmark: Del.icio.us | Furl It | Spurl | Tag!RawSugar | Simpy This! | Shadows Tag! | Blink It | My Web
Aleeya dotNet Tags: Googlebot, meta, tags, google
Technorati Tags: Googlebot, meta, tags, google
Filed under: SEO (Technorati) .
You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.
This entry was posted on Monday, October 17th, 2005 at 6:44 am
You can also choose to read SEO Category, which is the previous entry, or Titles, Description, and Keywords, the next entry.