There are two types of cloaking: user-agent based and IP based (also known by the euphamism “IP delivery”). Cloakers try to cover their tracks by making it difficult to examine the version meant only for spiders. They do this with a “noarchive” command embedded within the meta tags. Googlebot will obey that directive and not archive the page, which then causes the “Cached” link in that page’s search listing to disappear.
So getting a view behind the curtain to see what is being served to the spider can be a bit tricky. If the type of cloaking is solely user-agent based, you can use the User Agent Switcher extension for Firefox. Just create the following user-agent under Tools > User Agent Switcher > Options > Options > User Agents:
Description: Googlebot
User Agent: Googlebot/2.1 (+https://www.googlebot.com/bot.html)
Then switch to that user agent by selecting Googlebot under Tools > User Agent Switcher.
But that won’t work if the cloaker is doing IP delivery. If there’s no “Cached” link in the SERPs, you might think you’re out of luck. But you may not be!
A lot of times, Google’s “Translate This Page” functionality can be used to view the cloaked content, because many cloakers don’t bother to differentiate between the bot coming in for the purpose of translating or coming in for the purpose of crawling. Either way, it uses the same range of Google IP addresses. Thus, when a cloaker is doing IP delivery they tend to serve up the Googlebot-only version of the page to the Translate tool. This loophole can be plugged, but many cloakers miss this.
And I bet you didn’t know that you can actually set the Translation language to English even if the source document is in English! You simply set it in the URL, like so:
https://translate.google.com/translate?hl=en&sl=en&u=URL&sa=X&oi=translate&resnum=9&ct=result
(Above, replace URL with the actual URL of the page you want to view)
That way, when you are reviewing someone’s cloaked page, you can see the page in English instead of having to see the page in a foreign language.Â
You can also sometimes use this trick to view paid content. i.e. if you’re too cheap to pay for content from sites like WebmasterWorld where that content has been placed behind a registration wall and removed from Google’s cache.
Do pay for WebmasterWorld, though. Do right by Brett.
hey..google bot doesnt work…
even on webmasterworld..
im using firefox 2.0
That will only grab bottom-end cloakers. Real ones, there’s only a few ways to get around it.
…which I’m not looking to repeat here.