In recent weeks, as much as half of the linkage accorded to me on Technorati has come from scraper sites, a subset of spam blogs (“splogs”) which are explained thusly:
The purpose of a splog can be to increase the PageRank or backlink portfolio of affiliate websites, to artificially inflate paid ad impressions from visitors, and/or use the blog as a link outlet to get new sites indexed. Spam blogs are usually a type of scraper site, where content is often either Inauthentic Text or merely stolen (see blog scraping) from other websites. These blogs usually contain a high number of links to sites associated with the splog creator which are often disreputable or otherwise useless websites.
One which perplexed me greatly was Treadmill Reviews and Information, a subdomain under, of all things, a John from Cincinnati message board. The operator basically scrapes everything that mentions the word “treadmill” — including this recent post of mine, which uses the usual “42nd and Treadmill” shorthand to describe my workplace. Obviously it has nothing whatever to do with treadmills, but the splog is just jam-packed with the Google AdSense links you might expect.
Of course, I’m putting this up to see if it gets scraped — which is why I put all the derogatory definitional stuff in the first couple of paragraphs.