Eric'o'theque!
Sunday, September 16, 2007
Google, PostSecret, and Spam Blogs

 

So I read PostSecret as a guilty indulgence. Right now, it is locked down due to a programmatic sweep profiling it as a Spam blog:

Your blog is disabled

Blogger's spam-prevention robots have detected that your blog has characteristics of a spam blog. (What's a spam blog?) Since you're an actual person reading this, your blog is probably not a spam blog. Automated spam detection is inherently fuzzy, and we sincerely apologize for this false positive.

We received your unlock request on September 16, 2007. On behalf of the robots, we apologize for locking your non-spam blog. Please be patient while we take a look at your blog and verify that it is not spam.

(From: PostSecret: Blogger has locked PostSecret )

First of all, given how many blog search queries I run from day to day (either tracking InfoPath back in my Office days or now the Windows Live Photo Gallery) I have deep appreciation of getting rid of Spam blogs. Most of which seem to pop-up on blogspot. In fact, if I see a link to a blogspot.com blog, I'm highly likely to skip looking at it because usually it's a modern day textual mash-up of reposted text from popular blog feeds and Spam links. Tracking InfoPath discussion became near impossible.

But you've got to have a better pattern than what happened to PostSecret to detect if something is a Spam blog. PostSecret should have lots of incoming links from high-quality sources. Well, maybe that's not apparent scanning at the first of 28,000 incoming links, but let's see... being a finalist in the 2005 Weblog Awards (and being around so long) probably breaks any Spam metric.

This seems like a sloppy shotgun approach (yes, indeed, "inherently fuzzy")... like a 20% project gone horribly wrong and that should be suspended. Probably more important than this is a Spam blog scanner is something else like this is a cherished content blog scanner to ensure quality blogs never get blacklisted.

Technorati tags: , , ,
 
Comments: Post a Comment

Links to this post:

Create a Link



<< Home
Eric Richards' place of techno (as in technology) happiness, rants, and corporate love. I work in Microsoft Office as a development lead.

My Photo
Name: Eric Richards
Location: Redmond, Washington, United States

Non-technical stuff going on with EricRi in the Northwest.

email: Eric_Richards at ericri dot com

Lots More About Eric.

Eric-isms
Archives
May 2005
August 2005
November 2005
January 2006
February 2006
March 2006
April 2006
May 2006
June 2006
July 2006
August 2006
September 2006
October 2006
November 2006
December 2006
January 2007
February 2007
March 2007
April 2007
May 2007
June 2007
July 2007
August 2007
September 2007

Disclaimer

Disclaimer: The postings (and comments) here represent personal point of views and in no way represent the point of view or official opinions of my employer (Microsoft Corporation). The postings here are provided "AS IS" with no warranties, and confers no rights. And if you're reading this blog, you're not only incredibly discerning, you're also knee-weakening good looking.


More blogs about Eric Richards.

Powered by Blogger