Eww, You Stepped In a Dupe

|

Slashdot posted yet another dupe today. That's at least 5 this year. That has me thinking that a site that looks like Slashdot but only contains the dupes could be fun to create. It may also tie into an idea of mine using Bayesian (sp?) filtering. My idea was to use it to filter RSS feeds, but for dupes each post would require a seperate token table. It may work to detect dupes. I know today's dupe would score big on SMS and morse. Shoot, if that doesn't work then filtering the comments for "dupe" would definitely work.

I'll have to start archiving RSS feeds to test my initial idea with, unless someone has a ready archive for me. For the dupe detector I just need to get a filter and RSS parser.

Ad's by Google