OK, my gorgeous wife suggested that my last couple of posts were a little more technical than they ought to have been. So, to explain some of the backstory:
A programmer named Paul Graham recently wrote
A Plan For Spam, which describes his attempts to improve spam filtering. Graham gave his advanced statistical filter away, and suddenly lots of companies advertised that they had advanced statistical spam filters. It's a well-written essay (as is his essay
Why Nerds are Unpopular [which I only read out of curiousity]).
Graham noticed that companies tried half-baked ideas to cut down on spam. Some companies blacklist email addresses that spammers used. Others get together for a brainstorming session and decide that emails with certain terms (Viagra?) won't get through. It turns out that the return address on an email is just as easy to lie about as the return address on a real envelope. And if "Viagra" won't get through a spam filter, "Via-Gra" may. Using statistics solves this, and A Plan For Spam explains why.
The strange part is that many companies still use the half-baked ideas instead of the freely available good stuff. It seems odd to me, but I guess it's possible that some people haven't read A Plan For Spam.
Of course, math isn't exactly my strong suit. That's partly where the CRM Discriminator comes in. Don't understand the math behind the statistics? Worried that you'll write the wrong equation? Use the CRM's built-in text classifiers (written by MIT professors), and don't worry about it.
The really cool thing is that you can classify better than "spam, not spam." You can have the CRM Discriminator (or any statistical filter)
put emails (or any text) into several piles. For instance,
IF I had my own company, it would probably be a technical company. And
IF I wasn't the only guy handling emails, I could have CRM look at all email coming in to admin@maxlybbertscompany.com and classify it as a Bug Report, Administrative, Technical Question, Financial Data, etc. That's great.
But I'm sure I could do more. I could have CRM look at several webpages I visit during the day (or, probably, feeds from those web pages), and ask "does this article look more like articles that Max likes, or articles he doesn't like?" If I would likely enjoy the article, I'd get an alert (or a web page, or some other link) telling me so. That's what I was suggesting was pie in the sky (because of other technical quesitons, namely databses), until I found out somebody else
is already doing it.
But I'm still trying to think of a good use for the CRM Distriminator. Obviously you shouldn't buy a saw and then say "because this saw is so cool, I think I'll cut a hole in this wall here." That's probably what I am doing. But I'm on the lookout for a place where I can use my new tool.