Back to the Spam stats main page.
This page is an attempt to visualize the spam amounts I get.
My mail server collects automatically every spam (junk mail, not to be confused with SPAM - the food) it receives, and archives it. To these archives I added all the spam in my local inbox (e.g. the one that were not filtered) by converting it to the same format my archives have.
All spam was sent to either my pobox.com account or to any address at bloodgate.com. There is only one person for this domain, but I had the habit to issue multiple dummy addresses as spam-traps. This was a bad decision, since it only let to the fact that I get most spam multiple times - for instance, to my two whois-addresses, my gimp developer address, and to my normal dummy address. I stopped this a long time ago, but the old addresses are still spammed into oblivion. Add to this the fact that the spammers sometimes outright guess addresses or mangle them (by dropping numbrs, for instance), and you know the reason why both the number of target addresses and the spam count are so high.
The archives date back until July 1998. From February to March
2000 and in August 2001 the filter was offline and the spam
sloshed directly into my local inbox. It seems, however, that I lost some of
the automatic archive in these time frames - it is hard to tell.
So please
don't interfer anything from the unusual low spam counts in these two time
frames.
I also lost the archive from 23.6 - 27.6. 2002 due to my clumsy file shuffling
:-/
Fortunately I got a rolling backup of the last x hundred email messages and
made a snapshot. Unfortunately, the messages are saved before the spam
filtering, so I need to re-tag and extract them manually (well, scriptically).
When this is done, the count for the last days in Juni will be corrected. What
you see now are only the 1-2 messages per day that go around my filter and
land in my mailbox directly.
Also, my pobox account generates an unusual high volume of spam, but for some time I had it just delete any spam instead of forwarding to my filter (and thus it didn't get archived). In short: the actual amount of spam is likely much higher for some timeframes.
The target domain stat is slightly wrong, since
the filter at elstner.com address used to lump together everything I received
before I refined my filter setup. In reality, the spam would go to my pobox.com
account, or my dummy address, and then end up at the bloodgate.com system. It
was then forwarded to filter at elstner.com, where it got filtered, and
then stuffed back into my inbox or the spam archive, depending on the filter
outcome. Since the graph tool can not yet figure this out, the stats are wrong.
I already pre-processed some of my spams to add the correct
X-Envelope-To: headers.
The site is not complete yet, I am still working on it. If you have any comments, or want the script to generate these pages, please email me.
| Spams processed: | 6376 |
| Archive size: | |
| compressed: | 11.9 Mb |
| uncompressed: | 44 Mb |
| Spams skipped: | 705 |
| Spams in the last 24 hours: | 32 |
| Spams in the last 7 days: | 211 |
| Spams in the last 30 days: | 823 |
Spams are skipped due to two reasons:
From line, like an invalid date
Please note that a lot of these addresses don't exist, but the spammers tried
to deliver mail to them, nonetheless. Especially noteworthy are the
combinations of the whois_dummy and gimp_dummy addresses. Most of them are
garbled and are very probably due to the spammer trying to defeat (wrongly)
anti-spam measurements by varing the address.
|
Made with Mail::Graph v0.09
Last regenerated: Tue Jul 30 11:59:49 2002 |
|