Don’t call me DOM

24 September 2004

Annospam

I have been busy lately deploying a tool that I (and others) had started to develop one year ago, and had been stalled since then, informally called Annospam; the tool allows to cleanse W3C Mailing List Archives from its huge number of spams they host and are likely to continue to receive, however clever our anti-spams systems are getting.

The idea is to use the Annotea protocol as a way to store and retrieve spam marks on archived messages, and to regenerate the relevant archives based on these marks; it uses lots of W3C Technologies (XSLT as a way to build a user interface, RDF/XML as a data format, HTTP as a query/update protocol), which makes it really interesting, if sometimes somewhat challenging.

13 September 2004

Give Spammers a rest!

Filed under:

Spammers, like many people down here, needs to rest after all their efforts; spammers needs to take a week-end break, too, as shows the repartion of the number of messages per weekday:

Statistics of received message per weekday (1 is Monday, 7 Sunday) These plots are based on the spam I received in the past 2 months.

Note that in fact, this interpretation is probably buggy; for instance, it’s likely that a fair number of Zombies computers used to send spams are shut down during the week-end.

3 August 2004

Mailing-list based filtering and IMAP

Filed under:

Given the big number of mails I receive (around 150 a day), and given that most of it comes from W3C mailing lists (around 80%), I sort all my incoming emails based on the mailing list they were sent to, which allows me to see at a quick glance what mails I should read in priority. Procmail allows to do so quite easily with the following rule:


# filter mail from W3C mailing lists
:0
* ^List-Id: <[a-z0-9-]+.w3.org>
* ^List-Id: </[a-z0-9-]+
  $MATCH

(List-Id is a header defined in RFC 2919 precisely to make it possible to handle mailing lists messages)

« Newer entriesOlder entries »

Picture of Dominique Hazael-MassieuxDominique Hazaël-Massieux (dom@w3.org) is part of the World Wide Web Consortium (W3C) Staff; his interests cover a number of Web technologies, as well as the usage of open source software in a distributed work environment.