Don’t call me DOM

24 September 2004

Annospam

I have been busy lately deploying a tool that I (and others) had started to develop one year ago, and had been stalled since then, informally called Annospam; the tool allows to cleanse W3C Mailing List Archives from its huge number of spams they host and are likely to continue to receive, however clever our anti-spams systems are getting.

The idea is to use the Annotea protocol as a way to store and retrieve spam marks on archived messages, and to regenerate the relevant archives based on these marks; it uses lots of W3C Technologies (XSLT as a way to build a user interface, RDF/XML as a data format, HTTP as a query/update protocol), which makes it really interesting, if sometimes somewhat challenging.

I hope to get on finishing a proper documentation for it soon enough, but if you look well enough, you should already be able to see some mailing lists archives being cleaned through this very system… (hint: to detect a cleaned mailing list, find those where the number of messages displayed in the cover page doesn’t match the one displayed in a period-page; and yes, this is a bug :) )

Comments are closed.

Picture of Dominique Hazael-MassieuxDominique Hazaël-Massieux (dom@w3.org) is part of the World Wide Web Consortium (W3C) Staff; his interests cover a number of Web technologies, as well as the usage of open source software in a distributed work environment.