[Jenkins-infra] Fighting with wiki spam

Kohsuke Kawaguchi kk at kohsuke.org
Mon Mar 2 20:40:40 UTC 2015

This is just an idea.

I was thinking about how we can cope more effectively with Wiki spam, and
spread that workload.

What if we establish a mailing-list-based workflow? We'll create a mailing
list that spam fighters will join, and this list will receive the
notifications from Confluence about new pages.

We'll have a bot monitor this list as well, and if it sees us replying to a
notification email with some keyword, say "BURN IN HELL", it'll go delete
that page. I think this simplifies the workflow for us humans quite a bit,
and it'll make it easier for multiple people to collaborate on this task.
The invitation-only mailing list would serve as a kind of authentication
mechanism, to prevent the bot from going nuts.
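
A minimal sketch of how the bot might decide whether a reply is a delete
command. Everything named here is an assumption for illustration: the
function page_to_delete, the notifications mapping (notification Message-ID
to page id), the members set, and the rule that quoted lines are ignored.
Only the keyword comes from the proposal. It handles plain-text replies only
and does not talk to Confluence.

```python
from email import message_from_string
from email.utils import parseaddr

KEYWORD = "BURN IN HELL"  # the keyword suggested in the proposal


def page_to_delete(raw_reply, notifications, members):
    """Return the page id a reply asks to delete, or None.

    raw_reply     -- the reply email as a string
    notifications -- hypothetical dict: notification Message-ID -> page id
    members       -- hypothetical set of lower-cased list member addresses
    """
    msg = message_from_string(raw_reply)

    # Only list members may issue commands.
    sender = parseaddr(msg.get("From", ""))[1].lower()
    if sender not in members:
        return None

    # The reply must point at a notification the bot knows about.
    page_id = notifications.get(msg.get("In-Reply-To", "").strip())
    if page_id is None:
        return None

    body = msg.get_payload()
    if not isinstance(body, str):
        return None  # multipart replies are ignored in this sketch

    # Look only at lines the replier wrote, not at quoted text.
    own_lines = [line for line in body.splitlines()
                 if not line.lstrip().startswith(">")]
    if any(KEYWORD in line.upper() for line in own_lines):
        return page_id
    return None
```

Ignoring quoted lines is a design choice in this sketch: it keeps a later
reply that merely quotes an earlier command from triggering a second delete.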

The bot could evolve to take more actions, such as removing the user from
LDAP and perhaps feeding that information back to stopforumspam.

I've also experimented with a language detection library, and it seems to
work well. So our bot could automatically delete any new page that is
judged to be Indonesian at a 99%+ confidence level, and it could auto-reply
to that list saying it deleted the page.
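
The auto-delete rule could be sketched roughly as below. The language
detection library is not named here, so this sketch assumes its output has
already been converted into a dict of language code to probability. The
names should_auto_delete and auto_reply_text, the "id" code for Indonesian,
and the wording of the reply are all illustrative assumptions.

```python
AUTO_DELETE_LANGUAGE = "id"   # ISO 639-1 code for Indonesian (assumed key)
CONFIDENCE_THRESHOLD = 0.99   # the 99% level from the proposal


def should_auto_delete(language_scores):
    """Decide whether a new page should be deleted without human review.

    language_scores -- hypothetical dict: language code -> probability (0..1),
                       as produced by whatever detection library is used
    """
    score = language_scores.get(AUTO_DELETE_LANGUAGE, 0.0)
    return score >= CONFIDENCE_THRESHOLD


def auto_reply_text(page_title, language_scores):
    """Build the message the bot would post back to the list."""
    score = language_scores.get(AUTO_DELETE_LANGUAGE, 0.0)
    return ("Deleted page '%s' automatically: judged Indonesian "
            "with confidence %.3f." % (page_title, score))
```

A page scoring below the threshold would simply be left for the humans on
the list to handle through the reply workflow.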

The accumulated archive will serve as a nice record of actions to analyze.

Is something like this useful?

