On Tue, Jan 15, 2013 at 1:06 PM, Dimi Paun wrote:
Thanks everyone for your help!
I'll take down the Pages spreadsheet.
Now, what about the users? Those are files (not directories) so we don't face the same low limit (32k), but it would be nice if we could, somehow, cleanup those files as well.
If I'm remembering right, a full install of Moinmoin (not just running the service portably in the unpacked tree) puts a moin command into /usr/bin. The documentation for it isn't great yet, but you can find it at http://master.moinmo.in/HelpOnMoinCommand.
Unfortunately, it doesn't have a mechanism for cleaning out users beyond obvious duplicate accounts. One possibility that I was looking at is that v1.5 of Moinmoin updates ".trail" files for all logged-in users, even if the page trail display has been disabled.
The idea was to scan the user directory for all .trail files with a mod-time older than a certain time period (I picked 1 year). If a user has logged in to do anything more recently than then, it should show up in the mod-time of the .trail file.
I wanted to test my scripts a little more, but this was one thing my sweep-once script at https://bitbucket.org/kauble/moin-admin was designed to do. Besides blanking-out and putting "file.new" instead of "file.tmp" in line 96 of split-logs.pl, the logic seemed sound on small test batches. I wanted to try it on a full copy of the Wine Wiki just to be safe though.
On Tue, Jan 15, 2013 at 4:40 AM, André Hentschel wrote:
This should also speed up that old wiki and maybe helps upgrading it (hopefully that'll happen soon :D).
I haven't touched a line of code in a couple of months (had a holiday job that really knocked the wind from my sails at times), but after getting settled into my classes over the next few days, I plan on working on moving the wiki to v1.9 of Moinmoin again.
The one thing that would probably help a lot is if there was a regularly updated tarball of the wiki content either at WineHQ or Lattica's FTP again. I haven't messed with cron itself much, but my archive.cron script should pack up the files correctly. The main complication is that the user dir probably should be shared on a need-to-know basis because it contains weakly-hashed password info.
Kyle