[Imap-uw] Mix filesize
Mark Crispin
mrc at CAC.Washington.EDU
Thu Sep 13 18:41:17 PDT 2007
On Fri, 14 Sep 2007, Per Foreby wrote:
> I have changed MIXDATAROLL in src/osdep/unix/mix.c, and was a bit surprised
> when I used mixcvt and got the default filesize.
> If this definition was moved to a common include file, say c-client/mix.h,
> all mix applications could share the same setting.
mixcvt intentionally does not use any c-client information for mix; the
standalone mix tools follow their own rules and not c-client rules. Note
that mixdfix and mixrbld don't even link with c-client at all, and the
only reason for mixcvt to link with c-client is to read non-mix mailboxes.
If you think about tweaking things, you'd see why you'd want mixcvt to
have its own independent idea. But perhaps it should be a command line
option.
> Has anyone made a serious attempt to see how large the files can be before
> they start to affect backup performance, or is everybody guessing?
For the most part, everybody is guessing, but there seems to be some
concensus that a "good" value is between 1MB and 10MB for an "old" data
file. That is likely to change over time as the average message size
grows. The idea is to have an approximately two order of magnitude
reduction of number of files over maildir, news, mh, Cyrus, etc.; while
keeping the occasional expunge in an "old" file to something manageable.
I think though that we're all groping in the dark. I agree that we
probably don't want it to be less than 1MB, so any tweaks to the number
will be to make it larger. However, 500MB is probably way too large.
I'll be interested to hear how Nancy Lin does with 20MB. I'd think that
"most of the .mix files get touched every day" would indicate that the
size is too large, and needs to be scaled back.
But the only way is to play with lots of values in a real user and
production setting, and collect enough data that you can graph the various
values. Sheesh, this is almost like multivariate calculus (a class I
still have nightmares about over 30 years later...).
-- Mark --
http://panda.com/mrc
Democracy is two wolves and a sheep deciding what to eat for lunch.
Liberty is a well-armed sheep contesting the vote.
More information about the Imap-uw
mailing list