[Evolution] Converting from Kmail
Dan Stromberg
strombrg@dcs.nac.uci.edu
Mon, 01 Nov 2004 15:50:13 -0800
On Mon, 2004-11-01 at 15:14, Jon Biddell wrote:
> >
> > If I recall correctly, this was considered and not implemented-- it's
> > not clear that nearly-identical messages can be identified properly
> > without a lot of processing.
>
> Interesting - I wonder how the kmail guys do it - it seems to work pretty
> well, with only the rarest of false deletions - I had a mailbox with 16k
> messages (deliberately created to test it) which was a double-import of
> another mailbox - I *knew* there would be exactly 8192 duplicates, and kmail
> shot through the file in less than 30 seconds.
If you want to be quick, you can just delete all but one copy of
anything with the same Message-id: header.
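
A rough sketch of the quick approach in Python, in case it helps. The function name and the (key, message) pair interface are made up for illustration; this is not how kmail or Evolution actually does it. Messages with no Message-id: header are left alone, since there is nothing to compare.

```python
def duplicate_keys_by_message_id(pairs):
    """Return the keys of messages whose Message-ID was already seen.

    `pairs` is any iterable of (key, message) tuples, where each message
    behaves like email.message.Message (header lookup via .get(), which
    is case-insensitive, so "Message-id" and "Message-ID" both match).
    """
    seen = set()        # Message-IDs encountered so far
    duplicates = []     # keys of the later copies
    for key, msg in pairs:
        msg_id = msg.get("Message-ID")
        if msg_id is None:
            continue    # no header to compare on; keep the message
        msg_id = msg_id.strip()
        if msg_id in seen:
            duplicates.append(key)
        else:
            seen.add(msg_id)
    return duplicates
```

With the standard mailbox module, something like `duplicate_keys_by_message_id(mailbox.mbox(path).iteritems())` should give the keys to pass to `remove()`, followed by `flush()`. Lock the mailbox first if anything else might be writing to it.
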
If you want to be more thorough, you could additionally generate sha-1
or md5 hashes of all messages as they come in, perhaps inserting them
into a heap.
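
The hashing variant might look like the sketch below. It assumes the raw bytes of each message are available, and it keeps the digests in a plain Python set rather than a heap, because a membership test is all that is needed here. The function name is hypothetical. Note that hashing the whole message only catches byte-identical copies (such as a double import); two deliveries of the same mail with different Received: headers would hash differently.

```python
import hashlib

def duplicate_keys_by_digest(raw_messages):
    """Return the keys of messages whose SHA-1 digest was already seen.

    `raw_messages` is any iterable of (key, bytes) tuples holding the
    complete raw text of each message.
    """
    seen = set()        # digests of messages encountered so far
    duplicates = []     # keys of byte-identical later copies
    for key, raw in raw_messages:
        digest = hashlib.sha1(raw).digest()
        if digest in seen:
            duplicates.append(key)
        else:
            seen.add(digest)
    return duplicates
```

For an mbox file, the pairs could be produced with `((k, box.get_bytes(k)) for k in box.keys())` on a `mailbox.mbox` object.
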