[Evolution] Converting from Kmail
Dan Stromberg
strombrg@dcs.nac.uci.edu
Mon, 01 Nov 2004 16:44:27 -0800
--=-VYYEEVqGQmlRAR9KliiX
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable
On Mon, 2004-11-01 at 16:19, Ron Johnson wrote:
> On Mon, 2004-11-01 at 15:50 -0800, Dan Stromberg wrote:
> > On Mon, 2004-11-01 at 15:14, Jon Biddell wrote:
> >=20
> > > >
> > > > If I recall correctly, this was considered and not implemented-- it=
's
> > > > not clear that nearly-identical messages can be identified properly
> > > > without a lot of processing.
> > >=20
> > > Interesting - I wonder how the kmail guys do it - it seems to work pr=
etty=20
> > > well, with only the rarest of false deletions - I had a mailbox with =
16k=20
> > > messages (deliberately created to test it) which was a double-import =
of=20
> > > another mailbox - I *knew* there would be exactly 8192 duplicates, an=
d kmail=20
> > > shot through the file in less than 30 seconds.
> >=20
> > If you want to be quick, you can just delete all but one copy of
> > anything with the same Message-id: header.
> >=20
> > If you want to be more thorough, you could additionally generate sha-1
> > or md5 hashes of all messages as they come in, perhaps inserting them
> > into a heap.
>=20
> That's the exact idea I had, except I was thinking of bash and
> Maildir...
Yeah, that's good too.
BTW, deduping the addressbook would be nice as well. A lot of my
addressbook entries have the same e-mail addresses twice - probably as a
result of a failed experiment in which I tried to migrate from jpilot to
evolution for syncing with my palm.
--=20
--=-VYYEEVqGQmlRAR9KliiX
Content-Type: application/pgp-signature; name=signature.asc
Content-Description: This is a digitally signed message part
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
iD8DBQBBhthro0feVm00f/8RAo61AJwPZsHrjWnUruC/SW/t5n3U1LiU3wCcDiZQ
HzwZI3zhxUoZfBrmy5h8z2k=
=ybxI
-----END PGP SIGNATURE-----
--=-VYYEEVqGQmlRAR9KliiX--