Todo list --------- + Charset conversion should use Unicode Normalisation Form C.