# HG changeset patch
# User František Kučera
# Date 1376671261 -7200
# Node ID 02c5533d388814c35b648126c8256495b398e96f
# Parent 835af238d020e7317ae827dac0efcbf3b1118001
data: UTF-8

diff -r 835af238d020 -r 02c5533d3888 data/dictionary.xml
--- a/data/dictionary.xml Fri Aug 16 18:31:05 2013 +0200
+++ b/data/dictionary.xml Fri Aug 16 18:41:01 2013 +0200
@@ -377,7 +377,16 @@
-
+
+
+ a character encoding that can represent every character in the Unicode character set;
+ is a multibyte, variable-width encoding, which means that one character can be represented by one or more bytes
+ (i.e. the character length of the text does not have to equal the byte length of the data),
+ in the case of UTF-8 it is from 1 byte up to 4 bytes per character (the original design allowed up to 6);
+ is backward compatible with ASCII: one-byte characters in UTF-8 are encoded the same way as in ASCII;
+ is widely used nowadays: on the web, in e-mail, in operating systems (file names), in XML files etc.
+
+
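
The difference between character length and byte length described in the definition above can be checked with a short Python sketch. It is only an illustration and not part of the changeset; the helper name `utf8_lengths` and the sample strings are assumptions chosen for the example.

```python
def utf8_lengths(text: str) -> tuple[int, int]:
    """Return (number of characters, number of UTF-8 bytes) for text."""
    return len(text), len(text.encode("utf-8"))


# One character each, but a different number of bytes in UTF-8.
for ch in ["A", "é", "€", "😀"]:
    chars, octets = utf8_lengths(ch)
    print(f"U+{ord(ch):04X}: {chars} character, {octets} byte(s)")

# Pure ASCII text produces the same bytes in both encodings.
assert "ASCII".encode("utf-8") == "ASCII".encode("ascii")

# A name with two non-ASCII letters: 16 characters, 18 bytes.
print(utf8_lengths("František Kučera"))
```

The four sample characters take 1, 2, 3 and 4 bytes respectively, which is the full range allowed by the current UTF-8 specification.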