author | Luke Shumaker <lukeshu@lukeshu.com> | 2004-11-22 04:15:50 -0500 |
---|---|---|
committer | Luke Shumaker <lukeshu@lukeshu.com> | 2004-11-22 04:15:50 -0500 |
commit | e49ab49fe9202b6f875cf9cba6bad2f320e24fe4 | |
tree | 0aa2ee8c1c8b6188b6725a7ec1e4139d46fbfbdd /readme.txt | |
parent | 040c6f479435a2b4f2a7cb9ef4bd65fca3ec2fcc | |
parent | 7d347a05ce025a9aef28bcf72089e1388dd48d13 | |
http://web.archive.org/web/20041122041550/http:/www.unicode.org:80/Public/PROGRAMS/CVTUTF/HEADPROGRAMS/CVTUTF
Diffstat (limited to 'readme.txt')
-rw-r--r-- | readme.txt | 43 |
1 files changed, 43 insertions, 0 deletions
diff --git a/readme.txt b/readme.txt
new file mode 100644
index 0000000..b9f17fb
--- /dev/null
+++ b/readme.txt
@@ -0,0 +1,43 @@
+
+The accompanying C source code file "ConvertUTF.c" and the associated header
+file "ConvertUTF.h" provide for conversion between various transformation
+formats of Unicode characters. The following conversions are supported:
+
+    UTF-32 to UTF-16
+    UTF-32 to UTF-8
+    UTF-16 to UTF-32
+    UTF-16 to UTF-8
+    UTF-8 to UTF-16
+    UTF-8 to UTF-32
+
+In addition, there is a test harness which runs various tests.
+
+The files "CVTUTF7.C" and "CVTUTF7.H" are for archival and historical purposes
+only. They have not been updated to Unicode 3.0 or later and should be
+considered obsolescent. "CVTUTF7.C" contains two functions that can convert
+between UCS2 (i.e., the BMP characters only) and UTF-7. Surrogates are
+not supported, the code has not been tested, and should be considered
+unsuitable for general purpose use.
+
+Please submit any bug reports about these programs here:
+
+    http://www.unicode.org/unicode/reporting.html
+
+Version 1.0: initial version.
+
+Version 1.1: corrected some minor problems; added stricter checks.
+
+Version 1.2: corrected switch statements associated with "extraBytesToRead"
+    in 4 & 5 byte cases, in functions for conversion from UTF8.
+    Note: formally, the 4 & 5 byte cases are illegal in the latest
+    UTF8, but the table and this code has always catered for those,
+    cases since at one time they were legal.
+
+Version 1.3: Updated UTF-8 legality check;
+    updated to use UNI_MAX_LEGAL_UTF32 in UTF-32 conversions
+    Updated UTF-8 legality tests in harness.c
+
+
+Last update: October 19, 2004
+
+
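For readers unfamiliar with what a "UTF-32 to UTF-8" conversion with a legality check involves, the following is a minimal sketch of encoding a single code point. It is not the code from ConvertUTF.c: the function name `sketch_utf32_to_utf8` and the constant `SKETCH_MAX_LEGAL_UTF32` are hypothetical names chosen for illustration, and the upper bound of 0x10FFFF and the rejection of surrogate values are assumptions based on the current definition of UTF-8, not on the contents of this commit.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical constant for this sketch: the highest legal code point. */
#define SKETCH_MAX_LEGAL_UTF32 0x0010FFFFu

/*
 * Hypothetical helper, not part of ConvertUTF.c.
 * Encodes one UTF-32 code point as UTF-8 into out, which must have room
 * for at least 4 bytes. Returns the number of bytes written, or 0 if the
 * code point is not legal (above the maximum, or a surrogate value).
 */
static size_t sketch_utf32_to_utf8(uint32_t cp, unsigned char *out)
{
    assert(out != NULL);

    if (cp > SKETCH_MAX_LEGAL_UTF32) {
        return 0; /* beyond the range UTF-8 may legally encode */
    }
    if (cp >= 0xD800u && cp <= 0xDFFFu) {
        return 0; /* surrogate values are not legal on their own */
    }

    if (cp < 0x80u) {
        /* 1 byte: 0xxxxxxx */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800u) {
        /* 2 bytes: 110xxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xC0u | (cp >> 6));
        out[1] = (unsigned char)(0x80u | (cp & 0x3Fu));
        return 2;
    }
    if (cp < 0x10000u) {
        /* 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xE0u | (cp >> 12));
        out[1] = (unsigned char)(0x80u | ((cp >> 6) & 0x3Fu));
        out[2] = (unsigned char)(0x80u | (cp & 0x3Fu));
        return 3;
    }

    /* 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx */
    out[0] = (unsigned char)(0xF0u | (cp >> 18));
    out[1] = (unsigned char)(0x80u | ((cp >> 12) & 0x3Fu));
    out[2] = (unsigned char)(0x80u | ((cp >> 6) & 0x3Fu));
    out[3] = (unsigned char)(0x80u | (cp & 0x3Fu));
    return 4;
}
```

A caller would pass a code point and a 4-byte buffer, then treat a return value of 0 as "illegal input". The sketch handles one code point at a time and never emits the longer forms that the Version 1.2 note describes as formally illegal.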