summaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAgeFilesLines
* Drop -s option. There's no point implementing it.John Mark Bell2008-11-191-1/+1
| | | | svn path=/trunk/iconv/; revision=5749
* Make handling of EILSEQ resynchronise stream if we've been asked to ignore ↵John Mark Bell2008-11-191-2/+18
| | | | | | | | errors. Some kind of handling for failure to read from input file. svn path=/trunk/iconv/; revision=5748
* Shuffle data through a fixed-size bufferJohn Mark Bell2008-11-191-23/+59
| | | | svn path=/trunk/iconv/; revision=5747
* Dump the list of known encoding aliases when asked. Don't expect any kind of ↵John Mark Bell2008-11-194-8/+9
| | | | | | sort order -- that would require a level of thought I don't have right now. svn path=/trunk/iconv/; revision=5746
* Use a zip binary that has half a chance of preserving filetypes.John Mark Bell2008-11-191-2/+2
| | | | svn path=/trunk/iconv/; revision=5745
* On RISC OS, use Unicode:, rather than attempting to getenv("Unicode$Path") ↵John Mark Bell2008-11-194-8/+8
| | | | | | and concatenating the leafname on the end. svn path=/trunk/iconv/; revision=5744
* Fix riscos-dist target -- failed to ensure Aliases file ended in release tree.John Mark Bell2008-11-191-0/+2
| | | | svn path=/trunk/iconv/; revision=5743
* Release announcement template.John Mark Bell2008-11-191-0/+40
| | | | svn path=/trunk/iconv/; revision=5742
* Remove obsolete makefileJohn Mark Bell2008-11-191-80/+0
| | | | svn path=/trunk/iconv/; revision=5741
* More detailed changelogJohn Mark Bell2008-11-191-1/+18
| | | | svn path=/trunk/iconv/; revision=5740
* Update patchJohn Mark Bell2008-11-191-25/+40
| | | | svn path=/trunk/iconv/; revision=5739
* Stage Aliases file directly into distribution template.John Mark Bell2008-11-197-307/+11
| | | | | | | Fix compilation of makealiases when cross-compiling. Update dependencies so that aliases file gets built when needed. svn path=/trunk/iconv/; revision=5738
* Resurrect the sources of the Aliases file generator.John Mark Bell2008-11-195-0/+2230
| | | | svn path=/trunk/iconv/; revision=5737
* Ugh. Jump through hoops for RiscPkg. I'm not sure why I'm bothering with this.John Mark Bell2008-11-192-2/+20
| | | | svn path=/trunk/iconv/; revision=5736
* Create zip fileJohn Mark Bell2008-11-192-2/+4
| | | | svn path=/trunk/iconv/; revision=5735
* Move the declarations of iconv_initialise/iconv_finalise to a different header.John Mark Bell2008-11-1915-30/+79
| | | | | | | | | | | | | | | This keeps the public iconv.h free of such nonsense. Move the source for the RISC OS stubs to the distribution template tree. We will no longer shipped compiled stubs. People are quite capable of compiling this themselves. Also take the opportunity to tidy it up a bit. Bump the version number to 0.09 Introduce a "riscos-dist" target in Makefile-riscos. Update various bits of documentation. svn path=/trunk/iconv/; revision=5734
* Fix up output buffer length on memory exhaustionJohn Mark Bell2008-11-191-1/+5
| | | | svn path=/trunk/iconv/; revision=5733
* Fix strncasecmp implementation.John Mark Bell2008-11-191-1/+1
| | | | svn path=/trunk/iconv/; revision=5732
* Patch for UnicodeLib to make all enabled tests pass.John Mark Bell2008-11-192-0/+531
| | | | | | Document the need for this patch, and how to apply it. svn path=/trunk/iconv/; revision=5731
* Tools tree. Currently comprises something approximating a parser for the ↵John Mark Bell2008-11-191-0/+127
| | | | | | Unihan database. svn path=/trunk/iconv/; revision=5730
* Restore EUC-KR testdata to original contents, now we've added the ↵John Mark Bell2008-11-191-0/+3
| | | | | | appropriate mappings to KS X 1001. svn path=/trunk/iconv/; revision=5729
* Add three new mappings from CP949:John Mark Bell2008-11-191-0/+0
| | | | | | | | | + 0xA2E6 -> U+20AC + 0xA2E7 -> U+00AE + 0xA2E8 -> U+327E svn path=/trunk/iconv/; revision=5728
* Modify Big5 testdata to match expectations.John Mark Bell2008-11-192-0/+11
| | | | | | UnicodeLib's implementation of Big5 contains mapping table entries for 0xf9d6-0xf9dc, inherited from ETENS/CP950. It also implements a couple of Mac extensions, too. svn path=/trunk/iconv/; revision=5727
* Poke around with the ShiftJIS testdata to make the test pass.John Mark Bell2008-11-192-1880/+1247
| | | | | | | | This probably isn't the best approach, but there we go. UnicodeLib's ShiftJIS implementation incorporates the CP932 extensions, along with a number of Mac extensions. Thus, compare SHIFT_JIS.IRREVERSIBLE.TXT with CP932.IRREVERSIBLE.TXT to give an indication of how complete/bug-free the CP932 support is. svn path=/trunk/iconv/; revision=5726
* Update mapping tables for JIS X 0208 and JIS X 0212.John Mark Bell2008-11-192-0/+0
| | | | | | | | | | | | | | | Changes: JIS X 0208: + The mapping for entry 1/35 has been changed from U+005C to U+FF3C. JIS X 0212: + The mapping for entry 2/23 has been changed from U+007E to U+FF5E. These mappings better represent the codepoints in question. They also remove irreversible mappings for U+005C and U+007E, thus making round-trip conversions more robust. svn path=/trunk/iconv/; revision=5725
* Revert unintended commit of new mapping tablesJohn Mark Bell2008-11-192-0/+0
| | | | svn path=/trunk/iconv/; revision=5724
* Ignore expected failure of EUC-JP test -- there's no way to load two ↵John Mark Bell2008-11-193-1/+1
| | | | | | charsets into G0 at once, which is what the test is expecting. For future reference, should we want to fix this, 0x5C should map to U+005C and U+00A5, 0x7E should map to U+007E and U+203E. U+005C/U+007E come from US-ASCII, which is the default mapping. The other pair come from JIS X 0201 (Roman). svn path=/trunk/iconv/; revision=5723
* Drop mappings to Unicode private use area. These aren't likely to be of any ↵John Mark Bell2008-11-191-1880/+0
| | | | | | use when performing conversion. I can't see where this is specified, either. svn path=/trunk/iconv/; revision=5722
* Disable the GBK test -- at present, we support only GBK/{1,2}, as that's all ↵John Mark Bell2008-11-181-1/+1
| | | | | | | | that can be achieved through the use of an ISO-2022-based codec. To get full GBK support will most likely require a new codec, preferably supporting the whole of GB18030 (which is a superset of GBK). svn path=/trunk/iconv/; revision=5721
* Add missing characters from GB2312-80.John Mark Bell2008-11-181-0/+6
| | | | svn path=/trunk/iconv/; revision=5720
* Make test for Acorn Latin1 work.John Mark Bell2008-11-182-9/+2
| | | | svn path=/trunk/iconv/; revision=5719
* Lose 3 entries that aren't mapped except in CP949, which this isn't.John Mark Bell2008-11-181-3/+0
| | | | svn path=/trunk/iconv/; revision=5718
* Change previous logic -- we want to ignore U+FFFF regardless of the translit ↵John Mark Bell2008-11-181-1/+1
| | | | | | state. svn path=/trunk/iconv/; revision=5717
* Treat U+FFFF as invalid. Otherwise, we end up writing the first mapped out ↵John Mark Bell2008-11-181-1/+1
| | | | | | character encountered in the target encoding. svn path=/trunk/iconv/; revision=5716
* Don't bail when U+FEFF results in EINVAL.John Mark Bell2008-11-181-1/+1
| | | | | | UnicodeLib eats this at the start of input, assuming it's a BOM. There's nothing we can do to avoid this, so work around it here. In practice, this shouldn't be a problem -- noone's going to sanely want to convert a string containing a BOM and nothing else. svn path=/trunk/iconv/; revision=5715
* ECMA-35. This is identical to ISO-2022.John Mark Bell2008-11-181-0/+0
| | | | svn path=/trunk/iconv/; revision=5714
* Bring ISO-2022-KR test data in line with what UnicodeLib produces. This is ↵John Mark Bell2008-11-181-2/+2
| | | | | | semantically equivalent to what was there before, it's just that UnicodeLib outputs the G1 designation at the start rather that immediately before the first character that needs KS X 1001. svn path=/trunk/iconv/; revision=5713
* Rather less hideous approach to error detection and input pointer maintenence.John Mark Bell2008-11-185-159/+63
| | | | | | | | | We now simply decode one character at a time and check for error afterwards. This has the benefit of being less code, clearer, less likely to crash if encoding state changes involve memory (de)allocation, and removes the reliance on UnicodeLib internals. It's probably slower, however, but correctness is more important here. Fix ISO-2022-JP-2 test data to not include characters from the JIS X 0201-1976 Kana set -- this set is not used in ISO-2022-JP-2. Implement verbose flag in *Iconv. svn path=/trunk/iconv/; revision=5712
* Comment out the rest of the tests for encodings we don't support.John Mark Bell2008-11-161-50/+50
| | | | | | | It appears that there's a fair number of issues with the handling of CJK charsets, particularly in the case of ISO-2022-x, which segfault. Make test binaries depend on the module target, so the module gets built if make test is done on a clean tree. svn path=/trunk/iconv/; revision=5710
* Comment out more tests for charsets we don't supportJohn Mark Bell2008-11-161-35/+35
| | | | svn path=/trunk/iconv/; revision=5709
* Bring MacRoman test data into line with current mappings -- 0xDB should map ↵John Mark Bell2008-11-161-1/+1
| | | | | | to U+20AC (euro) and not U+00A4 (currency sign) svn path=/trunk/iconv/; revision=5708
* Fix MacRoman mapping table -- 0xF3 should map to U+00DB and not U+00D8John Mark Bell2008-11-161-0/+0
| | | | svn path=/trunk/iconv/; revision=5707
* Fix error in CP1256 table -- 0xC0 should be mapped to U+06C1, not U+061CJohn Mark Bell2008-11-161-0/+0
| | | | svn path=/trunk/iconv/; revision=5706
* Factor out acquisition of paths to files in the Unicode resource.John Mark Bell2008-11-162-72/+139
| | | | | | | This fixes *ReadAliases on <> RISC OS, and Iconv's eightbit codec. Fix iconv_eightbit_read to ensure that it treats the input as unsigned bytes. svn path=/trunk/iconv/; revision=5705
* Add set 06/06 (Latin 10) from ROOL tree.John Mark Bell2008-11-161-0/+0
| | | | svn path=/trunk/iconv/; revision=5704
* Bring set 05/04 (Thai) into line with 8859-11 spec:John Mark Bell2008-11-161-0/+0
| | | | | | | + 0xdb,0xdc,0xdd,0xde are undefined + 0xfc,0xfd,0xfe,0xff are undefined svn path=/trunk/iconv/; revision=5703
* Bring set 05/14 (Hebrew) into line with 8859-8 spec:John Mark Bell2008-11-161-0/+0
| | | | | | | + 0xaf mapping is now to U+00AF and not U+203E + 0xfd/0xfe should be mapped to U+200E/U+200F and not U+020E/U+020F, respectively svn path=/trunk/iconv/; revision=5702
* Update set 04/06 (Greek) to bring it in line with the spec. It was ↵John Mark Bell2008-11-161-0/+0
| | | | | | previously based on the ISO-8859-7 FCD, after which 0xAE was removed, and the mapping of 0xB7 was changed. svn path=/trunk/iconv/; revision=5701
* We don't support ISO-8859-6John Mark Bell2008-11-161-1/+1
| | | | svn path=/trunk/iconv/; revision=5700
* Ensure that we return the correct errors and, when we do, point to the ↵John Mark Bell2008-11-163-39/+180
| | | | | | | | | | correct place in the input sequence (namely the start of the erroneous sequence). Unfortunately, UnicodeLib reads past the erroneous sequence so we previously returned a pointer to the middle/end of the sequence rather than the start. The only way I could think of doing this was to perform the conversion twice -- counting the number of successfully processed characters first, then to convert that number of characters again. We then play spot-the-difference with the output parameters to determine the correct return value. As encodings provided by UnicodeLib may be stateful, we need to be able to save the current state of the codecs and then restore them when we attempt the second conversion (otherwise, state-based encodings will do entirely the wrong thing). There's no API in UnicodeLib to do this and, given it's such an unlikely use case, I've not added any. Instead, we get to poke around in the UnicodeLib internals and do the save/restore ourselves. svn path=/trunk/iconv/; revision=5699