summaryrefslogtreecommitdiff
path: root/utf8proc_data.c
Commit message (Collapse)AuthorAgeFilesLines
* update to Unifont 9 (for Unicode 9 charwidths) (#75)Steven G. Johnson2016-07-121-505/+506
|
* Smaller tables (#68)Benito van der Zander2016-07-121-11669/+8966
| | | | | | | | | | | | | | | | | | | | | | | | * convert sequences to utf-16 (saves 25kb) * store sequence length in properties instead using -1 termination (saves 10kb) * cache index for slightly faster data creation * store lower/upper/title mapping in sequence array (saves 25kb). Add utf8proc_totitle, as title_mapping cannot be used to get the title codepoint anymore. Rename xxx_mapping to xxx_seqindex, so programs assuming a value with the old meaning fail at compile time * change combination array data type to uint16 (saves 40kb) * merge 1st and 2nd comb index (saves 50kb) * kill empty prefix/suffix in combination array (saves 50kb) * there was no need to have a separate combination start array, it can be merged in a single array * some fixes * mark the table as const again * and regen
* Unicode 9 updates (#70)Keno Fischer2016-06-281-11102/+11438
| | | | | | | | | | | | | | | | | | | | | | | * Updates for Unicode 9.0.0 TR29 Changes - New rules GB10/(12/13) are used to combine emoji-zwj sequences/ (force grapheme breaks every two RI codepoints). Unfortunately this breaks statelessness of grapheme-boundary determination. Deal with this by ignoring the problem in utf8proc_grapheme_break, and by hacking in a special case in decompose - ZWJ moved to its own boundclass, update what is now GB9 accordingly. - Add comments to indicate which rule a given case implements - The Number of bound classes Now exceeds 4 bits, expand to 8 and reorganize fields * Import Unicode 9 data * Update Grapheme break API to expose state override * Bump MAJOR version
* Reduce the size of the binary.Michaël Meyer2015-12-091-6668/+6668
| | | | | Use integers instead of pointers in Unicode tables. Saves 226 kb / 716 kb in the compiled library.
* Update Unicode dataPeter Colberg2015-10-291-2635/+2629
| | | | Fixes Travis builds on Ubuntu 12.04 LTS with Ruby 1.9.3-p551.
* Update Unicode dataJiahao Chen2015-06-291-2728/+2766
|
* Updated Unicode 8 data - now sorted internally by data generatorJiahao Chen (陈家豪)2015-06-261-2770/+2761
|
* Update Unicode dataJiahao Chen2015-06-261-893/+896
|
* fix #46 (make sure symbol-like codepoints have nonzero width even if they ↵Steven G. Johnson2015-06-241-1803/+1715
| | | | aren't in Unifont)
* Updated data file to Unicode 8.0.0Jiahao Chen2015-06-231-7430/+7972
|
* Prefix other C99 typedefs with utf8proc_Tony Kelman2015-04-061-4/+4
|
* fix #2: add charwidth functionSteven G. Johnson2015-03-121-9939/+10242
|
* update graphemes for Unicode 7, add utf8proc_grapheme_break functionSteven G. Johnson2014-12-121-10315/+10549
|
* Update utf8proc_data.c (generated by data_generator.rb)Jiahao Chen2014-07-181-8830/+11182
|
* import of utf8proc-v1.1.6v1.1.6Steven G. Johnson2014-07-151-0/+13383