[indic] Disallow vowel mark combinations that spoof other vowel marks

System Internals / HarfBuzz - Behdad Esfahbod [behdad.org] - 3 October 2018 13:53 EDT

Fixes https://github.com/harfbuzz/harfbuzz/issues/1019

New numbers:

BENGALI: 353725 out of 354188 tests passed. 463 failed (0.130722%)
DEVANAGARI: 707261 out of 707394 tests passed. 133 failed (0.0188014%)
GUJARATI: 366353 out of 366457 tests passed. 104 failed (0.0283799%)
GURMUKHI: 60729 out of 60747 tests passed. 18 failed (0.0296311%)
KANNADA: 951300 out of 951913 tests passed. 613 failed (0.0643966%)
MALAYALAM: 1048136 out of 1048334 tests passed. 198 failed (0.0188871%)
ORIYA: 42327 out of 42329 tests passed. 2 failed (0.00472489%)
SINHALA: 271596 out of 271847 tests passed. 251 failed (0.0923313%)
TAMIL: 1091754 out of 1091754 tests passed. 0 failed (0%)
TELUGU: 970555 out of 970573 tests passed. 18 failed (0.00185457%)

Devanagari regressed because Uniscribe doesn't enforce the full set of disallowed combinations.

Tests added using the in-tree *-vowel-letters.txt files and Noto fonts.
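
For illustration only, here is a minimal sketch of the kind of check this change is about. It is not the code in src/hb-ot-shape-complex-indic.cc. The helper names is_spoofing_pair and break_spoofing_pairs are hypothetical, the table covers only a few Devanagari pairs that the Unicode Standard lists as "do not use" spellings of independent vowels, and separating the pair with U+25CC DOTTED CIRCLE is an assumption about how a disallowed combination might be made visible.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical helper, not HarfBuzz API. Returns true when `mark` placed
// after `base` would render like an independent vowel letter that has its
// own code point. Only a handful of Devanagari pairs are listed here; a
// complete table would cover many more pairs and the other Indic scripts.
static bool is_spoofing_pair (char32_t base, char32_t mark)
{
  switch (base)
  {
    case 0x0905u: /* DEVANAGARI LETTER A */
      switch (mark)
      {
        case 0x093Eu: /* would look like U+0906 LETTER AA */
        case 0x094Bu: /* would look like U+0913 LETTER O */
        case 0x094Cu: /* would look like U+0914 LETTER AU */
          return true;
        default:
          break;
      }
      break;

    case 0x090Fu: /* DEVANAGARI LETTER E */
      if (mark == 0x0947u) /* would look like U+0910 LETTER AI */
        return true;
      break;

    default:
      break;
  }
  return false;
}

// Hypothetical helper. Copies `in`, inserting U+25CC DOTTED CIRCLE between
// the two halves of every disallowed pair so the mark can no longer attach
// to the vowel letter. The choice of a dotted circle is an assumption.
static std::u32string break_spoofing_pairs (const std::u32string &in)
{
  std::u32string out;
  for (std::size_t i = 0; i < in.size (); i++)
  {
    if (i > 0 && is_spoofing_pair (in[i - 1], in[i]))
      out.push_back (0x25CCu); /* DOTTED CIRCLE */
    out.push_back (in[i]);
  }
  return out;
}
```

With this sketch, break_spoofing_pairs applied to <U+0905, U+093E> yields <U+0905, U+25CC, U+093E>, while an ordinary consonant plus matra such as <U+0915, U+093E> passes through unchanged.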

df32eaae [indic] Disallow vowel mark combinations that spoof other vowel marks
src/hb-ot-shape-complex-indic.cc | 256 ++++++++++++++++++++-
test/shaping/data/in-house/Makefile.sources | 1 +
.../03e3f463c3a985bc42096620cc415342818454fb.ttf | Bin 0 -> 2904 bytes
.../1a5face3fcbd929d228235c2f72bbd6f8eb37424.ttf | Bin 0 -> 8188 bytes
.../2c25beb56d9c556622d56b0b5d02b4670c034f89.ttf | Bin 0 -> 2460 bytes
.../604026ae5aaca83c49cd8416909d71ba3e1c1194.ttf | Bin 0 -> 4120 bytes
.../738d9f3b8c2dfd03875bf35a61d28fd78faf17c8.ttf | Bin 0 -> 2336 bytes
.../7d18685e1529e4ceaad5b6095dfab2f9789e5bce.ttf | Bin 0 -> 3452 bytes
.../881642af1667ae30a54e58de8be904566d00508f.ttf | Bin 0 -> 2760 bytes
.../af85624080af5627fb050f570d148a62f04fda74.ttf | Bin 0 -> 2656 bytes
.../tests/indic-vowel-letter-spoofing.tests | 53 +++++
11 files changed, 309 insertions(+), 1 deletion(-)

Upstream: cgit.freedesktop.org

