`af_glyph_hints_reload` called `FT_Outline_Get_Orientation` once per glyph solely to choose the major-direction sign.
Version 0.
Since the NSS reorganization in glibc 2.33 the --enable-static-nss option no longer changes the build: the files and dns services are built into libc unconditionally, and there is no longer any mechanism to bake the remaining services into libc.a.
Restore the POWER10 memchr implementation reverted in commit a7877bb6685300f159fa095c9f50b22b112cddb8 [BZ #33059], addressing the actual cause: the non-volatile vector register v20 used as the zero vector (M_VREG_ZERO) is replaced with the volatile register v17.
This has two immediate consequences: 1.
These can be implemented efficiently on most modern 64-bit compilers.
This is needed by the ops code to represent intermediate values for 32-bit formats, which can exceed the value range of int32_t (especially for intermediate products). I copied the math almost 1:1 from rational.c, but adapted it to use the 128-bit integer wrappers defined by int128.h.
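To illustrate why wide intermediates are needed, here is a minimal sketch of a rational multiply whose products are held in 128 bits before being reduced. It is not the code from this patch: `Rational64`, `gcd_u128` and `rational64_mul` are hypothetical names, and it relies on the `__int128` extension available in GCC and Clang on 64-bit targets rather than on the wrappers from int128.h.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical type for illustration; not the type added by the patch. */
typedef struct { int64_t num, den; } Rational64;

/* Euclid's algorithm on unsigned 128-bit values. */
static unsigned __int128 gcd_u128(unsigned __int128 a, unsigned __int128 b)
{
    while (b) {
        unsigned __int128 t = a % b;
        a = b;
        b = t;
    }
    return a;
}

/* Multiply two rationals. The raw products may exceed 64 bits, so they
 * are computed in 128 bits and reduced before narrowing.
 * Assumption: denominators are positive and the reduced result fits
 * in int64_t; both are checked with assert(). */
static Rational64 rational64_mul(Rational64 a, Rational64 b)
{
    assert(a.den > 0 && b.den > 0);

    __int128 num = (__int128)a.num * b.num;
    __int128 den = (__int128)a.den * b.den;

    unsigned __int128 mag = num < 0 ? -(unsigned __int128)num
                                    : (unsigned __int128)num;
    unsigned __int128 g = gcd_u128(mag, (unsigned __int128)den);
    if (g > 1) {
        num /= (__int128)g;
        den /= (__int128)g;
    }

    assert(num >= INT64_MIN && num <= INT64_MAX && den <= INT64_MAX);
    return (Rational64){ (int64_t)num, (int64_t)den };
}
```

For example, multiplying 6000000000/7 by 7000000000/3 produces a raw numerator of 4.2e19, which does not fit in int64_t, yet the reduced result 2000000000000000000/1 does.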
NVDEC and CUVID now output AV_PIX_FMT_P012 (12-bit 4:2:0), AV_PIX_FMT_P212 (12-bit 4:2:2) and AV_PIX_FMT_YUV444P10MSB / AV_PIX_FMT_YUV444P12MSB (10/12-bit 4:4:4) for high-bit-depth content, but these CUDA filters rejected the formats in their supported-format lists, breaking pipelines such as "-hwaccel cuda ...
NVDEC and CUVID decode 12-bit 4:2:0 content to AV_PIX_FMT_P012 and 12-bit 4:2:2 to AV_PIX_FMT_P212, but these formats were missing from the CUDA frames context supported format list.
This can be useful for any sort of separable filtering with arbitrary weights.
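As a rough illustration of what separable filtering with arbitrary weights means, the sketch below applies one 1-D weight vector along rows and another along columns. It is an assumption-laden example, not the kernel added here: `filter_axis` and `separable_filter` are hypothetical names, samples are plain floats, and edges are handled by clamping.

```c
#include <assert.h>
#include <stddef.h>

/* Apply `taps` weights along one axis. `stride` is the distance between
 * consecutive samples on that axis. Out-of-range taps are clamped to
 * the nearest edge sample. */
static void filter_axis(const float *src, float *dst, size_t count,
                        ptrdiff_t stride, const float *weights, int taps)
{
    assert(taps > 0 && count > 0);
    int half = taps / 2;

    for (size_t i = 0; i < count; i++) {
        float acc = 0.0f;
        for (int k = 0; k < taps; k++) {
            ptrdiff_t j = (ptrdiff_t)i + k - half;
            if (j < 0)
                j = 0;
            if (j >= (ptrdiff_t)count)
                j = (ptrdiff_t)count - 1;
            acc += weights[k] * src[j * stride];
        }
        dst[(ptrdiff_t)i * stride] = acc;
    }
}

/* Horizontal pass into `tmp`, then vertical pass into `dst`.
 * All three buffers hold w * h floats in row-major order. */
static void separable_filter(const float *src, float *tmp, float *dst,
                             int w, int h,
                             const float *wx, int taps_x,
                             const float *wy, int taps_y)
{
    for (int y = 0; y < h; y++)
        filter_axis(src + (size_t)y * w, tmp + (size_t)y * w,
                    (size_t)w, 1, wx, taps_x);
    for (int x = 0; x < w; x++)
        filter_axis(tmp + x, dst + x, (size_t)h, w, wy, taps_y);
}
```

The two passes cost taps_x + taps_y multiplies per sample instead of taps_x * taps_y for the equivalent 2-D kernel, which is the usual reason to prefer the separable form.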
This may be faster or slower than the existing specialized kernels, so I opted not to prefer it by default.
This avoids an underflow if short_seek is negative (which can happen if e.g.
Adds lists for math.
These represent components which have not (yet) been modified from their input values (i.e.
Not needed currently but will be used for parallel splits.
This already helps performance as-is, and will help far more once we add the ability for the memcpy backend to do a refcopy instead of an actual copy.
This affects a large number of conversions across the board, either: 1.