Manually unrolling loops increases code size. This can sometimes improve performance, but more often than not it degrades it.
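For illustration only, here is a minimal sketch of the trade-off in C. The function names `sum_rolled` and `sum_unrolled4` are hypothetical and are not taken from the code this change touches. Both functions compute the same result; the hand-unrolled version simply carries more instructions and an extra tail loop.

```c
#include <stddef.h>

/* Plain loop: compact, and the compiler remains free to unroll it
 * when it judges that profitable for the target. */
long sum_rolled(const int *v, size_t n)
{
    long total = 0;
    for (size_t i = 0; i < n; i++)
        total += v[i];
    return total;
}

/* Hand-unrolled by 4 (hypothetical example): a larger loop body plus
 * a tail loop for the remaining 0..3 elements. */
long sum_unrolled4(const int *v, size_t n)
{
    long total = 0;
    size_t i = 0;

    /* Main body: four elements per iteration. */
    for (; i + 4 <= n; i += 4) {
        total += v[i];
        total += v[i + 1];
        total += v[i + 2];
        total += v[i + 3];
    }

    /* Tail: whatever is left over when n is not a multiple of 4. */
    for (; i < n; i++)
        total += v[i];

    return total;
}
```

Whether the second form is faster depends on the compiler, the target and instruction-cache pressure, so it would need to be measured rather than assumed.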
This splits out the integer and floating-point handling to add support for the software, single and quad float ABIs.
Get and set the remote address in DTLS server mode.
Implement NEON optimization for compute_weights_line. Also update the function signature to use ptrdiff_t for stack arguments (max_meaningful_diff, startx, endx).
This fixes a design mistake made way back in 2009 (80e384598265) that, for reasons now unknown, used std::shared_ptr<T> to manage sysex and patch change canvas items.