I am closing the issue for now as it is too intrusive to alter the allocators for one module.
One future possibility is to add `xx_malloc16` and `xx_realloc16` functions to each allocator, so that callers can explicitly request 16-byte alignment.
* For `tls_wolfssl` I will think of other strategies (perhaps bundling, like OpenSIPS)
* Wrappers for the `ser_*` functions that internally align to 16 bytes: for realloc this costs one `memmove` if the returned block has a different alignment from the original block
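
A rough sketch of the wrapper idea, to show where the extra `memmove` comes from. The names `malloc16`, `realloc16` and `free16` are hypothetical, and plain `malloc`/`realloc`/`free` stand in for the `ser_*` functions, whose real signatures are not assumed here. Each block is over-allocated by 16 bytes and the offset to the aligned address is stored in the byte just before the pointer handed to the caller. Overflow of `size + 16` is not checked.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define ALIGN 16

/* Hypothetical wrapper: returns a 16-byte aligned block of `size` bytes.
 * malloc() stands in for the underlying allocator. */
static void *malloc16(size_t size)
{
    unsigned char *raw = malloc(size + ALIGN);
    if (raw == NULL)
        return NULL;
    /* Offset from raw to the next aligned address, always in 1..16,
     * so there is at least one byte in front to record it. */
    size_t off = ALIGN - ((uintptr_t)raw & (ALIGN - 1));
    raw[off - 1] = (unsigned char)off;
    return raw + off;
}

static void free16(void *p)
{
    if (p == NULL)
        return;
    unsigned char *a = p;
    size_t off = a[-1];
    assert(off >= 1 && off <= ALIGN);
    free(a - off);
}

static void *realloc16(void *p, size_t size)
{
    if (p == NULL)
        return malloc16(size);
    unsigned char *a = p;
    size_t old_off = a[-1];
    unsigned char *raw = realloc(a - old_off, size + ALIGN);
    if (raw == NULL)
        return NULL; /* the old block is still valid */
    size_t new_off = ALIGN - ((uintptr_t)raw & (ALIGN - 1));
    /* The payload still sits at old_off inside the new raw block.
     * If the new block needs a different offset, shift the payload;
     * this is the single memmove mentioned above. Copying `size`
     * bytes stays inside the size + ALIGN bytes just allocated. */
    if (new_off != old_off)
        memmove(raw + new_off, raw + old_off, size);
    raw[new_off - 1] = (unsigned char)new_off;
    return raw + new_off;
}
```

With an underlying allocator that already returns 16-byte aligned blocks the offsets never differ and the `memmove` branch is not taken; it only matters for allocators with a weaker alignment guarantee.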