This is supported on x86-64 and target feature
Horizontal subtraction of adjacent pairs in the two packed vectors
of 8 32-bit floating points
In the result, sums of elements from
a are returned in locations of
indices 0, 1, 4, 5; while sums of elements from
b are locations
2, 3, 6, 7.