unsafe fn compute_d(h: __m128i) -> __m128i
Compute D from H using the R/F algorithm
D = swap(H) ⊕ (H0 × P1)