pub fn _mm256_hadd_ps(a: __m256, b: __m256) -> __m256Available on (x86 or x86-64) and target feature 
avx and x86-64 only.Expand description
Horizontal addition of adjacent pairs in the two packed vectors
of 8 32-bit floating points a and b.
In the result, sums of elements from a are returned in locations of
indices 0, 1, 4, 5; while sums of elements from b are locations
2, 3, 6, 7.