reminders are good.
reminder:
- understand the requirements of the task before you get started executing.
- really understand the root of the issue before you start coming up with solutions.
learn that __m128i _mm_hadd_epi32 (__m128i a, __m128i b)
is an SSE3 instruction, not SSE2. that is sad.