added SIMD wrappers for ARM NEON, Intel SSE4.1 and AVX2

This commit is contained in:
Ahmet Inan 2018-12-11 09:05:12 +01:00
commit 4f11a22311
5 changed files with 4004 additions and 0 deletions

View file

@ -65,3 +65,10 @@ It computes the following, but having only O(N) complexity and using O(1) extra
output[i] = op(output[i], input[j]);
```
### [simd.hh](simd.hh)
Single instruction, multiple data ([SIMD](https://en.wikipedia.org/wiki/SIMD)) wrappers for:
* [ARM NEON](https://en.wikipedia.org/wiki/ARM_architecture#Advanced_SIMD_(NEON)) ([neon.hh](neon.hh))
* [Intel SSE4.1](https://en.wikipedia.org/wiki/SSE4) ([sse4_1.hh](sse4_1.hh))
* [Intel AVX2](https://en.wikipedia.org/wiki/Advanced_Vector_Extensions) ([avx2.hh](avx2.hh))