This repository has been archived by the owner on Apr 2, 2021. It is now read-only.
In depth SIMD level
Jordan Peck edited this page Mar 14, 2017
##CPU instruction support
SSE2
- Intel Pentium 4 - Q4 2000
- AMD Opteron/Athlon - 2003
SSE4.1
- Intel Penryn - 2007
- AMD Bulldozer - Q4 2011
AVX
- Intel Sandy Bridge - Q1 2011
- AMD Bulldozer - Q4 2011
AVX2
- Intel Haswell - Q2 2013
- AMD Carrizo - Q2 2015
FMA3
- Intel Haswell - Q2 2013
- AMD Piledriver - 2012
AVX512
- Not available in consumer CPUs yet (as of March 2017)
##Performance Comparisons
Timings below are in microseconds (x1000 ns) to generate a 32x32x32 set of noise points; lower is better.
- CPU: Intel Xeon Skylake @ 2.0 GHz
- Compiler: Intel 17.0 x64
Noise Type | AVX512 | AVX2 | SSE4.1 | SSE2 |
---|---|---|---|---|
White Noise | 9 | 13 | 23 | 69 |
Value | 110 | 147 | 327 | 760 |
Value Fractal | 365 | 474 | 1108 | 2595 |
Perlin | 167 | 334 | 624 | 1021 |
Perlin Fractal | 534 | 1045 | 2015 | 3174 |
Simplex | 173 | 338 | 592 | 732 |
Simplex Fractal | 513 | 1121 | 1793 | 2242 |
Cellular | 1055 | 1406 | 3299 | 4024 |
Cubic | 1001 | 1349 | 2586 | 5672 |
Cubic Fractal | 2880 | 4043 | 7778 | 17035 |
CF Sampled 1 | 695 | 720 | 1308 | 2669 |
CF Sampled 2 | 181 | 172 | 298 | 502 |
CF Sampled 3 | 119 | 101 | 135 | 170 |
CF (Cubic Fractal) Sampled {sampling level}: shows how Cubic Fractal performance improves at different sampling levels.
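
To make the table easier to interpret, the following sketch converts a few rows into nanoseconds per point and a speedup relative to SSE2. It only does arithmetic on numbers copied from the table; the `Timing` struct and the helper names (`nsPerPoint`, `speedup`, `printSummary`) are hypothetical and not part of the library.

```cpp
#include <cstdio>

// One row of the table above: microseconds per 32x32x32 set, per SIMD level.
struct Timing {
    const char* noiseType;
    double avx512, avx2, sse41, sse2;
};

// Each timed run generates 32 * 32 * 32 = 32768 points.
constexpr double kPointsPerSet = 32.0 * 32.0 * 32.0;

// Converts a table entry (microseconds per set) to nanoseconds per point.
constexpr double nsPerPoint(double microsecondsPerSet) {
    return microsecondsPerSet * 1000.0 / kPointsPerSet;
}

// How many times faster `fastUs` is than `baselineUs`.
constexpr double speedup(double baselineUs, double fastUs) {
    return baselineUs / fastUs;
}

// A few rows copied verbatim from the table.
constexpr Timing kRows[] = {
    {"Value",   110.0, 147.0, 327.0,  760.0},
    {"Perlin",  167.0, 334.0, 624.0, 1021.0},
    {"Simplex", 173.0, 338.0, 592.0,  732.0},
};

// Prints per-point cost and speedup over SSE2 for the AVX2 column.
void printSummary() {
    for (const Timing& row : kRows) {
        std::printf("%-8s AVX2: %.2f ns/point, %.2fx faster than SSE2\n",
                    row.noiseType, nsPerPoint(row.avx2),
                    speedup(row.sse2, row.avx2));
    }
}
```

For example, the Perlin AVX2 entry of 334 corresponds to roughly 10.2 ns per point, about 3.06 times faster than the SSE2 entry of 1021.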