Skip to content
This repository has been archived by the owner on Apr 2, 2021. It is now read-only.

In depth SIMD level

Jordan Peck edited this page Mar 14, 2017 · 11 revisions

##CPU instruction support SSE2

  • Intel Pentium 4 - 2001
  • AMD Opteron/Athlon - 2003

SEE4.1

  • Intel Penryn - 2007
  • AMD Bulldozer - Q4 2011

AVX

  • Intel Sandy Bridge - Q1 2011
  • AMD Bulldozer - Q4 2011

AVX2

  • Intel Haswell - Q2 2013
  • AMD Carrizo - Q2 2015

FMA3

  • Intel Haswell - Q2 2013
  • AMD Piledriver - 2012

AVX512

  • Not available to consumers yet

##Performance Comparisons

Timings below are x1000 ns to generate 32x32x32 points of noise.

  • CPU: Intel Xeon Skylake @ 2.0Ghz
  • Compiler: Intel 17.0 x64
Noise Type AVX512 AVX2 SSE4.1 SSE2
White Noise 9 13 23 69
Value 110 147 327 760
Value Fractal 365 474 1108 2595
Perlin 167 334 624 1021
Perlin Fractal 534 1045 2015 3174
Simplex 173 338 592 732
Simplex Fractal 513 1121 1793 2242
Cellular 1055 1406 3299 4024
Cubic 1001 1349 2586 5672
Cubic Fractal 2880 4043 7778 17035
CF Sampled 1 695 720 1308 2669
CF Sampled 2 181 172 298 502
CF Sampled 3 119 101 135 170

CF (Cubic Fractal) Sampled {sampling level}: Comparison of performance increases with different levels of sampling

Clone this wiki locally