Skip to content
This repository has been archived by the owner on Apr 2, 2021. It is now read-only.

In depth SIMD level

Jordan Peck edited this page Mar 17, 2017 · 11 revisions

CPU instruction support

SSE2

  • Intel Pentium 4 - 2001
  • AMD Opteron/Athlon - 2003

SEE4.1

  • Intel Penryn - 2007
  • AMD Bulldozer - Q4 2011

AVX

  • Intel Sandy Bridge - Q1 2011
  • AMD Bulldozer - Q4 2011

AVX2

  • Intel Haswell - Q2 2013
  • AMD Carrizo - Q2 2015

FMA3

  • Intel Haswell - Q2 2013
  • AMD Piledriver - 2012

AVX512

  • Not available to consumers yet

Performance Comparisons

Timings below are x1000 ns to generate 32x32x32 points of noise on a single thread.

  • CPU: Intel Xeon Skylake @ 2.0Ghz
  • Compiler: Intel 17.0 x64
Noise Type AVX512 AVX2 SSE4.1 SSE2 FastNoise FastNoise 2D
White Noise 9 13 22 69 141 111
Value 101 160 337 797 635 364
Value Fractal 322 511 1074 2451 2006 1091
Perlin 152 342 605 986 964 476
Perlin Fractal 475 1060 1902 2999 3111 1472
Simplex 157 340 580 732 1189 875
Simplex Fractal 480 1045 1780 2251 2776 2589
Cellular 984 1472 3202 4079 2933 1074
Cubic 899 1393 2652 5957 2933 872
Cubic Fractal 2811 4255 8021 17905 9250 2755
CF Sampled 1 494 730 1321 2805
CF Sampled 2 96 176 282 503
CF Sampled 3 41 96 132 171

CF (Cubic Fractal) Sampled {sampling level}: Comparison of performance increases with different levels of sampling

Clone this wiki locally