Support for SIMD operations for RGG1555
Compare changes
+ 16
− 0
@@ -2095,6 +2095,18 @@ impl Ppu {
@@ -2132,6 +2144,10 @@ impl Ppu {
Up to 3x the performance for RGG1555 with SIMD operations. The current implementation is hardcoded for 16 bytes at a time (128 bits). Still requires some testing and validation for AVX2 and AVX512.
Fixes #45 (closed)