Support for SIMD operations for RGG1555
Compare changes
Files
3+ 15
− 5
@@ -12,7 +12,7 @@ use std::{
@@ -2091,6 +2091,9 @@ impl Ppu {
@@ -2099,12 +2102,16 @@ impl Ppu {
@@ -2215,8 +2222,11 @@ impl Ppu {
Up to 3x the performance for RGG1555 with SIMD operations. The current implementation is hardcoded for 16 bytes at a time (128 bits). Still requires some testing and validation for AVX2 and AVX512.
Fixes #45 (closed)