Added ESP32-S3 SIMD
This release adds initial support for accelerating decoding with the ESP32-S3's SIMD instructions. My measurements show a 20-40% speedup, depending on the options. I wrote a short blog post describing how I learned to use these instructions here:
https://bitbanksoftware.blogspot.com/2024/01/surprise-esp32-s3-has-few-simd.html