The SDK implements every algorithm in portable software, but you can optionally enable hardware-optimized code. For example, on Cortex-M microcontrollers we use CMSIS-DSP to speed up certain vector operations. These optimizations are selected at compile time in config.hpp, and most of them live in numpy.hpp. If you want to add optimizations for a new target, this is a good place to start. We welcome contributions!
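
As a rough illustration of the compile-time selection described above, the sketch below guards a CMSIS-DSP call behind a preprocessor switch and falls back to a plain software loop otherwise. The macro `EXAMPLE_USE_CMSIS_DSP` and the function `dot_product` are hypothetical names chosen for this example, not the SDK's actual identifiers; the real switches are the ones defined in config.hpp. `arm_dot_prod_f32` is a CMSIS-DSP function declared in `arm_math.h`.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical switch for this sketch only. In the SDK, the real
// switches are defined in config.hpp.
#ifndef EXAMPLE_USE_CMSIS_DSP
#define EXAMPLE_USE_CMSIS_DSP 0
#endif

#if EXAMPLE_USE_CMSIS_DSP
#include "arm_math.h" // CMSIS-DSP, only available when building for Arm targets
#endif

// Hypothetical helper: dot product of two float vectors of length n.
inline float dot_product(const float *a, const float *b, size_t n) {
    float result = 0.0f;
#if EXAMPLE_USE_CMSIS_DSP
    // Hardware-optimized path: delegate to CMSIS-DSP.
    arm_dot_prod_f32(a, b, static_cast<uint32_t>(n), &result);
#else
    // Portable software path: works on any target.
    for (size_t i = 0; i < n; i++) {
        result += a[i] * b[i];
    }
#endif
    return result;
}
```

Under this pattern, supporting a new target would mean adding another guarded branch next to the existing ones while keeping the software path as the default, so callers never need to know which implementation was compiled in.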