Enable inlining for FFT #40

Open
sparker-arm opened this issue Oct 21, 2020 · 0 comments

The C implementation could be improved considerably if the compiler were able to inline the complex_* functions into the loops. Could the implementations in common/complex_simple.c be moved to fft.c? As well as removing the calls, inlining enables vectorization too (at least with clang), and I see a 17% speedup on my Ryzen box. I'd like to use this suite for benchmarking WASM, with particular interest in auto-vectorized SIMD code, so it would be great if the suite were friendlier to the traditional compiler optimisations that help with this.
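
To illustrate the kind of change being asked for, here is a minimal sketch, not the suite's actual code. The type `cplx`, the helpers `cplx_add`, `cplx_sub`, `cplx_mul`, and the function `butterfly_pass` are all hypothetical names; the real `complex_*` signatures in common/complex_simple.c may differ. The point is only that once the helpers are `static inline` in the same translation unit as the loop, the compiler can see their bodies, drop the calls, and attempt to vectorize the loop.

```c
#include <stddef.h>

/* Hypothetical complex type; the suite's real definition may differ. */
typedef struct { float re, im; } cplx;

/* Helpers defined in the same translation unit as the loop that uses them,
 * so the compiler can inline them without needing link-time optimisation. */
static inline cplx cplx_add(cplx a, cplx b) {
    return (cplx){ a.re + b.re, a.im + b.im };
}

static inline cplx cplx_sub(cplx a, cplx b) {
    return (cplx){ a.re - b.re, a.im - b.im };
}

static inline cplx cplx_mul(cplx a, cplx b) {
    return (cplx){ a.re * b.re - a.im * b.im,
                   a.re * b.im + a.im * b.re };
}

/* One radix-2 butterfly pass over `half` element pairs (illustrative only).
 * `restrict` tells the compiler the arrays do not overlap, which also
 * helps the vectorizer. */
void butterfly_pass(cplx *restrict lo, cplx *restrict hi,
                    const cplx *restrict tw, size_t half) {
    for (size_t k = 0; k < half; k++) {
        cplx t = cplx_mul(hi[k], tw[k]); /* twiddle multiply, inlined */
        cplx a = lo[k];
        lo[k] = cplx_add(a, t);
        hi[k] = cplx_sub(a, t);
    }
}
```

Whether a given compiler actually vectorizes this loop depends on the optimisation level and target, so it is worth checking with the compiler's vectorization report (for clang, `-Rpass=loop-vectorize`).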
