Subgroup Level Bluestein Algorithm #145

Open
wants to merge 23 commits into
base: main
cef2ad0
subgroup level batch_interleaved and packed bluestein algorithm working
AD2605 Mar 8, 2024
8cc05db
resolve warnings and add backward and multi-dim subgroup level small …
AD2605 Mar 8, 2024
5237a37
further changes
AD2605 Mar 9, 2024
5292f41
modifier offset calculation bugfix
AD2605 Mar 11, 2024
a3aa5d6
initialize local memory with zeros to avoid nans
AD2605 Mar 12, 2024
5b89d10
format
AD2605 Mar 12, 2024
b0d1661
not copy in between an aligned pointer
AD2605 Mar 12, 2024
5afe834
prevent OOB read/writes in packed format
AD2605 Mar 12, 2024
8a83350
prevent OOB read writes in PACKED bluestein condition
AD2605 Mar 13, 2024
eb3291f
Merge remote-tracking branch 'origin/main' into atharva/sg_wg_bluestein
AD2605 Mar 13, 2024
011e780
refactor
AD2605 Mar 13, 2024
8c3b40b
bugfix after refactor
AD2605 Mar 13, 2024
20e3c78
doxygens and logging, and lower tolerance value
AD2605 Mar 14, 2024
ae929d3
remove unused shuffle_transpose function
AD2605 Mar 14, 2024
708893c
remove unused bluestein header from workgroup_dispatcher.hpp
AD2605 Mar 14, 2024
594d224
remove unused headers from subgroup.hpp
AD2605 Mar 14, 2024
5f1ab4d
add missing array header
AD2605 Mar 14, 2024
99d8cfb
slightly bump tolerance value for tests to pass on Nvidia
AD2605 Mar 14, 2024
243c793
format
AD2605 Mar 14, 2024
3a77953
review comments 1
AD2605 Mar 18, 2024
cfd2ab8
updated doxygens
AD2605 Mar 18, 2024
d0e705d
addressed missed comments from first round of review
AD2605 Mar 18, 2024
823b84f
prevent OOB read from global memory
AD2605 Mar 19, 2024
remove unused bluestein header from workgroup_dispatcher.hpp
AD2605 committed Mar 14, 2024

commit 708893cdce75d288a5cbb8f8e66210c85f9c051f
1 change: 0 additions & 1 deletion src/portfft/dispatcher/workgroup_dispatcher.hpp
@@ -21,7 +21,6 @@
 #ifndef PORTFFT_DISPATCHER_WORKGROUP_DISPATCHER_HPP
 #define PORTFFT_DISPATCHER_WORKGROUP_DISPATCHER_HPP
 
-#include "portfft/common/bluestein.hpp"
 #include "portfft/common/helpers.hpp"
 #include "portfft/common/logging.hpp"
 #include "portfft/common/memory_views.hpp"