Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CQRRP (blocked CQRRPT) #58

Merged
merged 279 commits into from
Nov 17, 2023
Merged
Show file tree
Hide file tree
Changes from 250 commits
Commits
Show all changes
279 commits
Select commit Hold shift + click to select a range
6df27c6
Updated col_swap
TeachRaccooon Aug 7, 2023
cadeec1
Q is now stored in A in a packed orhr_col format. Code is not cleaned up
TeachRaccooon Aug 8, 2023
f38f65b
R is stored in A now
TeachRaccooon Aug 8, 2023
618a63b
Update
TeachRaccooon Aug 8, 2023
bba4b28
Saving space by using a portion of A as workspace
TeachRaccooon Aug 8, 2023
ebc04a6
Update just in case
TeachRaccooon Aug 8, 2023
2a1c697
Refactored & commented the code, everything seems to be in a decent s…
TeachRaccooon Aug 8, 2023
8dad5e3
Timing update
TeachRaccooon Aug 8, 2023
24eaad1
Instability or bug is exposed when larger sizes are used
TeachRaccooon Aug 8, 2023
5b40a54
Bug fix, need to perform get_U when updating A_sk
TeachRaccooon Aug 8, 2023
098d649
Update
TeachRaccooon Aug 9, 2023
5585b63
This exposes the fact that our Q representation is good as is. T has …
TeachRaccooon Aug 9, 2023
0de97ea
Update before output format change.
TeachRaccooon Aug 9, 2023
08b99c9
Changed output format & tests
TeachRaccooon Aug 9, 2023
520cbe8
For a large size and sampling factor != 1, there is a bug. Check if i…
TeachRaccooon Aug 9, 2023
21e6b62
Isolated a small example with abug
TeachRaccooon Aug 9, 2023
ae76051
Bugs fixed, need to be extra cautious about updating A_sk
TeachRaccooon Aug 9, 2023
036ae03
This exposes the fact that we do not need to re-compute the full R fa…
TeachRaccooon Aug 9, 2023
98790b5
Optimizing R factor from Cholesky QR.
TeachRaccooon Aug 9, 2023
5b37d29
Added an implementation of QR speed benchmark for CQRRP
TeachRaccooon Aug 9, 2023
5d76588
Benchmark bug fix
TeachRaccooon Aug 9, 2023
99caa16
Update. Benchmarking infastructure is now ready. Flop conversion sand…
TeachRaccooon Aug 10, 2023
f39dd6c
Update before changing hqrrp
TeachRaccooon Aug 10, 2023
ed9cb61
Altered HQRRP to incorporate Riley's latest changes.
TeachRaccooon Aug 10, 2023
75e24cc
Added testing for HQRRP
TeachRaccooon Aug 10, 2023
fa2049e
Update before adding cholqr into hqrrp
TeachRaccooon Aug 10, 2023
3d51d49
Added functions required for cholqr to hqrrp
TeachRaccooon Aug 10, 2023
bbc4aae
Update before i start debugging HQRRP
TeachRaccooon Aug 11, 2023
0b01f3a
Added Cholesky QR option into HQRRP. No need to have direct wrappers …
TeachRaccooon Aug 11, 2023
ad611e2
Updated benchmark (matlab updated separately)
TeachRaccooon Aug 11, 2023
31e7002
Optimized storage in benchmark
TeachRaccooon Aug 11, 2023
ccc8690
Added a benchmark for comparing geqrf and cholqr directly
TeachRaccooon Aug 11, 2023
903e0d9
Updated benchmarks
TeachRaccooon Aug 11, 2023
1927d3c
Added a benchmark for blocks per time in CQRRPt and HQRRP
TeachRaccooon Aug 11, 2023
e5810d3
Benchmark update
TeachRaccooon Aug 11, 2023
3be75ec
Benchmark update
TeachRaccooon Aug 11, 2023
cb7cd13
Bug fix
TeachRaccooon Aug 11, 2023
046db6d
buf gix
TeachRaccooon Aug 11, 2023
c2bb090
Update
TeachRaccooon Aug 11, 2023
89dc53c
Update
TeachRaccooon Aug 11, 2023
710df6c
Update
TeachRaccooon Aug 11, 2023
91f30e8
Got rid of the memory re-allocation in HQRRP's CHOLQR
TeachRaccooon Aug 11, 2023
5753d6d
UPDATE
TeachRaccooon Aug 11, 2023
08ccc2d
UPDATE
TeachRaccooon Aug 11, 2023
9eb009e
Benchmark update
TeachRaccooon Aug 11, 2023
f7f2b6b
Benchmark update
TeachRaccooon Aug 11, 2023
aa71967
Benchmark update
TeachRaccooon Aug 11, 2023
54246aa
Commented out hqrrp timing
TeachRaccooon Aug 11, 2023
6a6c6d2
Benchmark update
TeachRaccooon Aug 11, 2023
96fb227
Update
TeachRaccooon Aug 12, 2023
6424d48
Update
TeachRaccooon Aug 12, 2023
1965596
Update
TeachRaccooon Aug 12, 2023
79e79a1
Update
TeachRaccooon Aug 12, 2023
96c1310
Update
TeachRaccooon Aug 12, 2023
33cd598
Temporary Update
TeachRaccooon Aug 12, 2023
4546448
Temporary Update
TeachRaccooon Aug 12, 2023
09e8e9d
Temporary Update
TeachRaccooon Aug 12, 2023
6441aee
Temporary Update
TeachRaccooon Aug 12, 2023
ae20f20
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
cc5efb4
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
3cf2bb3
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
144b770
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
f84e90f
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
67a7cd0
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
2ce73c6
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
a1bf493
Verbose HQRRP timing
TeachRaccooon Aug 12, 2023
ef7f8e2
Update
TeachRaccooon Aug 12, 2023
9b2c9f1
Update
TeachRaccooon Aug 12, 2023
2ee390d
Maksing sure the same seeds are used
TeachRaccooon Aug 12, 2023
0dcd862
Update
TeachRaccooon Aug 12, 2023
9178b18
Update
TeachRaccooon Aug 12, 2023
ae8f8dd
Update
TeachRaccooon Aug 12, 2023
2f3700b
Update
TeachRaccooon Aug 12, 2023
486ea9e
Update
TeachRaccooon Aug 12, 2023
b27bbf8
Update
TeachRaccooon Aug 12, 2023
7583f3d
Evidentally, there is a memory leak in HQRRP? The buffer ldim_W conta…
TeachRaccooon Aug 12, 2023
6d6ee5e
Switched all mallocs to callocs to zero-out the used memory
TeachRaccooon Aug 13, 2023
abe110b
Trying to test things with just HQRRP+cholqr
TeachRaccooon Aug 13, 2023
7a6973c
Return to normal state
TeachRaccooon Aug 13, 2023
e105850
Trying to see if performing Cholesky QR on fake data followed by perf…
TeachRaccooon Aug 14, 2023
d146104
Commented out
TeachRaccooon Aug 14, 2023
997ac56
Trying something
TeachRaccooon Aug 14, 2023
54a0cdf
Trying something
TeachRaccooon Aug 14, 2023
a443cf8
Trying something
TeachRaccooon Aug 14, 2023
907ecd3
Trying something
TeachRaccooon Aug 14, 2023
d7451ff
Trying something
TeachRaccooon Aug 14, 2023
12dd91f
Trying something
TeachRaccooon Aug 14, 2023
26fe685
Trying something
TeachRaccooon Aug 14, 2023
0782a0d
Trying something
TeachRaccooon Aug 14, 2023
d4f154d
Trying something
TeachRaccooon Aug 14, 2023
985bb7f
A input into cholqr is all nans
TeachRaccooon Aug 14, 2023
82e56f2
Update
TeachRaccooon Aug 14, 2023
baf5c6d
Update
TeachRaccooon Aug 14, 2023
6965999
Update
TeachRaccooon Aug 14, 2023
0504990
Update
TeachRaccooon Aug 14, 2023
3a46d75
Update
TeachRaccooon Aug 14, 2023
53ff7f6
Update
TeachRaccooon Aug 14, 2023
b5719df
Update
TeachRaccooon Aug 14, 2023
db9dca0
Issue was related to not using the proper version of randblas. FIxed.
TeachRaccooon Aug 14, 2023
177175c
Testing various block sizes
TeachRaccooon Aug 14, 2023
a393948
Testing various block sizes
TeachRaccooon Aug 14, 2023
fec09b8
Adding HQRRP into CQRRP
TeachRaccooon Aug 15, 2023
93c66f2
Update
TeachRaccooon Aug 15, 2023
d7e2ab2
Almost fully switched CQRRP to work with pointers
TeachRaccooon Aug 15, 2023
b8921bf
Benchmark Update
TeachRaccooon Aug 15, 2023
78fcfba
Update
TeachRaccooon Aug 15, 2023
bac134d
Update
TeachRaccooon Aug 15, 2023
8666ea4
Update before adding an object to CQRRP
TeachRaccooon Aug 15, 2023
4bda9cd
Update
TeachRaccooon Aug 15, 2023
abc063c
Update
TeachRaccooon Aug 15, 2023
f0c1618
Update
TeachRaccooon Aug 15, 2023
7aa9d15
Pushing a benchmark that is supposed to show optimal alg for pivoted QR
TeachRaccooon Aug 15, 2023
15a6c5d
Pushing a benchmark that is supposed to show optimal alg for pivoted QR
TeachRaccooon Aug 15, 2023
2b6ac10
Pushing a benchmark that is supposed to show optimal alg for pivoted QR
TeachRaccooon Aug 15, 2023
28e15da
Pushing a benchmark that is supposed to show optimal alg for pivoted QR
TeachRaccooon Aug 15, 2023
11ca8c6
Short vs tall bug fix
TeachRaccooon Aug 15, 2023
d5d7e72
ShUpdate
TeachRaccooon Aug 15, 2023
4f40fc8
Bug fix
TeachRaccooon Aug 16, 2023
4c61ec2
Bug fix
TeachRaccooon Aug 16, 2023
b08a8df
Update
TeachRaccooon Aug 16, 2023
f7baefd
Bug fix
TeachRaccooon Aug 17, 2023
0aaab30
Update
TeachRaccooon Aug 17, 2023
af23d1a
Update
TeachRaccooon Aug 17, 2023
e1930cb
Update
TeachRaccooon Aug 17, 2023
d936aa7
Update
TeachRaccooon Aug 17, 2023
e19db6b
Update
TeachRaccooon Aug 17, 2023
bafffb3
Update
TeachRaccooon Aug 17, 2023
f67cf3f
Update
TeachRaccooon Aug 17, 2023
803bbb5
Update
TeachRaccooon Aug 17, 2023
7b97c12
Update
TeachRaccooon Aug 17, 2023
65942ea
Update
TeachRaccooon Aug 17, 2023
53c0da0
Update
TeachRaccooon Aug 17, 2023
a296bf4
Update
TeachRaccooon Aug 17, 2023
9e3ec25
Update
TeachRaccooon Aug 17, 2023
031a95e
Update
TeachRaccooon Aug 17, 2023
d6747b3
Update
TeachRaccooon Aug 17, 2023
5f89ea7
Update
TeachRaccooon Aug 17, 2023
e21d460
Update
TeachRaccooon Aug 17, 2023
e09119b
Update
TeachRaccooon Aug 17, 2023
1e30ea6
Update
TeachRaccooon Aug 17, 2023
6b6760d
Performing actual benchmarking now
TeachRaccooon Aug 17, 2023
40e9c07
bug fix
TeachRaccooon Aug 17, 2023
4dc55f2
Update
TeachRaccooon Aug 17, 2023
c56c58e
Update
TeachRaccooon Aug 17, 2023
b758c18
New benchmark
TeachRaccooon Aug 17, 2023
ce185ee
CQRRP timing update
TeachRaccooon Aug 17, 2023
8fb4f37
CQRRP timing update
TeachRaccooon Aug 17, 2023
8f280c2
Timing update
TeachRaccooon Aug 17, 2023
ce8dfe5
Timing update
TeachRaccooon Aug 17, 2023
6036364
Timing update
TeachRaccooon Aug 17, 2023
975ff22
TiTiming check
TeachRaccooon Aug 17, 2023
ebb7681
Attempt
TeachRaccooon Aug 17, 2023
9a5a1cf
Update
TeachRaccooon Aug 18, 2023
f9843a3
Added accuracy benchmark
TeachRaccooon Aug 21, 2023
d11b2d0
Added accuracy benchmark
TeachRaccooon Aug 21, 2023
6936aae
Update
TeachRaccooon Aug 21, 2023
648d928
Update
TeachRaccooon Aug 22, 2023
01a5df4
Update
TeachRaccooon Aug 22, 2023
a9cd10b
Update
TeachRaccooon Aug 22, 2023
c8f4735
Update
TeachRaccooon Aug 22, 2023
de38102
Update
TeachRaccooon Aug 22, 2023
f4088d2
Update
TeachRaccooon Aug 22, 2023
f2d6b5c
Update
TeachRaccooon Aug 22, 2023
54c1125
Update
TeachRaccooon Aug 22, 2023
1a64acc
Update
TeachRaccooon Aug 22, 2023
91273b2
Update
TeachRaccooon Aug 22, 2023
b339ac2
Update
TeachRaccooon Aug 22, 2023
9bdd3eb
Update
TeachRaccooon Aug 22, 2023
caaeac6
Update
TeachRaccooon Aug 22, 2023
d64622a
Update
TeachRaccooon Aug 22, 2023
2b1ee18
Update
TeachRaccooon Aug 22, 2023
44b1ce0
Update
TeachRaccooon Aug 22, 2023
c2d2a40
Update
TeachRaccooon Aug 22, 2023
86c7805
Update
TeachRaccooon Aug 22, 2023
13c5d75
Update
TeachRaccooon Aug 22, 2023
bab776e
Update
TeachRaccooon Aug 22, 2023
27ccdf6
Update
TeachRaccooon Aug 22, 2023
9055c5f
Update
TeachRaccooon Aug 22, 2023
1289280
Update
TeachRaccooon Aug 22, 2023
b28e2fc
Update
TeachRaccooon Aug 22, 2023
19c9fc1
Found an issue with calling QRCP with CQRRP - in short, we need an ld…
TeachRaccooon Aug 22, 2023
ec33104
This commit depicts an issue, but idk what's going on
TeachRaccooon Aug 22, 2023
9fb3dd5
Fixed version of the prev commit
TeachRaccooon Aug 22, 2023
d074a75
THis shows that any recursive usage of CQRRP is bad
TeachRaccooon Aug 22, 2023
4de9275
Update
TeachRaccooon Aug 22, 2023
2d36ee0
Continuing bug investigation
TeachRaccooon Aug 22, 2023
584cfcf
Nans occur when we are performing trsm for computing A_pre.
TeachRaccooon Aug 24, 2023
b404e6f
At some point, SASO results in a whole row of zeros
TeachRaccooon Aug 24, 2023
4af3b03
Looks like we have a RandBLAS issue.
TeachRaccooon Aug 24, 2023
360489f
Just realized that d_factor had a wrong data type. Prev situation was…
TeachRaccooon Aug 28, 2023
9a44e70
Issue was only present for d = 1 (rank drop). Fixed.
TeachRaccooon Aug 28, 2023
0db2e56
Cleanup
TeachRaccooon Aug 28, 2023
ebb551e
bug fix
TeachRaccooon Aug 28, 2023
cc43f64
Update before pivoting logic change
TeachRaccooon Aug 29, 2023
1332bf2
Bug happening at specific matrix sizes.
TeachRaccooon Aug 29, 2023
928e9e8
Fixed bug with using improper leading dimension when block size has b…
TeachRaccooon Aug 29, 2023
b3e47d6
This is weird
TeachRaccooon Aug 29, 2023
58e615f
Finally resolved pivots issue in QRCP via pivoted LU + GEQRF.
TeachRaccooon Aug 30, 2023
1d73d73
Update
TeachRaccooon Aug 30, 2023
1764074
Update
TeachRaccooon Aug 30, 2023
ee9db5a
Update
TeachRaccooon Aug 30, 2023
461110c
Update
TeachRaccooon Aug 30, 2023
40e5bf8
Update
TeachRaccooon Aug 30, 2023
110cccf
Added various version of QRCP again
TeachRaccooon Aug 30, 2023
4537346
Update
TeachRaccooon Aug 30, 2023
e41d8a1
Testing polynomial
TeachRaccooon Aug 30, 2023
3573c58
Testing polynomial
TeachRaccooon Aug 30, 2023
66ee0f5
Testing polynomial
TeachRaccooon Aug 30, 2023
b73cc43
Added Kahan matrix generator
TeachRaccooon Aug 30, 2023
8210191
Trying Step
TeachRaccooon Aug 30, 2023
33eb211
Trying Kahan
TeachRaccooon Aug 30, 2023
e614c33
Testing smaller data against geqp3
TeachRaccooon Aug 31, 2023
cf0aee3
Testing smaller data against geqp3
TeachRaccooon Aug 31, 2023
31585b5
Kahan fix
TeachRaccooon Aug 31, 2023
2c7285d
Kahan fix
TeachRaccooon Aug 31, 2023
6030612
Update
TeachRaccooon Aug 31, 2023
3df3a60
Update
TeachRaccooon Aug 31, 2023
18eea0f
Update
TeachRaccooon Aug 31, 2023
6e6cf4b
Update
TeachRaccooon Aug 31, 2023
da8364b
Cleanup
TeachRaccooon Aug 31, 2023
c7fbc4b
Updated pivoting
TeachRaccooon Aug 31, 2023
b678ba7
Let's see GEQP3's business card
TeachRaccooon Sep 1, 2023
1cb33b5
Let's see GEQP3's business card
TeachRaccooon Sep 1, 2023
3394ef6
Added comments
TeachRaccooon Sep 27, 2023
1b8aa8f
Added comments
TeachRaccooon Sep 27, 2023
ea50c3d
Hopefully, this updates RandBLAS submodule version
TeachRaccooon Sep 27, 2023
7b022b4
Merge branch 'main' into cqrrp-blocked
TeachRaccooon Sep 27, 2023
0221298
Update per RandBLAS change
TeachRaccooon Sep 27, 2023
08be1b0
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
134734a
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
40eb4f3
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
e733bb2
Update core-macos.yaml
TeachRaccooon Sep 29, 2023
be383f8
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
385ab44
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
89b91e2
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
5950d81
Update core-linux.yaml
TeachRaccooon Sep 29, 2023
f03ac32
Getting rid of mem leaks
TeachRaccooon Sep 29, 2023
75c2338
Update
TeachRaccooon Sep 29, 2023
9e98187
Mem leak fix after rebase.
TeachRaccooon Sep 29, 2023
0365652
Reverted changes that were accidentally added from an experimental br…
TeachRaccooon Sep 29, 2023
cd60fdf
Proper space deallocation
TeachRaccooon Sep 29, 2023
159aa5c
Update core-macos.yaml
TeachRaccooon Sep 29, 2023
bd6f159
Adding protection for using outdated Accelerate
TeachRaccooon Sep 29, 2023
5ee293d
Adding protection for using outdated Accelerate
TeachRaccooon Sep 29, 2023
d75d8e0
Adding protection for using outdated Accelerate
TeachRaccooon Sep 29, 2023
60fa48a
Adding protection for using outdated Accelerate
TeachRaccooon Sep 29, 2023
41d4ca7
Adding protection for using outdated Accelerate
TeachRaccooon Sep 29, 2023
7e63f8f
Update per Riley's comments
TeachRaccooon Nov 17, 2023
e8302d8
LAPACK issue
TeachRaccooon Nov 17, 2023
5785ee9
LAPACK issue
TeachRaccooon Nov 17, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 10 additions & 6 deletions .github/workflows/core-linux.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,11 @@ jobs:
export DEBIAN_FRONTEND="noninteractive"
export TZ="America/Los_Angeles"
sudo apt-get update -qq
sudo apt-get install -qq -y git-core gcc g++ gfortran cmake subversion automake m4 libgtest-dev libmkl-dev libopenblas-openmp-dev
sudo apt-get install -qq -y git-core gcc g++ gfortran cmake subversion automake m4 libgtest-dev libopenblas-openmp-dev
wget -O- https://apt.repos.intel.com/intel-gpg-keys/GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB \
| gpg --dearmor | sudo tee /usr/share/keyrings/oneapi-archive-keyring.gpg > /dev/null
echo "deb [signed-by=/usr/share/keyrings/oneapi-archive-keyring.gpg] https://apt.repos.intel.com/oneapi all main" | sudo tee /etc/apt/sources.list.d/oneAPI.list
sudo apt update

- name: install BLAS++
run: |
Expand All @@ -53,7 +57,7 @@ jobs:
mkdir lapackpp-build
cd lapackpp-build
cmake -DCMAKE_BUILD_TYPE=Debug \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-DCMAKE_INSTALL_PREFIX=`pwd`/../lapackpp-install \
-DCMAKE_BINARY_DIR=`pwd` \
-Dbuild_tests=OFF \
Expand All @@ -66,8 +70,8 @@ jobs:
mkdir RandLAPACK-build
cd RandLAPACK-build
cmake -DCMAKE_BUILD_TYPE=Release \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/lapackpp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/cmake/lapackpp \
-DRandom123_DIR=`pwd`/../Random123-install/include/ \
-DCMAKE_INSTALL_PREFIX=`pwd`/../RandLAPACK-install \
`pwd`/../RandLAPACK
Expand All @@ -92,8 +96,8 @@ jobs:
cd RandLAPACK-build-asan
cmake -DCMAKE_BUILD_TYPE=Debug \
-DSANITIZE_ADDRESS=ON \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/lapackpp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/cmake/lapackpp \
-DRandom123_DIR=`pwd`/../Random123-install/include/ \
-DCMAKE_INSTALL_PREFIX=`pwd`/../RandLAPACK-install-asan \
`pwd`/../RandLAPACK
Expand Down
10 changes: 5 additions & 5 deletions .github/workflows/core-macos.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ jobs:
mkdir lapackpp-build
cd lapackpp-build
cmake -DCMAKE_BUILD_TYPE=Debug \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-DCMAKE_INSTALL_PREFIX=`pwd`/../lapackpp-install \
-DCMAKE_BINARY_DIR=`pwd` \
-Dbuild_tests=OFF \
Expand All @@ -63,8 +63,8 @@ jobs:
mkdir RandLAPACK-build
cd RandLAPACK-build
cmake -DCMAKE_BUILD_TYPE=Release \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/lapackpp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/cmake/lapackpp \
-DRandom123_DIR=`pwd`/../Random123-install/include/ \
-DCMAKE_INSTALL_PREFIX=`pwd`/../RandLAPACK-install \
`pwd`/../RandLAPACK
Expand All @@ -89,8 +89,8 @@ jobs:
cd RandLAPACK-build-asan
cmake -DCMAKE_BUILD_TYPE=Debug \
-DSANITIZE_ADDRESS=ON \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/lapackpp \
-Dblaspp_DIR=`pwd`/../blaspp-install/lib/cmake/blaspp \
-Dlapackpp_DIR=`pwd`/../lapackpp-install/lib/cmake/lapackpp \
-DRandom123_DIR=`pwd`/../Random123-install/include/ \
-DCMAKE_INSTALL_PREFIX=`pwd`/../RandLAPACK-install-asan \
`pwd`/../RandLAPACK
Expand Down
1 change: 1 addition & 0 deletions RandLAPACK.hh
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@
// Drivers
#include "RandLAPACK/drivers/rl_rsvd.hh"
#include "RandLAPACK/drivers/rl_cqrrpt.hh"
#include "RandLAPACK/drivers/rl_cqrrp.hh"
#include "RandLAPACK/drivers/rl_revd2.hh"

#endif
1 change: 1 addition & 0 deletions RandLAPACK/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@
set(RandLAPACK_cxx_sources
rl_lapackpp.hh
rl_cqrrpt.hh
rl_cqrrp.hh
rl_rsvd.hh
rl_revd2.hh
rl_qb.hh
Expand Down
8 changes: 4 additions & 4 deletions RandLAPACK/comps/rl_rs.hh
Original file line number Diff line number Diff line change
Expand Up @@ -139,12 +139,12 @@ int RS<T, RNG>::call(

if (p % 2 == 0) {
// Fill n by k Omega
RandBLAS::DenseDist D{.n_rows = n, .n_cols = k};
state = RandBLAS::fill_dense(D, Omega_dat, state);
RandBLAS::DenseDist D(n, k);
state = RandBLAS::fill_dense(D, Omega_dat, state).second;
} else {
// Fill m by k Omega_1
RandBLAS::DenseDist D{.n_rows = m, .n_cols = k};
state = RandBLAS::fill_dense(D, Omega_1_dat, state);
RandBLAS::DenseDist D(m, k);
state = RandBLAS::fill_dense(D, Omega_1_dat, state).second;

// multiply A' by Omega results in n by k omega
blas::gemm(Layout::ColMajor, Op::Trans, Op::NoTrans, n, k, m, 1.0, A_dat, m, Omega_1_dat, m, 0.0, Omega_dat, n);
Expand Down
4 changes: 2 additions & 2 deletions RandLAPACK/comps/rl_syps.hh
Original file line number Diff line number Diff line change
Expand Up @@ -140,8 +140,8 @@ int SYPS<T, RNG>::call(
bool callers_skop_buff = skop_buff != nullptr;
if (!callers_skop_buff)
skop_buff = new T[m * k];
RandBLAS::DenseDist D{m, k};
state = RandBLAS::fill_dense(D, skop_buff, state);
RandBLAS::DenseDist D(m, k);
state = RandBLAS::fill_dense(D, skop_buff, state).second;

bool callers_work_buff = work_buff != nullptr;
if (!callers_work_buff)
Expand Down
Loading
Loading