This repository was archived by the owner on Sep 25, 2023. It is now read-only.
This repository was archived by the owner on Sep 25, 2023. It is now read-only.
[QST] firfilter performance on Jetson #416
Open
Description
I'm running benchmarking tests on a Jetson Nano 4GB board, and am testing a number of filter algorithms. The performance of each (for N number of taps) is shown below.
firfilter is tested as follows:
b_cp = cp.asarray(b)
x_cp = cp.asarray(x)
# warm up
firfilter_result = firfilter(b_cp, x_cp, axis=0)
start = time.time()
for i in range(20):
firfilter_result = firfilter(b_cp, x_cp, axis=0) # x is a 2D array
stop = time.time()
tot_time = (stop - start) / 20
csig_time.append(tot_time)
In contrast, running the same code on a GeForce GTX 1050 Ti yields the following:
In this case, firfilter (GPU) performance exceeds CPU performance for the other algorithms, which is expected.
However, we do not expectfirfilter
to be significantly slower than the other algorithms on Jetson . What potential performance optimizations could help solve this problem?
Metadata
Metadata
Assignees
Type
Projects
Status
No status