Skip to content
This repository was archived by the owner on Sep 25, 2023. It is now read-only.
This repository was archived by the owner on Sep 25, 2023. It is now read-only.

[QST] firfilter performance on Jetson #416

Open
@gpeyton

Description

@gpeyton

I'm running benchmarking tests on a Jetson Nano 4GB board, and am testing a number of filter algorithms. The performance of each (for N number of taps) is shown below.

Screen Shot 2021-08-19 at 10 49 42 AM

firfilter is tested as follows:

    b_cp = cp.asarray(b)
    x_cp = cp.asarray(x)
    
    # warm up
    firfilter_result = firfilter(b_cp, x_cp, axis=0) 

    start = time.time()
    for i in range(20):
        firfilter_result = firfilter(b_cp, x_cp, axis=0)   # x is a 2D array
    stop = time.time()
    tot_time = (stop - start) / 20
    csig_time.append(tot_time)

In contrast, running the same code on a GeForce GTX 1050 Ti yields the following:

55c053cf-b26d-427e-bf62-e1253b1c3a4c (1)

In this case, firfilter (GPU) performance exceeds CPU performance for the other algorithms, which is expected.

However, we do not expectfirfilter to be significantly slower than the other algorithms on Jetson . What potential performance optimizations could help solve this problem?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions