
Global load #25

Open
karloballa opened this issue Aug 23, 2019 · 2 comments

Comments

@karloballa

I want to optimize memory loads and tried several different instructions to see how it goes. So far, however, I have only had success with flat_load.

I would like to compile a kernel for RX 4xx/5xx, but if I use global_load_dword** I get "unknown instruction". I suppose there are some flags for that...

I also have a question about "buffer resources". How do I access them, and how do I use them? I tried several kernel setups but had no luck. Is there any table that explains what goes where? Who decides which information goes into which register (the driver?)? Information on the net is sparse and much of it is contradictory...

Thanks in advance

@matszpk
Member

matszpk commented Aug 23, 2019

The GLOBAL_* and SCRATCH_* instructions were introduced with the RX Vega GPU and are not available on Fiji/Polaris GPUs. Buffer resources were used by the old OpenCL implementation and on the first GCN GPU generation (Tahiti, Pitcairn, HD 7xxx). For the newer OpenCL drivers and newer GPUs, the FLAT_* instructions are the recommended way to access memory.
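To illustrate the difference, here is a minimal, untested sketch in CLRX-style GCN assembly (the register numbers are illustrative assumptions, not taken from any real kernel): the FLAT form takes a 64-bit address in a VGPR pair, while the BUFFER form goes through a 128-bit resource descriptor held in four consecutive SGPRs.

```asm
# FLAT addressing (recommended on current drivers):
# the full 64-bit address lives in a VGPR pair.
flat_load_dword v2, v[0:1]         # load dword from address in v[0:1]
s_waitcnt vmcnt(0) & lgkmcnt(0)    # wait for the load before using v2

# BUFFER addressing (old OpenCL implementation / first GCN generation):
# s[4:7] is assumed to hold a 128-bit buffer resource descriptor
# (base address, num_records, format flags); v0 is a byte offset.
buffer_load_dword v2, v0, s[4:7], 0 offen
s_waitcnt vmcnt(0)
```

The practical consequence is the one discussed above: on Fiji/Polaris the address arithmetic for FLAT loads costs extra VALU instructions to build the 64-bit address, whereas a buffer load can offload the base address into the SGPR descriptor, but the new drivers no longer set such descriptors up for you.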

@karloballa
Author

Thanks for the explanation.

It seems I misinterpreted the GCN generations. I thought that Polaris was GCN 1.4 and Vega GCN 1.5.

As for buffer_load, it looked to me as if I could save a few instructions with it. Now I understand why the compiler forces flat_load.
