
Activation memory estimation (in resource utilization) ignores layers where activation quantization is disabled #1293

Open · elad-c opened this issue on Dec 15, 2024 · 1 comment
Labels: enhancement (New feature or request)

elad-c (Collaborator) commented on Dec 15, 2024

Issue Type

Bug

Source

source

MCT Version

nightly

OS Platform and Distribution

No response

Python version

No response

Describe the issue

Currently, the sizes of unquantized activation tensors are ignored by the MaxTensor and MaxCut activation-memory estimations. These tensors need to be handled according to the quantization-preserving flag.
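A minimal sketch of the intended handling, assuming hypothetical node-config fields (the attribute names below are modeled loosely on MCT's node quantization config but are illustrative, not MCT's actual internals):

```python
# Illustrative sketch only: the config fields used here are hypothetical
# and do not reflect MCT's actual internal API.
def activation_tensor_bytes(num_elements: int, node_cfg) -> float:
    """Size a node's output activation for MaxTensor/MaxCut estimation."""
    if node_cfg.enable_activation_quantization:
        bits = node_cfg.activation_n_bits  # quantized bit-width
    elif node_cfg.quantization_preserving:
        # Quantization-preserving op: inherit the bit-width of its
        # quantized input (hypothetical field).
        bits = node_cfg.preserved_activation_n_bits
    else:
        # Unquantized activation stays in float; it must still be
        # counted rather than skipped, which is the bug reported here.
        bits = 32
    return num_elements * bits / 8
```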

Expected behaviour

No response

Code to reproduce the issue

The torch.expand node might trigger the issue (its enlarged output tensor size goes untreated).
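A minimal repro sketch, assuming the problem is an expand node whose enlarged output activation is not quantized; the model and shapes here are illustrative:

```python
import torch
import torch.nn as nn

class ExpandModel(nn.Module):
    # expand() returns a view without copying data, but the downstream
    # consumer still sees a (1, 16, 8, 8) activation, 16x the input size.
    # If that node's activation quantization is disabled, MaxTensor/MaxCut
    # currently leave it out of the activation memory estimate.
    def forward(self, x):
        return x.expand(-1, 16, -1, -1) + 1.0  # +1 materializes the expanded tensor

model = ExpandModel()
out = model(torch.randn(1, 1, 8, 8))
print(out.shape)  # torch.Size([1, 16, 8, 8])
```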

Log output

No response

haihabi (Collaborator) commented on Dec 15, 2024

@ofirgo and @elad-c, this should be handled using quantization preserving.

ofirgo added the enhancement (New feature or request) label on Jan 6, 2025
ofirgo changed the title from "Activation memory estimation (in resource utilization) ignores layers the activation quantization disabled" to "Activation memory estimation (in resource utilization) ignores layers where activation quantization is disabled" on Jan 21, 2025