
Why do multi-scale features partially share a convolution network via PhiPartiallyShared? #73

Open
sunset-clouds opened this issue Jun 18, 2024 · 1 comment

Comments

@sunset-clouds

VAR is indeed impressive, but one issue has been bothering me. I'm reaching out to the authors for help with it, and I appreciate your assistance.

In quant.py, line 33:

```python
self.quant_resi = PhiPartiallyShared(nn.ModuleList([
    (Phi(Cvae, quant_resi) if abs(quant_resi) > 1e-6 else nn.Identity())
    for _ in range(share_quant_resi)
]))
```
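For context, here is a minimal sketch of how such a partial-sharing lookup can work. The class and argument names mirror the repo, but the tick spacing and the identity-blend ratio are my illustrative assumptions, not the repo's exact code:

```python
import torch
import torch.nn as nn

class Phi(nn.Conv2d):
    """A 3x3 conv blended with the identity: phi(h) = (1 - a) * h + a * conv(h)."""
    def __init__(self, embed_dim: int, resi_ratio: float):
        super().__init__(embed_dim, embed_dim, kernel_size=3, padding=1)
        self.resi_ratio = abs(resi_ratio)

    def forward(self, h):
        return h.mul(1 - self.resi_ratio) + super().forward(h).mul(self.resi_ratio)

class PhiPartiallyShared(nn.Module):
    """Routes each scale (given as a relative position in [0, 1]) to the
    nearest of K shared Phi modules, so neighboring scales reuse one conv."""
    def __init__(self, phis: nn.ModuleList):
        super().__init__()
        self.phis = phis
        K = len(phis)
        # K evenly spaced anchor points ("ticks") over [0, 1]
        self.register_buffer('ticks', torch.linspace(1 / (2 * K), 1 - 1 / (2 * K), K))

    def __getitem__(self, rel_pos: float):
        return self.phis[int(torch.argmin((self.ticks - rel_pos).abs()))]

# 10 scales routed to 4 convs: neighboring scales resolve to the same Phi
shared = PhiPartiallyShared(nn.ModuleList([Phi(32, 0.5) for _ in range(4)]))
print([int(torch.argmin((shared.ticks - k / 9).abs())) for k in range(10)])
# -> [0, 0, 0, 1, 1, 2, 2, 3, 3, 3]
```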

According to my understanding, self.quant_resi is the $\phi_k(\cdot)$ function. There are 4 different $\phi_k(\cdot)$, and some scales share the same one, for example: $\phi_1(\cdot) = \phi_2(\cdot)$, $\phi_3(\cdot) = \phi_4(\cdot) = \phi_5(\cdot)$, $\phi_6(\cdot) = \phi_7(\cdot)$, $\phi_8(\cdot) = \phi_9(\cdot) = \phi_{10}(\cdot)$. I have two questions:

  1. Why do we need to introduce $\phi_k(\cdot)$ at all? It feels somewhat counterintuitive. In contrast, RQ-VAE uses $f = f - z_k$ rather than $f = f - \phi_k(z_k)$ (a sketch contrasting the two updates follows below). What is the true role of $\phi_k(\cdot)$?
  2. Why do different scales share the same $\phi_k(\cdot)$, e.g., $\phi_1(\cdot) = \phi_2(\cdot)$ and $\phi_3(\cdot) = \phi_4(\cdot) = \phi_5(\cdot)$?
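To make the comparison in question 1 concrete, here is a hedged sketch of the two residual updates as I understand them; `quantize` is a hypothetical stand-in for the nearest-codebook lookup, and the interpolation modes are illustrative:

```python
import torch.nn.functional as F

def rq_vae_step(f, quantize):
    # RQ-VAE residual update: quantize at full resolution, subtract the code directly
    z_k = quantize(f)
    return f - z_k

def var_step(f, quantize, phi_k, scale_hw):
    # VAR-style update (as I understand it): quantize a downscaled map,
    # upscale the code map back to full resolution, and let phi_k repair
    # the blur introduced by that down/up round trip before subtracting
    z_k = quantize(F.interpolate(f, size=scale_hw, mode='area'))
    z_up = F.interpolate(z_k, size=f.shape[-2:], mode='bicubic')
    return f - phi_k(z_up)
```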
@eyedealism

In the paper, it says this is to address the information loss in upscaling. My guess is that it acts like the decoder part of a UNet, producing a smoother map.
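A toy illustration of that reading (purely illustrative, not the paper's code): nearest-neighbor upscaling of a coarse token map yields piecewise-constant blocks with no high-frequency detail, and a small learned conv in the position of $\phi_k$ can blend those block edges:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

coarse = torch.randn(1, 32, 4, 4)                          # token map from an early scale
up = F.interpolate(coarse, size=(16, 16), mode='nearest')  # blocky 4x4 patches of repeated values
phi = nn.Conv2d(32, 32, kernel_size=3, padding=1)          # the role phi_k plays
smooth = phi(up)                                           # a learned conv can smooth the block artifacts
```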
