Frame-based training #36

Jay-IPL · 2023-01-12T20:16:10Z

Hi authors~ I noticed in your paper you mentioned you put the temporal dimension into the batch dimension and extract visual features independently. Thus, I think the whole model is frame-based training except when using video-swin as the backbone. Is my understanding right?

iseunghoon · 2025-02-08T15:46:27Z

Hi authors~ I noticed in your paper you mentioned you put the temporal dimension into the batch dimension and extract visual features independently. Thus, I think the whole model is frame-based training except when using video-swin as the backbone. Is my understanding right?

I agree with your point. Aside from Video Swin, there is no fusion along the temporal axis, right?"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Frame-based training #36

Frame-based training #36

Jay-IPL commented Jan 12, 2023

iseunghoon commented Feb 8, 2025

Frame-based training #36

Frame-based training #36

Comments

Jay-IPL commented Jan 12, 2023

iseunghoon commented Feb 8, 2025