- All the
config.yaml
in ourexp
are NOT the training config actually used, since some hyperparameters are changed in therun.sh
ortest.sh
. - #Frame = #input_frame x #crop x #clip
- #input_frame means how many frames are input for model per inference
- #crop means spatial crops (e.g., 3 for left/right/center)
- #clip means temporal clips (e.g., 4 means repeted sampling four clips with different start indices)
Model | Frame | Model | Shell | Config |
---|---|---|---|---|
UniFormerV2-B/16 | 8 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | 8 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | 8 | ckpt | run.sh | config.yaml |
Frozen UniFormerV2-L/14@336 | 8 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M | 8x3x4 | 84.4 | ckpt | run.sh | config.yaml |
UniFormerV2-B/16 | CLIP-400M+K710 | 8x3x4 | 85.6 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 8x3x4 | 88.8 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 16x3x4 | 89.1 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 32x3x2 | 89.3 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x2 | 89.7 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 64x3x2 | 90.0 | ckpt | run.sh | config.yaml |
Frozen Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-L/14@336 | CLIP-400M | 8x1x3 | 86.7 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 8x1x3 | 87.8 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x4 | 88.9 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M | 8x3x4 | 85.0 | ckpt | run.sh | config.yaml |
UniFormerV2-B/16 | CLIP-400M+K710 | 8x3x4 | 86.1 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 8x3x4 | 89.0 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 16x3x4 | 89.4 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 32x3x2 | 89.5 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x2 | 89.9 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 64x3x2 | 90.1 | ckpt | run.sh | config.yaml |
Frozen Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-L/14@336 | CLIP-400M | 8x1x3 | 87.4 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 8x1x3 | 88.2 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x4 | 89.2 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M | 8x3x4 | 75.8 | ckpt | run.sh | config.yaml |
UniFormerV2-B/16 | CLIP-400M+K710 | 8x3x4 | 76.3 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 8x3x4 | 80.8 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 16x3x4 | 81.2 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710 | 32x3x2 | 81.5 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x2 | 82.1 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 64x3x2 | 82.7 | ckpt | run.sh | config.yaml |
Frozen Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-L/14@336 | CLIP-400M | 8x1x3 | 79.6 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 8x1x3 | 79.7 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710 | 32x3x4 | 80.8 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M+K710+K400 | 8x3x4 | 42.6 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710+K400 | 8x3x4 | 47.0 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14@336 | CLIP-400M+K710+K400 | 8x3x4 | 47.8 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M | 16x3x1 | 56.8 | ckpt | run.sh | config.yaml |
UniFormerV2-B/16 | CLIP-400M | 32x3x1 | 59.4 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M | 16x3x1 | 60.5 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M | 32x3x1 | 62.7 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-B/16 | CLIP-400M | 16x3x1 | 69.5 | ckpt | run.sh | config.yaml |
UniFormerV2-B/16 | CLIP-400M | 32x3x1 | 70.7 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M | 16x3x1 | 72.1 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M | 32x3x1 | 73.0 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-L/14 | CLIP-400M+K710+K400 | 16x3x10 | 94.3 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710+K400 | 32x3x10 | 94.7 | ckpt | run.sh | config.yaml |
Model | Pretraining | #Frame | Top-1 | Model | Shell | Config |
---|---|---|---|---|---|---|
UniFormerV2-L/14 | CLIP-400M+K710+K400 | 16x3x10 | 95.5 | ckpt | run.sh | config.yaml |
UniFormerV2-L/14 | CLIP-400M+K710+K400 | 32x3x10 | 95.4 | ckpt | run.sh | config.yaml |