HSViT: Horizontally Scalable Vision Transformer

Prerequisites

Ubuntu 22.04
Python 3.10 with pip

Install dependencies:

$ pip3 install -r requirements.txt

Model Training

Train HSViT

Train on CIFAR-10:

$ python3 main.py fit --config=config/hsvit_cifar10.yaml

Test on CIFAR-10:

$ python3 main.py test --config=config/hsvit_cifar10.yaml --ckpt_path="lightning_logs/version_0/checkpoints/epoch=0-step=38035.ckpt"

Train SOTA

Train ResNet18 on CIFAR-10:

$ python3 main.py fit --config=config/sota_cifar10.yaml --model.init_args.sota_config_path="./config/huggingface/resnet-18.json"

Train HSViT with existing CNN backbones

Download Hugginface ResNet50 model (pytorch_model.bin and config.json) to ./huggingface/ folder. Then, train HSViT + ResNet50 on ImageNet-1k:

$ python3 main.py fit --config=config/hsvit_het_imagenet.yaml

View Logs

Start a TensorBoard service based on directory of lightning_logs:

$ pip3 install tensorboard
$ tensorboard --logdir=lightning_logs/

Then go to http://localhost:6006 on your web browser for viewing the training logs.

Known Issues

Bug 1

RuntimeError: received 0 items of ancdata

Ref: GitHub Issue

Increase the ulimit:

import resource
rlimit = resource.getrlimit(resource.RLIMIT_NOFILE)
resource.setrlimit(resource.RLIMIT_NOFILE, (2048, rlimit[1]))

Citation

If you find the code useful, please use the following BibTeX entry.

@misc{xu2024hsvit,
      title={HSViT: Horizontally Scalable Vision Transformer}, 
      author={Chenhao Xu and Chang-Tsun Li and Chee Peng Lim and Douglas Creighton},
      year={2024},
      eprint={2404.05196},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
autoscripts		autoscripts
config		config
datasets		datasets
figures		figures
huggingface		huggingface
models		models
trainers		trainers
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HSViT: Horizontally Scalable Vision Transformer

Prerequisites

Model Training

Train HSViT

Train SOTA

Train HSViT with existing CNN backbones

View Logs

Known Issues

Bug 1

Citation

About

Languages

xuchenhao001/HSViT

Folders and files

Latest commit

History

Repository files navigation

HSViT: Horizontally Scalable Vision Transformer

Prerequisites

Model Training

Train HSViT

Train SOTA

Train HSViT with existing CNN backbones

View Logs

Known Issues

Bug 1

Citation

About

Resources

Stars

Watchers

Forks

Languages