I'm testing model throughput on my machine, and Hiera is much slower than ViT in my case. Based on the paper, shouldn't Hiera be faster? I ran the benchmark with the provided benchmarking.py code, and here is the output (all runs were on a single A100 GPU):
model = hiera.hiera_base_224()
benchmark(model, device=0, input_size=(3, 224, 224), batch_size=64, runs=40, use_fp16=True, verbose=True)
Throughput: 2095.55 im/s
model_name = 'vit_base_patch16_224_in21k'
model = timm.create_model(model_name, pretrained=False).cuda()
benchmark(model, device=0, input_size=(3, 224, 224), batch_size=64, runs=40, use_fp16=True, verbose=True)
Throughput: 3190.02 im/s
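For context, my understanding is that a throughput figure like the ones above is the number of images processed divided by wall-clock time, measured after discarding some warm-up runs. Here is a minimal sketch of that idea. It is not the actual benchmarking.py code: `measure_throughput`, `step`, `throw_out`, and `sync` are hypothetical names I made up for illustration, and on a GPU I'm assuming `sync` would be something like `torch.cuda.synchronize`, so that queued kernels are finished before the clock is read.

```python
import time


def measure_throughput(step, batch_size, runs=40, throw_out=0.25, sync=None):
    """Return items per second for `step`, ignoring an initial warm-up fraction.

    step: zero-argument callable that processes one batch (hypothetical name).
    sync: optional callable that blocks until queued work has finished; on a
          GPU this would be something like torch.cuda.synchronize (assumption).
    """
    warm_up = int(runs * throw_out)
    total_items = 0
    start = time.perf_counter()
    for i in range(runs):
        if i == warm_up:
            # Warm-up is over: wait for pending work, then restart the clock.
            if sync is not None:
                sync()
            total_items = 0
            start = time.perf_counter()
        step()
        total_items += batch_size
    # Make sure all work is done before reading the final time.
    if sync is not None:
        sync()
    elapsed = time.perf_counter() - start
    return total_items / elapsed
```

If the two models were timed this way, a gap like the one above would reflect real per-batch time rather than warm-up or asynchronous launch effects, which is why I'm surprised by the result.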