Running inference on CPU #149

Open
AD-lite24 opened this issue Aug 22, 2024 · 3 comments

Comments

@AD-lite24

Hi, I was wondering if there is any support for CPU inference. The sample script from hubconf.py doesn't run even after removing all the code that moves tensors and models to CUDA, presumably because of some internal line that still expects CUDA:

torch.autocast(device_type='cuda', dtype=torch.bfloat16, enabled=False)

in mono/model/decode_heads/RAFTDepthNormalDPTDecoder5.py

I'm not sure how many more such instances there are, so I wanted to get this clarified. I'm sure it will be difficult to run on CPU, but still.
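
For reference, this is the kind of device-agnostic replacement I was imagining for that autocast line. It's only a minimal sketch, the helper name and the dummy tensor are mine, not code from the repo:

import torch

def autocast_for(t: torch.Tensor):
    # Use the bfloat16/CUDA autocast only when the tensor actually lives on a GPU;
    # on CPU, return a disabled CPU autocast so nothing assumes CUDA is present.
    if t.is_cuda:
        return torch.autocast(device_type='cuda', dtype=torch.bfloat16, enabled=False)
    return torch.autocast(device_type='cpu', enabled=False)

x = torch.randn(1, 3, 64, 64)   # CPU tensor here; a CUDA tensor in the real pipeline
with autocast_for(x):
    y = x * 2                   # stand-in for the decoder forward pass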

@elvistheyo

@AD-lite24 were you able to run it on CPU?

@AD-lite24
Author

@elvistheyo Nope. As I said, it would take a lot of effort that might end up wasted anyway. Let me know if you choose to try it out, though; I could try to assist you with it if possible.

@JUGGHM
Collaborator

JUGGHM commented Sep 3, 2024

I think it will be difficult, and not very beneficial, to run inference on CPU. It takes approximately 1.5–4 minutes to perform one inference with the ViT-L model. Additionally, xformers, an important acceleration library, does not support CPU either.
The type torch.bfloat16 is only supported on GPU; on CPU, all tensors should be torch.float32.
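
If someone still wants to try, a rough sketch of CPU inference under those constraints could look like the following. The hub entry point and the inference() call follow the sample usage in hubconf.py, but treat them as assumptions, and the input is a dummy tensor:

import torch

device = torch.device('cpu')

# Entry-point name follows the hubconf.py sample; treat it as an assumption.
model = torch.hub.load('yvanyin/metric3d', 'metric3d_vit_large', pretrain=True)
model = model.to(device).float().eval()   # keep all weights in float32 on CPU

rgb = torch.rand(1, 3, 616, 1064, dtype=torch.float32, device=device)  # dummy input
with torch.no_grad():
    # Expect minutes per image on CPU, as noted above.
    pred_depth, confidence, output_dict = model.inference({'input': rgb})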
