
Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! #1

Open
landiaokafeiyan opened this issue Feb 10, 2023 · 7 comments


@landiaokafeiyan

Hi there,

Thanks for your excellent work. I hit this error when I train and test your code. Do you have any idea what is wrong? As far as I can tell, the data and the model are both on CUDA.

Thanks in advance!

@afpapqy

afpapqy commented Feb 10, 2023

I solved that by wrapping the model in torch.nn.parallel.DistributedDataParallel.
However, I then hit a CUDA out-of-memory error:

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 50.00 MiB (GPU 0; 11.91 GiB total capacity; 10.99 GiB already allocated; 3.88 MiB free; 11.07 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
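For reference, the allocator hint mentioned in that message is set via an environment variable that must be in place before PyTorch initializes CUDA; the value below (128 MiB) is just an example, not a recommendation from this repo:

```python
import os

# Must be set before the first `import torch` so the CUDA caching
# allocator picks it up; 128 is an example value, tune for your workload.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"
print(os.environ["PYTORCH_CUDA_ALLOC_CONF"])
```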

@landiaokafeiyan
Author

I think you have to reduce the batch size. Even though I have 2x 2080 Ti GPUs, I set the batch size to 2.

@afpapqy

afpapqy commented Feb 13, 2023

I think you have to reduce the batch size. Even though I have 2x 2080 Ti GPUs, I set the batch size to 2.

My GPU is a Titan Xp with 12GB of memory, and the image size is 576x576, but I still get an "out of memory" error even when I set the batch size to 1.
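If batch size 1 still does not fit, the usual options are a smaller crop size, mixed precision, or gradient checkpointing. When a single sample fits but you want a larger effective batch, gradient accumulation helps; a framework-free sketch of the accumulation arithmetic (the 1/accum_steps scaling is the part that is easy to get wrong):

```python
def accumulate(micro_grads, accum_steps):
    """Sum per-micro-batch gradients, each scaled by 1/accum_steps,
    so the result equals the gradient of one batch accum_steps larger."""
    total = 0.0
    for g in micro_grads:
        total += g / accum_steps  # scale before summing
    return total

# Four micro-batches behave like one batch of 4x the size:
print(accumulate([1.0, 2.0, 3.0, 4.0], 4))  # 2.5
```

In a real training loop this means dividing each micro-batch loss by accum_steps, calling backward on it, and only stepping the optimizer (and zeroing gradients) every accum_steps iterations.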

@anhquyetnguyen

I am facing the same issue. Can you share the solution? @afpapqy @landiaokafeiyan

@PigBroA

PigBroA commented Feb 28, 2023

I modified it a little and now it runs without the device error.

in models/swin_transformer_v2.py line 294
original: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01))).exp()
modified: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01).to('cuda:0'))).exp()

This is just an example; you can use another variable to move the tensor to the right device.

@landiaokafeiyan
Author

Hi @afpapqy @PigBroA
when I test a 3000x4000 image, I have to split it into several patches, which decreases the performance. Do you have any good ideas to solve this problem?

Thanks in advance.
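A common way to reduce the patching penalty is overlapping sliding-window inference: tile the image with some overlap and average the predictions where tiles overlap, so boundary artifacts are blended away instead of showing up as hard seams. A minimal sketch of the tiling step (the tile/stride values are examples, not from this repo):

```python
def tile_starts(size, tile, stride):
    """Start offsets for overlapping tiles that fully cover [0, size)."""
    starts = []
    pos = 0
    while pos + tile < size:
        starts.append(pos)
        pos += stride
    starts.append(max(size - tile, 0))  # last tile sits flush with the edge
    return starts

# 4000-pixel side, 576-pixel tiles, 64-pixel overlap (stride 512):
print(tile_starts(4000, 576, 512))
```

At inference time, run the model on each tile, accumulate its logits into a full-size output buffer along with a count (or weight) map, and divide at the end so overlapping regions are averaged.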

@kmbmjn

kmbmjn commented May 8, 2023

I modified it a little and now it runs without the device error.

in models/swin_transformer_v2.py line 294
original: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01))).exp()
modified: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01).to('cuda:0'))).exp()

This is just an example; you can use another variable to move the tensor to the right device.

Thank you for this solution!
In a multi-GPU environment I hit another device mismatch error between "cuda:0" and "cuda:1", so I used the following modification instead:

original: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01))).exp()
modified: logit_scale = torch.clamp(self.logit_scale, max=torch.log(torch.tensor(1. / 0.01).to(self.logit_scale.device))).exp()
