RuntimeError: CUDA error: device-side assert triggered #2837
Comments
You've chosen to report an unexpected problem or bug. Unless you already know the root cause of it, please include details about it by filling the issue template.
Same problem here with clean environments and installations. I've tested in Anaconda and venv, using CUDA 11.1 and PyTorch 1.8.1. If I download the tutorial notebook and run it locally, I also get the same error while training on the balloon dataset.
Have had exactly the same error occur with Curiously enough, it seems that this issue affects only
Exactly the same on my 3070 with CUDA 11.2 and PyTorch 1.8.1 or nightly.
This is a pytorch bug, please follow pytorch/pytorch#55027 instead.
How can I downgrade PyTorch if I have CUDA 11.2 on my system?
I have the same problem if I run the code on my local machine (CUDA 11.1 + PyTorch 1.8.1 + compiled detectron2) and (CUDA 10.2 + PyTorch 1.8.1 + pre-compiled detectron2). However, exactly the same code runs OK on Colab (CUDA 10.1).
Same error with CUDA 11.1 + PyTorch 1.8.1 + compiled detectron2. It runs on CPU, though.
I've got the same problem, CUDA Version 10.0.130, Driver Version: 450.119.03
Same error here with PyTorch 1.8.1 and CUDA 11; no such errors with PyTorch 1.8.1 and CUDA 10.2.
Same problem; it works well on CPU.
Same error with PyTorch 1.8.0 and Python 3.8. Cause unknown.
Maybe this can help some people struggling with this issue: I was able to solve the problem by installing PyTorch 1.10 with CUDA 11.3 for my RTX 3060 GPU. I previously hit the same problem when using PyTorch 1.8 with CUDA 11.1 on the RTX 3060.
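Since most reports in this thread come down to which PyTorch build, CUDA version, and GPU are in use, here is a minimal sketch for collecting those three facts in one place. The helper name `describe_torch_env` is hypothetical (it is not part of detectron2 or PyTorch), and it only assumes the standard `torch.version.cuda` and `torch.cuda` query functions.

```python
def describe_torch_env() -> dict:
    """Collect the PyTorch / CUDA details people are quoting in this thread.

    Hypothetical helper for illustration, not a detectron2 or PyTorch API.
    """
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this environment.
        return {"torch": None}

    info = {
        "torch": torch.__version__,         # e.g. "1.8.1+cu111"
        "cuda_build": torch.version.cuda,   # CUDA version of the build, None if CPU-only
        "gpu_available": torch.cuda.is_available(),
    }
    if info["gpu_available"]:
        info["gpu_name"] = torch.cuda.get_device_name(0)
        # Compute capability as a tuple, e.g. (8, 6) for arch 8.6.
        info["gpu_arch"] = torch.cuda.get_device_capability(0)
    return info


print(describe_torch_env())
```

Posting this output alongside a report makes it easier to compare working and failing combinations.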
I have the same problem with CUDA 11.0 and torch 1.7.1. Can someone help me with this? Thanks.
Same error, and no solution anywhere.
Refer to this doc: https://huggingface.co/docs/transformers/v4.20.1/en/troubleshooting
From the traceback, the error in my case was caused by out-of-bounds values in the (normalized) bounding boxes: they should be in the range 0 to 1000, so any value exceeding 1000 will throw an error.
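For anyone who wants to check their own data against the 0 to 1000 range mentioned above, here is a minimal sketch. The function name `find_out_of_range_boxes` and the `[x0, y0, x1, y1]` box layout are assumptions for illustration; they are not taken from the linked doc.

```python
from typing import List, Sequence, Tuple


def find_out_of_range_boxes(
    boxes: Sequence[Sequence[float]], low: float = 0, high: float = 1000
) -> List[Tuple[int, Sequence[float]]]:
    """Return (index, box) pairs where any coordinate falls outside [low, high]."""
    return [
        (i, box)
        for i, box in enumerate(boxes)
        if any(coord < low or coord > high for coord in box)
    ]


# Hypothetical boxes in an assumed [x0, y0, x1, y1] layout.
boxes = [[10, 20, 300, 400], [0, 0, 1000, 1000], [50, 60, 1024, 700]]
bad = find_out_of_range_boxes(boxes)
print(bad)  # [(2, [50, 60, 1024, 700])]
```

Running a check like this on the CPU side before the batch reaches the model gives a readable index and value instead of a device-side assert.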
I am running
./train_net.py --config-file ../configs/PascalVOC-Detection/faster_rcnn_R_50_C4.yaml --num-gpus 1 SOLVER.IMS_PER_BATCH 2 SOLVER.BASE_LR 0.0025
and it fails with this error: RuntimeError: CUDA error: device-side assert triggered
LOG
[03/29 17:55:18 fvcore.common.checkpoint]: The checkpoint state_dict contains keys that are not used by the model:
fc1000.{bias, weight}
stem.conv1.bias
[03/29 17:55:18 d2.engine.train_loop]: Starting training from iteration 0
/pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [13,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [16,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [19,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [20,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [22,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [23,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [24,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [25,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [26,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [27,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [28,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [29,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [30,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [31,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [96,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [97,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [98,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [99,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [100,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [101,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [102,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [103,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [104,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [105,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [106,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [107,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [108,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [109,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [110,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [111,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [112,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [113,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [114,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [115,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [116,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [117,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [118,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [119,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [120,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [121,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [32,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [33,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [34,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [35,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [36,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [37,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [38,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [39,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [40,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [41,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [42,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [43,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [44,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [45,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [46,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [47,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [48,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [49,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [50,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [51,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [52,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [53,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [54,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [55,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [56,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [57,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [58,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [59,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [60,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [61,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [62,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [63,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [64,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [65,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [66,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [67,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [68,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [69,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [70,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [71,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [72,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [73,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [74,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [75,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [76,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [77,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [78,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [79,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [80,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [81,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [82,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [83,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [84,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [85,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [86,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [87,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [88,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [89,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [90,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [91,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [92,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [93,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [94,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [95,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed.ERROR [03/29 17:55:19 d2.engine.train_loop]: Exception during training:
Traceback (most recent call last):
  File "/home/jwz/文档/detectron2-master/detectron2/engine/train_loop.py", line 140, in train
    self.run_step()
  File "/home/jwz/文档/detectron2-master/detectron2/engine/defaults.py", line 441, in run_step
    self._trainer.run_step()
  File "/home/jwz/文档/detectron2-master/detectron2/engine/train_loop.py", line 234, in run_step
    loss_dict = self.model(data)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/meta_arch/rcnn.py", line 160, in forward
    proposals, proposal_losses = self.proposal_generator(images, features, gt_instances)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/proposal_generator/rpn.py", line 432, in forward
    gt_labels, gt_boxes = self.label_and_sample_anchors(anchors, gt_instances)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/proposal_generator/rpn.py", line 301, in label_and_sample_anchors
    matched_idxs, gt_labels_i = retry_if_cuda_oom(self.anchor_matcher)(match_quality_matrix)
  File "/home/jwz/文档/detectron2-master/detectron2/utils/memory.py", line 70, in wrapped
    return func(*args, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/matcher.py", line 88, in __call__
    assert torch.all(match_quality_matrix >= 0)
RuntimeError: CUDA error: device-side assert triggered
[03/29 17:55:19 d2.engine.hooks]: Total training time: 0:00:00 (0:00:00 on hooks)
[03/29 17:55:19 d2.utils.events]: iter: 1 total_loss: 3.161 loss_cls: 3.058 loss_box_reg: 0.0003562 loss_rpn_cls: 0.04828 loss_rpn_loc: 0.05456 data_time: 0.1060 lr: 2.5e-06 max_mem: 2041M
Traceback (most recent call last):
  File "./train_net.py", line 167, in <module>
    args=(args,),
  File "/home/jwz/文档/detectron2-master/detectron2/engine/launch.py", line 62, in launch
    main_func(*args)
  File "./train_net.py", line 155, in main
    return trainer.train()
  File "/home/jwz/文档/detectron2-master/detectron2/engine/defaults.py", line 431, in train
    super().train(self.start_iter, self.max_iter)
  File "/home/jwz/文档/detectron2-master/detectron2/engine/train_loop.py", line 140, in train
    self.run_step()
  File "/home/jwz/文档/detectron2-master/detectron2/engine/defaults.py", line 441, in run_step
    self._trainer.run_step()
  File "/home/jwz/文档/detectron2-master/detectron2/engine/train_loop.py", line 234, in run_step
    loss_dict = self.model(data)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/meta_arch/rcnn.py", line 160, in forward
    proposals, proposal_losses = self.proposal_generator(images, features, gt_instances)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/proposal_generator/rpn.py", line 432, in forward
    gt_labels, gt_boxes = self.label_and_sample_anchors(anchors, gt_instances)
  File "/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/proposal_generator/rpn.py", line 301, in label_and_sample_anchors
    matched_idxs, gt_labels_i = retry_if_cuda_oom(self.anchor_matcher)(match_quality_matrix)
  File "/home/jwz/文档/detectron2-master/detectron2/utils/memory.py", line 70, in wrapped
    return func(*args, **kwargs)
  File "/home/jwz/文档/detectron2-master/detectron2/modeling/matcher.py", line 88, in __call__
    assert torch.all(match_quality_matrix >= 0)
RuntimeError: CUDA error: device-side assert triggered
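A note for anyone reading the traceback above: CUDA kernels are launched asynchronously, so the Python frame that raises `RuntimeError` (here the `assert` in `matcher.py`) may only be the first point where the host synchronized with the GPU, not the operation that produced the out-of-bounds index reported by `IndexKernel.cu`. A minimal sketch of forcing synchronous launches so the traceback lands on the failing op, assuming it runs before `torch` is imported; the helper name `enable_synchronous_cuda_launches` is hypothetical:

```python
import os


def enable_synchronous_cuda_launches() -> None:
    """Ask the CUDA runtime to run kernel launches synchronously.

    Hypothetical helper for illustration. The variable has to be set before
    the first CUDA call; setting it before importing torch is the safest.
    """
    os.environ["CUDA_LAUNCH_BLOCKING"] = "1"


enable_synchronous_cuda_launches()

# import torch  # import only after the variable is set
# ... then start training as usual
```

Training runs slower in this mode, so it is meant for debugging only. Running the same step on CPU, which several comments report as working, is another way to get an ordinary Python error message if the indexing really is out of range.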
ENV
sys.platform linux
Python 3.7.10 (default, Feb 26 2021, 18:47:35) [GCC 7.3.0]
numpy 1.20.2
detectron2 0.4 @/home/jwz/文档/detectron2-master/detectron2
Compiler GCC 10.2
CUDA compiler CUDA 11.1
detectron2 arch flags 8.6
DETECTRON2_ENV_MODULE
PyTorch 1.8.1+cu111 @/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torch
PyTorch debug build False
GPU available True
GPU 0 GeForce RTX 3060 (arch=8.6)
CUDA_HOME /usr/local/cuda-11.1
Pillow 8.1.2
torchvision 0.9.1+cu111 @/home/jwz/miniconda3/envs/open-mmlab/lib/python3.7/site-packages/torchvision
torchvision arch flags 3.5, 5.0, 6.0, 7.0, 7.5, 8.0, 8.6
fvcore 0.1.5.post20210328
iopath 0.1.7
cv2 4.5.1
PyTorch built with: