
Out of memory! #11

Closed
primepake opened this issue Dec 3, 2024 · 23 comments

@primepake

I used the same config as your training but got OOM! Are you sure it's correct?

@LorenzoAgnolucci
Collaborator

Hi! What GPU are you using?

@primepake
Author

I'm using an H100 with 80GB.

@LorenzoAgnolucci
Collaborator

LorenzoAgnolucci commented Dec 3, 2024

I see. Are you training the model with the default resolution?

@primepake
Author

Yes, 512x512. Have you tried training with accelerator? I trained with it and got OOM.

@LorenzoAgnolucci
Collaborator

By default, the --train-patch-size parameter is set to 128. I've never tried using accelerator, sorry.

@primepake
Author

I just started training, but I'm checking your code and it only takes inputs with this shape:
`imgs_lq.shape, imgs_gt.shape, imgs_ref.shape: torch.Size([2, 5, 3, 128, 128]) torch.Size([2, 5, 3, 128, 128]) torch.Size([2, 5, 3, 128, 128])`
So why do you run inference with 512x512 images?

@primepake
Author

Can you share the charts of your loss values?
[image: training loss charts]

@LorenzoAgnolucci
Collaborator

In image and video restoration, training with a patch size smaller than the test one is a common way to save memory during training. Which dataset are you training on? Is it the one from the paper or a custom one?
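To illustrate the idea, here is a minimal sketch of patch-based training. The helper and parameter values are hypothetical (not the repo's actual code); the same random window is cropped from the LQ, GT, and reference clips, which are assumed to have shape `[B, T, C, H, W]` as in the sizes quoted above:

```python
# Hypothetical sketch of patch-based training, not the repo's actual code.
import torch

def random_crop_triplet(imgs_lq, imgs_gt, imgs_ref, patch_size=128):
    """Crop the same random patch from LQ, GT, and reference clips ([B, T, C, H, W])."""
    _, _, _, h, w = imgs_lq.shape
    top = torch.randint(0, h - patch_size + 1, (1,)).item()
    left = torch.randint(0, w - patch_size + 1, (1,)).item()
    crop = lambda x: x[..., top:top + patch_size, left:left + patch_size]
    return crop(imgs_lq), crop(imgs_gt), crop(imgs_ref)

# Training: 512x512 frames -> 128x128 patches, matching the shapes quoted above
lq, gt, ref = (torch.randn(2, 5, 3, 512, 512) for _ in range(3))
lq_p, gt_p, ref_p = random_crop_triplet(lq, gt, ref)  # each [2, 5, 3, 128, 128]
```

Inference can then run on full 512x512 frames, as long as the architecture accepts arbitrary input sizes.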

@primepake
Author

primepake commented Dec 4, 2024

I'm training with a car dataset.

@LorenzoAgnolucci
Collaborator

Can I see a video example somewhere? Is it public?

@primepake
Author

Yes! It's recorded around cars:
https://www.kaggle.com/datasets/tapakah68/videos-around-cars

@LorenzoAgnolucci
Collaborator

What's the end goal of your work? The videos don't seem to be degraded with artifacts similar to those considered in our paper.

@primepake
Author

I want to increase the quality of the images by degrading the ground truth images during training.

@LorenzoAgnolucci
Collaborator

What kind of degradation? Our model is designed for analog video restoration; I'm not sure it's suitable for your purpose.

@primepake
Author

They're simple degradations applied to the videos (Gaussian noise, Gaussian blur, blurring part of the car, downsample-upsample).
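For reference, a minimal sketch of such a degradation pipeline (my own reconstruction of the degradations listed above, not the actual code; all parameter values are made up):

```python
# Hypothetical degradation pipeline for a single frame [C, H, W] in [0, 1].
import torch
import torch.nn.functional as F
from torchvision.transforms.functional import gaussian_blur

def degrade(frame, noise_sigma=0.05, blur_kernel=7, blur_sigma=1.5, scale=4):
    # Gaussian blur
    out = gaussian_blur(frame, kernel_size=blur_kernel, sigma=blur_sigma)
    # Downsample, then upsample back to the original resolution
    _, h, w = out.shape
    out = F.interpolate(out.unsqueeze(0), size=(h // scale, w // scale),
                        mode="bicubic", align_corners=False)
    out = F.interpolate(out, size=(h, w),
                        mode="bicubic", align_corners=False).squeeze(0)
    # Additive Gaussian noise
    out = out + noise_sigma * torch.randn_like(out)
    return out.clamp(0.0, 1.0)
```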

@LorenzoAgnolucci
Collaborator

How did you determine the reference frames? Did you use the standard textual prompts or did you change them to fit your needs?

By the way, I'm closing the issue since the out of memory problem is now solved.

@primepake
Author

I just select them randomly from the dataset. Do you think it would work if we select different views?

@LorenzoAgnolucci
Collaborator

Why should random frames serve as reference frames for the restoration? In the paper, we devised a specific methodology for analog videos. You should try to adjust that for your needs if you want to use our model for your purpose.

@primepake
Author

I'm just thinking that during training we can select references randomly so the model can learn better; then, at inference, we can choose the references based on CLIP score.
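A sketch of that inference-time selection could look like the following (an assumption of how it might be done with the Hugging Face CLIP API; the helper and choice of checkpoint are hypothetical):

```python
# Hypothetical CLIP-based reference selection: rank candidate frames by
# image-image cosine similarity to the frame being restored.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

@torch.no_grad()
def select_references(query: Image.Image, candidates: list, k: int = 5):
    """Return the k candidate frames most similar to the query frame."""
    inputs = processor(images=[query] + candidates, return_tensors="pt")
    feats = model.get_image_features(**inputs)
    feats = feats / feats.norm(dim=-1, keepdim=True)
    sims = feats[1:] @ feats[0]  # cosine similarity of each candidate to the query
    topk = sims.topk(k).indices.tolist()
    return [candidates[i] for i in topk]
```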

@LorenzoAgnolucci
Collaborator

I've never tried it, so I'm not sure whether it will work.

@primepake
Author

Thanks! Can you provide your training charts?

@LorenzoAgnolucci
Collaborator

I don't have the charts anymore, but I have the values of the losses at the end of the training: about 3.2 for the pixel_loss, and about 4.6 for the perceptual_loss. However, since you are using a different training dataset, I don't think these values are useful for you. But from what I remember, the profiles of the losses were similar to your charts.
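For context, a generic sketch of such a pixel + perceptual loss combination (an assumption of the typical setup, here using L1 and the LPIPS package; the repo's exact loss definitions and weights may differ):

```python
# Hypothetical pixel + perceptual loss; pred and target are in [0, 1].
import torch
import lpips  # pip install lpips

l1 = torch.nn.L1Loss()
perc = lpips.LPIPS(net="vgg")  # VGG-based perceptual distance, expects inputs in [-1, 1]

def restoration_loss(pred, target, pixel_weight=1.0, perc_weight=1.0):
    pixel_loss = l1(pred, target)
    perceptual_loss = perc(pred * 2 - 1, target * 2 - 1).mean()
    return pixel_weight * pixel_loss + perc_weight * perceptual_loss
```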

@primepake
Author

Thanks a lot!
