reproduction of Panoptic segmentation on COCO #79

wmkai · 2022-04-08T06:36:12Z

Hi thank you for your excellent work. I meet a problem when re-run your experiments.

I tried to follow your advice in Getting Started with Mask2Former, and run:
python train_net.py --num-gpus 8 \ --config-file configs/coco/panoptic-segmentation/maskformer2_R50_bs16_50ep.yaml
after training, the log showed that "Start inference on 625 batches". But after a few days, there are still no new logs. So I kill this process and run
python train_net.py \ --config-file configs/coco/panoptic-segmentation/maskformer2_R50_bs16_50ep.yaml \ --eval-only MODEL.WEIGHTS ./output/model_0094999.pth,
after evaluation, it showed that the result was

was lower that the result from Table 1 in the paper,

could you help me see what is the reason for this ^ ^

The text was updated successfully, but these errors were encountered:

bowenc0221 · 2022-04-08T18:24:09Z

Your training did not finish, please refer to #74

wmkai · 2022-04-09T13:38:13Z

thx for replying, but my problem seems not the same as #74. His pytorch nccl and system nccl version numbers are not the same but mine are the same. Everytime after iteration 94979, my training process stop. At the same time, all my 8 GPUs are utilized 0% which is not the same with #74.

bowenc0221 · 2022-04-09T17:24:48Z

The COCO model is trained for 368750 iterations, but you evaluated the model on the 94999-th iteration.

wmkai · 2022-04-10T03:50:23Z

thanks, I tried again and find these logs after the 94999-th iteration

And then I checked the CPU memory usage and found the CPU memory is exhausted.

Can I ask about how much CPU memory is required for this training process

wmkai · 2022-04-13T08:58:36Z

it seems that I met this problem in open-mmlab/mmdetection#7538

wmkai closed this as completed Aug 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reproduction of Panoptic segmentation on COCO #79

reproduction of Panoptic segmentation on COCO #79

wmkai commented Apr 8, 2022 •

edited

Loading

bowenc0221 commented Apr 8, 2022

wmkai commented Apr 9, 2022

bowenc0221 commented Apr 9, 2022

wmkai commented Apr 10, 2022 •

edited

Loading

wmkai commented Apr 13, 2022

reproduction of Panoptic segmentation on COCO #79

reproduction of Panoptic segmentation on COCO #79

Comments

wmkai commented Apr 8, 2022 • edited Loading

bowenc0221 commented Apr 8, 2022

wmkai commented Apr 9, 2022

bowenc0221 commented Apr 9, 2022

wmkai commented Apr 10, 2022 • edited Loading

wmkai commented Apr 13, 2022

wmkai commented Apr 8, 2022 •

edited

Loading

wmkai commented Apr 10, 2022 •

edited

Loading