
The best performance of pretrained model #68

Closed
xwhkkk opened this issue Mar 6, 2023 · 9 comments

Comments

@xwhkkk

xwhkkk commented Mar 6, 2023

Thanks for sharing the released pretrained model. I wonder whether the default model XMem.pth was trained with stage 03? Are the reported results on the val and test sets only from the 107K checkpoint?
Many thanks in advance !

@hkchengrex
Owner

The default is s03.
It is a 107K model if I recall correctly.

@xwhkkk
Author

xwhkkk commented Mar 6, 2023

Thanks for your kind reply.
Have you trained the model with stage 02? What is the best model from stage 02?
I tested the 160K model from stage 02 and got 86.2, but with the 107K model from stage 03 it decreases to 85.8. Is that normal? How should we choose which stage to use (2 or 3)?

Thanks for your patience.

@hkchengrex
Owner

hkchengrex commented Mar 6, 2023

I have tried s02 but it has basically the same performance as in s03 so I opted for shorter training instead. If s02 works better in your case then go for it. I have not observed any overfitting in training stage 2, and the only caveat is that it takes longer to train.

@xwhkkk
Author

xwhkkk commented Mar 8, 2023

Thanks. I evaluated my base training (stage 03) results on the val set, training with 2 A100 GPUs and with 4 A100 GPUs (keeping batch size = 8), but the results are only 85.8 and 84.9. Could you give some suggestions on what might have caused that?

@hkchengrex
Owner

I have only trained it on the few machines that I have access to and have not seen significantly worse results, so I have little idea. Have you tried looking at the last few network weights (105K-110K) to see if any of them is better?
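
A minimal sketch of such a checkpoint sweep is below. It assumes the checkpoints follow an `XMem_s03_<iteration>.pth` naming pattern in a `saves` directory, and `evaluate_on_davis` is a hypothetical placeholder for whatever evaluation pipeline you already use (it is not part of the XMem repository):

```python
# Hypothetical checkpoint sweep: score the last few saved weights and keep
# the best one. `evaluate_on_davis` is a placeholder for your existing
# evaluation pipeline; the directory and naming pattern are assumptions.
from pathlib import Path

def evaluate_on_davis(checkpoint: Path) -> float:
    """Placeholder: run inference with `checkpoint` and return the J&F score."""
    raise NotImplementedError

ckpt_dir = Path("saves")                                    # assumed output dir
candidates = sorted(ckpt_dir.glob("XMem_s03_*.pth"))[-5:]   # e.g. the 105K-110K range

scores = {ckpt.name: evaluate_on_davis(ckpt) for ckpt in candidates}
best = max(scores, key=scores.get)
print(f"Best checkpoint: {best} (J&F = {scores[best]:.1f})")
```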

Another possible reason is PIL 8 vs. PIL 9 -- they use different JPEG decoding algorithms. I recently ran into problems with this in one of my recent projects, but I am not sure whether it affects XMem.
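
One quick way to check whether JPEG decoding actually differs between two environments is to hash the decoded pixels of the same training image under each Pillow version. This is just a sanity-check sketch, not something from the XMem codebase; the filename is an example:

```python
# Sanity check: decode the same JPEG and hash the pixel data. Run this in
# both environments (Pillow 8 vs. Pillow 9); differing hashes mean the two
# versions decode the file differently.
import hashlib
import numpy as np
import PIL
from PIL import Image

print("Pillow version:", PIL.__version__)

img = np.asarray(Image.open("example.jpg").convert("RGB"))  # any training JPEG
print("decoded pixel hash:", hashlib.md5(img.tobytes()).hexdigest())
```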

If longer training works in your case it is probably the easiest thing to do.

@hkchengrex
Owner

There seems to be a similar issue in #60. This is probably not an isolated case but it is very hard for me to debug...

@xwhkkk
Author

xwhkkk commented Mar 9, 2023

Thanks for your kind reply. The only difference between stage 02 and stage 03 is the number of iterations, right? So I would expect the stage 02 and stage 03 results at 100K iterations to be nearly the same, but when I tested them on the DAVIS val set I got 84.0 and 84.7, respectively. Could you give me some suggestions?

@hkchengrex
Owner

We adjust the maximum skip between frames (curriculum learning) using the training progress in terms of the percentage of total iterations. So they are not the same when the total number of iterations is different.
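
As a rough illustration of why the schedules differ, here is a sketch of a progress-based skip schedule. The breakpoints, skip values, and total iteration counts below are made-up placeholders, not the actual numbers used in the XMem training code:

```python
# Illustrative curriculum schedule: the maximum frame skip is chosen from the
# *fraction* of total iterations completed, so the same absolute iteration
# (e.g. 100K) falls at a different point of the curriculum when the total
# iteration count differs (s02 vs. s03). All numbers here are placeholders.
def max_skip(cur_iter: int, total_iter: int,
             breakpoints=(0.1, 0.3, 0.8, 1.0),
             skips=(10, 15, 20, 5)) -> int:
    progress = cur_iter / total_iter
    for frac, skip in zip(breakpoints, skips):
        if progress <= frac:
            return skip
    return skips[-1]

# 100K iterations sits in different curriculum phases under different totals:
print(max_skip(100_000, 110_000))   # progress ~0.91 -> last phase, skip 5
print(max_skip(100_000, 160_000))   # progress ~0.63 -> earlier phase, skip 20
```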

I will try to investigate the training issue.

@hkchengrex
Owner

Continue in #71.
