-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MS73 layer models might be overfitting #1202
Comments
Yes! What we're finding is that the models trained on specific images will do well generalizing to new random things we show them, and then at some point completely stop or freak out; usually in the vein of #1204 where we're seeing a layer stop being produced entirely, or layers are combined (usually text and music). |
Does the freak out occur without retraining? What happens if you try the second time w/o retraining? |
@fujinaga so far the models I've been creating have been producing consistent results; if I run the same folio twice with the same layer separation models, the results are exactly the same. So I believe the freak out happens after retraining. @kyrieb-ekat did mention something weird happening when using models trained in staging on a Paco Classifying job in production (Kyrie I'm sorry I can't remember the details!) |
Question: in the iterating models were the previous models used as inputs in the training run, or just the ZIP files? It makes sense more labeled examples of various items would provide more information on how to handle them when they show up, so this is good Rodan Lore to know... This abuts nicely with one thing I've been testing in the background about at what point rodan does best with too many samples (like, is it better to have ALL the samples added with each successive correction and not cycle them out, or is it better to retain like, three and cycle out the oldest ones), where Rodan does better with the previous training models as inputs and at least two sample inputs. There may or may not be a memory ceiling (so to speak) with samples, but since you were able to manage 5 without it breaking we might be able to get away with more than we think. |
Just the ZIP files! Wait... you can input models to the Paco Trainer job? Can't you just import layers? |
UPDATE: I have conducted yet another test and obtained yet another result. This time I tried making models using patches of a single folio. I ran the folio in Pixel four times, using a different chunk of folio 077 each time (the four chunks added up to the complete folio). The first set of models I made like so:
The second I made like so:
First of all, neither set of models work particularly well. Both still get layers 1 and 3 pretty mixed up. Am I just not using enough samples? Second of all, this time the iteration models did slightly better. The main difference is again in the border: the iteration models don't include any borders almost at all, whereas the combo models have a huge amount of border, usually in layer 1. Demo: these are the layer 1's of folio 055, with the combo models on the left and the iteration models on the right. Why is the border treated so differently when I painted over the exact same things? And why is this result the opposite of the previous one, where the iteration model is the one that included the border? @fujinaga is this expected behaviour? I would be grateful for futher guidance, because I'm a bit at loose ends at this point. |
I can best explain this with a scenario:
I did a Pixel run with folio 023. I then produced models with that run and used those to do a second Pixel run with folio 017. Those latest models could separate folio 017 fairly well. I then did a third Pixel run with folio 260. Those models separated folio 260 perfectly, but when they tried to separate 017 they produced three almost completely empty layers.
This has been happening fairly often; models we make do a great job separating the folio they were just trained on, but can't separate any other folio, including ones they were trained on previously.
These are the models I'm talking about:
Background Model MTWS 023 round 3.hdf5.zip
Model 1 MTWS 023 round 3.hdf5.zip
Model 2 MTWS 023 round 3.hdf5.zip
Model 3 MTWS 023 round 3.hdf5.zip
@kyrieb-ekat did I explain that right
The text was updated successfully, but these errors were encountered: