Pre-train PerceiverIO #85

jacobbieker · 2021-08-31T11:26:25Z

Various ideas

Pretrain predicting next frame from past two
Simulated clouds/optical flow
Try AutoFlow like in Perceiver paper

JackKelly · 2021-09-01T09:37:55Z

Sounds good!

A related trick up our sleeves would be to train on the ~10 years of data available from EUMETSAT: openclimatefix/nowcasting_dataset#81

(Training on more data isn't exactly "pre-training" :) But it might be worth trying. What do you think the priority should be: training on ~ 10 years of data; or pre-training using 'auxillary' tasks? Although it'll likely take a while to download & prepare ~10 years of data, so maybe we should get that going 'in the background' soonish?)

jacobbieker · 2021-09-01T09:54:48Z

I think yeah, getting it started in the background would be good, having all that data could also help if we want to try the similarity idea mentioned here #65, I think the extra data is probably a higher priority, but while that's running, trying the auxiliary tasks would be helpful.

For the simulated clouds/optical flow, more data could also help with getting real clouds that we could possibly "copy/paste" for the simulated optical flow? As in, get the cloud pixel values by subtracting the base ground data for real clouds, save out those clouds, and then paste random combos or crops of those clouds and generate the optical flow from that?

JackKelly · 2021-09-01T10:13:21Z

get the cloud pixel values by subtracting the base ground data for real clouds, save out those clouds, and then paste random combos or crops of those clouds and generate the optical flow from that

Sounds great to me!

JackKelly · 2021-09-01T10:25:16Z

I think the extra data is probably a higher priority

Cool, in our next meeting we can chat a bit about getting more data! I agree, it feels like a priority to grab more data!

jacobbieker · 2022-01-24T14:00:13Z

The HuggingFace PerceiverIO has the weights for optical flow task and others, so we can use that and then pre-train some more on the historical satellite imagery

jacobbieker added discussion enhancement New feature or request labels Aug 31, 2021

jacobbieker self-assigned this Aug 31, 2021

This was referenced Sep 1, 2021

Simulated Optical Flow using real clouds #88

Open

Add EUMETSAT cloud masks openclimatefix/nowcasting_dataset#83

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pre-train PerceiverIO #85

Pre-train PerceiverIO #85

jacobbieker commented Aug 31, 2021

JackKelly commented Sep 1, 2021 •

edited

Loading

jacobbieker commented Sep 1, 2021

JackKelly commented Sep 1, 2021

JackKelly commented Sep 1, 2021

jacobbieker commented Jan 24, 2022

Pre-train PerceiverIO #85

Pre-train PerceiverIO #85

Comments

jacobbieker commented Aug 31, 2021

JackKelly commented Sep 1, 2021 • edited Loading

jacobbieker commented Sep 1, 2021

JackKelly commented Sep 1, 2021

JackKelly commented Sep 1, 2021

jacobbieker commented Jan 24, 2022

JackKelly commented Sep 1, 2021 •

edited

Loading