Skip to content
This repository has been archived by the owner on Nov 29, 2023. It is now read-only.

Commit

Permalink
Update training/test/val splits based off #28 critique
Browse files Browse the repository at this point in the history
The correlation between days of the week might mean that there is leakage between the training and val/test sets. This is now split up so that 2020 is used for training, and 2021 is used for val/testing. This does mean that there is only data for the first half of 2021 in the test/val set for now, but that does cover winter/spring/part of summer so far, with more as time goes on.
  • Loading branch information
jacobbieker committed Jun 30, 2021
1 parent 425dded commit 85f7cc6
Show file tree
Hide file tree
Showing 4 changed files with 19 additions and 19 deletions.
20 changes: 10 additions & 10 deletions satflow/configs/datamodule/metnet_datamodule.yaml
Original file line number Diff line number Diff line change
@@ -1,22 +1,22 @@
# @package _group_
_target_: satflow.data.datamodules.SatFlowDataModule

batch_size: 4
batch_size: 1
data_dir: ${data_dir} # data_dir is specified in config.yaml
shuffle: 0
sources:
train: "satflow-flow-144-tiled-{00001..00149..2}.tar"
val: "satflow-flow-144-tiled-{00002..00149..4}.tar"
test: "satflow-flow-144-tiled-{00004..00149..4}.tar"
num_workers: 4
train: "satflow-flow-144-tiled-{00001..00105}.tar"
val: "satflow-flow-144-tiled-{00106..00129}.tar"
test: "satflow-flow-144-tiled-{00130..00149}.tar"
num_workers: 12
pin_memory: True
config:
visualize: False
num_timesteps: 12
skip_timesteps: 1
forecast_times: 48
output_shape: 256
output_target: 64
num_timesteps: 6
skip_timesteps: 3
forecast_times: 24
output_shape: 64
output_target: 16
target_type: "cloudmask"
num_crops: 10
use_topo: True
Expand Down
6 changes: 3 additions & 3 deletions satflow/configs/datamodule/satflow_dataloaders.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -18,9 +18,9 @@ pin_memory: True
time_as_channels: False
data_dir: ${data_dir} # data_dir is specified in config.yaml
sources:
train: "satflow-flow-36-{00001..00105..2}.tar"
val: "satflow-flow-36-{00002..00105..4}.tar"
test: "satflow-flow-36-{000106..00150}.tar"
train: "satflow-flow-144-tiled-{00001..00105}.tar"
val: "satflow-flow-144-tiled-{00106..00129}.tar"
test: "satflow-flow-144-tiled-{00130..00149}.tar"
bands:
[
"HRV",
Expand Down
6 changes: 3 additions & 3 deletions satflow/configs/datamodule/satflow_datamodule.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,9 @@ batch_size: 1
data_dir: ${data_dir} # data_dir is specified in config.yaml
shuffle: 0
sources:
train: "satflow-flow-144-tiled-{00001..00149..2}.tar"
val: "satflow-flow-144-tiled-{00002..00149..4}.tar"
test: "satflow-flow-144-tiled-{000014..00149..4}.tar"
train: "satflow-flow-144-tiled-{00001..00105}.tar"
val: "satflow-flow-144-tiled-{00106..00129}.tar"
test: "satflow-flow-144-tiled-{00130..00149}.tar"
num_workers: 1
pin_memory: True
config:
Expand Down
6 changes: 3 additions & 3 deletions satflow/configs/datamodule/unet_dataloaders.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -17,9 +17,9 @@ use_image: False
pin_memory: True
time_as_channels: True
sources:
train: "/run/media/bieker/data/EUMETSAT/2020/satflow-flow-36-{00001..00105..2}.tar"
val: "/run/media/bieker/data/EUMETSAT/2020/satflow-flow-36-{00002..00105..4}.tar"
test: "/run/media/bieker/data/EUMETSAT/2021/satflow-flow-36-{00106..00150}.tar"
train: "satflow-flow-144-tiled-{00001..00105}.tar"
val: "satflow-flow-144-tiled-{00106..00129}.tar"
test: "satflow-flow-144-tiled-{00130..00149}.tar"
bands:
[
"HRV",
Expand Down

0 comments on commit 85f7cc6

Please sign in to comment.