Support non-RGB images with --img_channels option #14

mdraw · 2019-05-31T22:07:57Z

The expected number of image channels can now be expressed through the
--img_channels option. This option changes the generator and
discriminator architectures to generate/expect the given channel count
and changes the data loading mechanism to expect and - if necessary -
convert images to have this number of channels.

Fixes the issue with occasional grayscale images in RGB photo data sets
reported in I have met this error when run train.py ... #5 because if --img_channels=3 (default), grayscale images
are automatically converted to RGB in preprocessing.
Adds support for training with grayscale images, as requested in I have met this error when run train.py ... #5.
Should make it easier to implement training with multi-channel images that
have more than 3 channels (which I'm planning to try soon).
Enables training with RGB data sets in grayscale mode by simply setting
--img_channels=1.

Disclaimer: I have not tested my changes with the usual data sets for image synthesis. I have only tried it with two small toy data sets: one of RGB images and one of grayscale images. My test trainings run without errors and the generated data visualization works, but my trainings don't converge yet and probably need some hyperparameter tuning (the discriminator loss is at 0 very often).

The expected number of image channels can now be expressed through the "--img_channels" option. This option changes the generator and discriminator architectures to generate/expect the given channel count and changes the data loading mechanism to expect and - if necessary - convert images to have this number of channels. - Fixes the issue with occasional grayscale images in RGB photo data sets reported in akanimax#5 because if --img_channels=3 (default), grayscale images are automatically converted to RGB in preprocessing. - Adds support for training with grayscale images, as requested in akanimax#5. - Makes it easier to implement training with multi-channel images that have more than 3 channels (which I'm planning to try soon). - Enables training with RGB data sets in grayscale mode by simply setting --img_channels=1.

akanimax · 2019-06-02T05:59:38Z

@mdraw, Thanks a lot for the PR.
I'll review it in a couple of days. Thanks again!

Best regards,
@akanimax

mdraw mentioned this pull request May 31, 2019

I have met this error when run train.py ... #5

Open

mdraw force-pushed the img_channels branch from 6e67e54 to 6cfd445 Compare June 1, 2019 00:12

akanimax mentioned this pull request Dec 28, 2019

Working with Gray scale images akanimax/msg-gan-v1#11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support non-RGB images with --img_channels option #14

Support non-RGB images with --img_channels option #14

mdraw commented May 31, 2019

akanimax commented Jun 2, 2019

Support non-RGB images with --img_channels option #14

Are you sure you want to change the base?

Support non-RGB images with --img_channels option #14

Conversation

mdraw commented May 31, 2019

akanimax commented Jun 2, 2019