-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
upsample noise to concatenate with quantied representation #13
Comments
Hello,
|
Very appropriate your timely and generous reply. |
To produce the images in their paper, the authors train ADE20k for 50 epochs using the semantic label maps as additional information. I haven't tried training on ADE20k yet, because I don't have the compute power to spare right now but I will update the readme when I do. One possible explanation for the disparity in image quality is that the authors incorporate a VGG loss (Section 5.2 in the paper) based on intermediate activations in the VGG network, which I haven't implemented yet. Of course if the results are bad for general images despite appearing to work well on street scenes, it is highly possible there is a mistake in the implementation somewhere. |
Okay, I got. I will train ADE20k again without semantic map and with VGG loss. Thanks a lot. |
@Justin-Tan We expect your compress effect on ADE20k dataset, can you try it? @Jillian2017 and me try it, but the effect is not so good.Maybe there need some code modification for ADE20k. Thank you very much! |
Yes, I think implementing the VGG perceptual loss may help. Unfortunately I am quite busy at the moment, but it is top of the to-do list. |
Hi, while reading the paper, it proposed an optionally choice to concatenate the representaion with noise v. In your code, a dcgan_generator is used to generate noise v, while the output size is [,32,64,32]. If the dataset is cityscapes, the input image is resized to 5121024, the feature maps' size is 3264C( C=8 ),so I think the concatenated feature maps-z's size is [,32,64,32+8]=[,32,64,40], questions are:
1> why generate the noise by dcgan network?
2> if the input size changes, for example I train the ADE20k with the input size 512*512, then the noise'size cannot be concatenated to the quantized representation, so we need to change the dcgan network?
3> adding noise will increase the bpp by a large margin, as the its output size is too big.
These are my questions, looking forward to your reply.
Thanks for sharing your code.
generate the noise
The text was updated successfully, but these errors were encountered: