-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open-source model definitions #1
Comments
I can try to get a more human readable version of this pushed, but does |
It is not quite as readable, as you need to plumbing through varies wrappers, for example, you get at high-level:
and how the
And from there, you need to dig into various of python files to find Conv2d configuration etc. BTW, this is just unzip the |
Quick and lazy, just ported the example code. Can't do much more without having the model arch. openai/consistencydecoder#1
Uploaded weights and "pseudo code" with correct hparams which contribute to the weight. |
The code above looks very much like a conditional UNet with concat conditioning (except that latents are upscaled by 8x using nearest neighbor upsampling). So for a latent of 4x32x32, it would be upsampled to 4x256x256 and then concatenated with the noisy input (3x256x256), then it looks like a regular UNet. |
@mrsteyk's code worked for me after some minor edits 👍 |
Yeah, I realised I messed up skip connections when I went to sleep. Ups originally didn’t have 4 non resizing ConvResblocks |
Thanks for this commit. Did you test |
Hey @gabgoh,
Super cool that you're open-sourcing the consistency decoder of Dalle-3 with a MIT license ❤️
Any chance you can also add the model definitions of the torch.jit binary? Otherwise it'll be quite difficult to port the model to other libraries.
The text was updated successfully, but these errors were encountered: