Spaces:

kakaobrain
/

karlo

Running on A10G

Is it possible to use Karlo's prior with the Stable Diffusion Variations model?

by beyondarmonia - opened Jan 1, 2023

Jan 1, 2023

Karlo's prior takes as input text embeddings and then converts them to CLIP image embeddings.

The SD image variations model takes CLIP image embeddings as conditioning to generate an actual image.

Is it possible to combine the two ( effectively replacing the Karlo's denoiser with SD ) like I'm imagining or am I missing something?

Kakao Brain org Jan 2, 2023

•

Hi @beyondarmonia , I think it's very reasonable to combine the prior (from karlo) with decoder (from SD). I've found that this PR (https://github.com/huggingface/diffusers/issues/1808) seems to implement your idea.

Jan 2, 2023

Thanks for the swift reply, @shkim-kb . Should have known someone would have already implemented it. Happy to know it works.

That PR is exactly what I was looking for. Thank you again.

Jan 16, 2023

Saehoon Kim, will you be able to eventually make a space for this hybrid model? I would love to try it out!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment