kony1337's picture
Duplicate from haoheliu/audioldm-text-to-audio-generation
24f6ec0
raw
history blame contribute delete
296 Bytes
{
"embed_dim": 768,
"vision_cfg": {
"image_size": 224,
"layers": 24,
"width": 1024,
"patch_size": 14
},
"text_cfg": {
"context_length": 77,
"vocab_size": 49408,
"width": 768,
"heads": 12,
"layers": 12
}
}