Base model for mono-channel completion.
A tiny vision-language model.
An end-to-end (e2e) Voice Language Model by Fish Audio.