Hi! I was working on some tests for a PR to diffusers and noticed that the CLIP model has a vocab size of 99, but when I looked through the configs, the vocab size appears to be the standard 49408. Is this intended?