Very good model, but I've encountered an issue.

#1
by JLGY - opened

It seems to be a mix of WD tagger and natural language caption processing, and there's also a feature "named XXX who"...
Although most of the detections are incorrect,
it’s useful to use a script to replace them place.

However, I’ve found a problem: it’s easy for legs to be misinterpreted as "legs apart,"
for example, if the action is "crossed legs" but it gets detected as "legs apart."

I’m not sure why. If you use the original large model and WDtagger, it still detects "crossed legs."
It might be necessary to check how the dataset is constructed.

this could be a problem caused by inaccurate tags from civitai data. v1 is trained on new datasets so many of the problem mentioned should be improved in v1

Hi, can you share the training dataset?

Sign up or log in to comment