updates Kazusa tagging methodology dog
Browse files- kazusa/tagging methodology.md +60 -27
kazusa/tagging methodology.md
CHANGED
@@ -1,14 +1,37 @@
|
|
1 |
-
Tagging methodology for Kazusa (blue archive)
|
2 |
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
|
5 |
- Tag unique features
|
6 |
- `halo` / `demon horns` / `low wings`
|
7 |
- Remove when not present or out of view. WD1.4 likes putting `halo` even on images where no halo is visible.
|
8 |
-
- Kazusa
|
9 |
-
- Pruned `extra ears` as it seems redundant.
|
10 |
- Tag outfit variants with a single master tag
|
11 |
-
- Kazusa
|
12 |
- Uniform: `school uniform` / `black jacket`
|
13 |
- Sometimes the jacket appears without anything else, which was not tagged `school uniform`
|
14 |
- Non-canon costumes
|
@@ -16,24 +39,26 @@ Start with WD1.4 Swinv2 at 0.25 confidence.
|
|
16 |
- Nudity (WD1.4 usually does this accurately)
|
17 |
- `nude` / `completely nude`
|
18 |
- Prune eye colors
|
19 |
-
- Keep tags which describe unusual eye features (`multicolored eyes`, `heterochromia`, `slit pupils`) as they can otherwise be too subtle and inconsistently drawn for the AI to
|
20 |
- Prune hair colors
|
21 |
- This includes `two-toned hair`, `gradiant hair`, etc. The AI learns all of these very consistently without the tags, likely because artists tend to draw them consistently
|
22 |
- Partially prune hair styles
|
23 |
-
- Leave key style tags like `twintails`, `ponytail`, `short hair with long locks
|
24 |
- Prune exceedingly common tags like `bangs` / `sidelocks` / `eyebrows visible through hair` / `hair between eyes`, etc.
|
|
|
25 |
- Prune length, except for images which differ from the character's usual length
|
26 |
-
-
|
27 |
-
|
|
|
28 |
- Fixup hair ornaments
|
29 |
- Prune generic `hair ornament` in favor of more specificity
|
30 |
- `hairclip` / `black headband` / `hair flower` / `hair ribbon`, etc.
|
31 |
- Consolidate tags that have color variants (`headband` >> `black headband`)
|
32 |
-
- Kazusa
|
33 |
- Consolidate outfits
|
34 |
- Only tag an item when it is actually visible. If it is only barely visible along the edge of an image, keep in mind it may be cropped during bucketing.
|
35 |
-
- Danbooru's wiki entry for a character often provides a good list of tags
|
36 |
-
- Kazusa outfits
|
37 |
- School Uniform
|
38 |
- `black choker`
|
39 |
- `hooded jacket`
|
@@ -47,42 +72,50 @@ Start with WD1.4 Swinv2 at 0.25 confidence.
|
|
47 |
- `sneakers`
|
48 |
- Fixup sleeves
|
49 |
- ie. `long sleeves` / `puffy long sleeves` / `detached sleeves`
|
50 |
-
- You only need one, but pick one and be consistent. If sleeves aren't tagged
|
51 |
- Fixup collars
|
52 |
- ie. `detached collar` / `collared shirt` / `choker` / etc.
|
53 |
-
- Same deal as sleeves, they tend to appear when unwanted if not consistently tagged according to visibility
|
54 |
- Fixup clothing state
|
55 |
- ie. `open jacket` / `open shirt` / `partially undressed` / `off shoulder`
|
56 |
-
- The tagger is generally
|
57 |
- Tag expressions
|
58 |
-
-
|
59 |
- Start by searching for images without one of these, and add them.
|
60 |
- `open mouth`
|
61 |
- `closed mouth`
|
62 |
- `parted lips`
|
63 |
-
|
64 |
-
-
|
|
|
|
|
65 |
- `wavy mouth` / `embarrassed`
|
66 |
-
- `
|
|
|
|
|
67 |
- `surprised` / `o_o` / `wide-eyed`
|
68 |
- `upset` / `annoyed` / `frustrated` / `v-shaped eyebrows`
|
69 |
- `naughty face` / `seductive smile`
|
70 |
-
- `smug` / `:3`
|
|
|
71 |
- `eyes closed` / `one eye closed`
|
72 |
-
- WD1.4
|
73 |
- Tag camera angles/composition
|
|
|
74 |
- `cowboy shot`
|
75 |
- `upper body`
|
76 |
- `full body`
|
77 |
- `portrait`
|
78 |
-
- `cropped torso` / `cropped legs`
|
79 |
- `feet out of frame`
|
|
|
80 |
- `from side` / `from above` / `from below` / `from behind`
|
81 |
-
- Tag iconic poses
|
82 |
-
-
|
83 |
-
-
|
|
|
|
|
84 |
- `mouth hold`
|
85 |
- `eating`
|
86 |
-
- `
|
87 |
- Flip through each image and use Hydrus's "related tags" feature to quickly identify important tags that might be missing.
|
88 |
-
- This feature looks at other images with similar tags to provide suggestions.
|
|
|
1 |
+
# Tagging methodology for Kazusa (blue archive)
|
2 |
|
3 |
+
## README / Intro
|
4 |
+
Since I've seen a few people share this already I'll provide this disclaimer.
|
5 |
+
|
6 |
+
This is not really intended to be a guide, it's just an log/checklist of my process, for my own benefit, since I repeat this for a lot of LoRAs and I got tired of winging it every single time. I've put only the slightest amount of effort into making it accessible to others.
|
7 |
+
|
8 |
+
I don't claim that any or all of these optimal, nor can I confidently put them forth as recommendations. They're literally just a record of the steps I follow while tagging, gradually developed after ~16 characters using some version of the below process.
|
9 |
+
|
10 |
+
Still, I can at least point to my pre-Koharu LoRAs (which used pure WD1.4 tags) and the ones that came after (where I started heavily editing tags) and see a steady progression in quality and prompting flexibility despite using mostly the same training settings for each one.
|
11 |
+
|
12 |
+
Yes, it takes forever to do all of this shit. No, I don't recommend it unless you're extremely autistic; raw WD1.4 tags are probably good enough for most people. If you intend to do this for more than a few characters, I strongly recommend learning [Hydrus](https://hydrusnetwork.github.io/hydrus/introduction.html) it makes all of this way, way less tedious compared to doing it with crappier tools.
|
13 |
+
|
14 |
+
---
|
15 |
+
|
16 |
+
## Prep
|
17 |
+
|
18 |
+
- Scraped `1girl kazusa_(blue_archive) order:popularity` from sancom, curated for quality, then exported from Hydrus to feed into WD1.4 Tagger.
|
19 |
+
- Kazusa has a shitload of good art so I had to be very picky to get down to 280 images, which is still a lot. In hindsight I think huge datasets aren't really a problem; they let you train for longer without overfitting.
|
20 |
+
- Gelbooru is probably fine too. Danbooru sucks for loli unless you have Gold.
|
21 |
+
- I also got a few newer images from pixiv, don't remember which ones.
|
22 |
+
- Exported final images from Hydrus to feed into WD1.4 Tagger
|
23 |
+
- Auto-tagged with WD1.4 Swinv2 at 0.25 confidence
|
24 |
+
- Reimported images+tags into Hydrus using the .txt sidecar feature. I strongly recommend putting WD1.4 tags in a separate tag domain so they aren't mixed in with shit scraped from boorus.
|
25 |
+
|
26 |
+
## Tagging
|
27 |
|
28 |
- Tag unique features
|
29 |
- `halo` / `demon horns` / `low wings`
|
30 |
- Remove when not present or out of view. WD1.4 likes putting `halo` even on images where no halo is visible.
|
31 |
+
- **Kazusa**: `halo` / `animal ears`
|
32 |
+
- Pruned `extra ears` as it seems redundant and intrinsic to the character.
|
33 |
- Tag outfit variants with a single master tag
|
34 |
+
- **Kazusa**:
|
35 |
- Uniform: `school uniform` / `black jacket`
|
36 |
- Sometimes the jacket appears without anything else, which was not tagged `school uniform`
|
37 |
- Non-canon costumes
|
|
|
39 |
- Nudity (WD1.4 usually does this accurately)
|
40 |
- `nude` / `completely nude`
|
41 |
- Prune eye colors
|
42 |
+
- Keep tags which describe unusual eye features (`multicolored eyes`, `heterochromia`, `slit pupils`) as they can otherwise be too subtle and inconsistently drawn for the AI to notice
|
43 |
- Prune hair colors
|
44 |
- This includes `two-toned hair`, `gradiant hair`, etc. The AI learns all of these very consistently without the tags, likely because artists tend to draw them consistently
|
45 |
- Partially prune hair styles
|
46 |
+
- Leave key, defining style tags like `twintails`, `ponytail`, `short hair with long locks`, `twin braids`, etc.
|
47 |
- Prune exceedingly common tags like `bangs` / `sidelocks` / `eyebrows visible through hair` / `hair between eyes`, etc.
|
48 |
+
- Somewhat arbitrary, but I just don't think there's much value in them because they're ubiquitous and caption space is limited
|
49 |
- Prune length, except for images which differ from the character's usual length
|
50 |
+
- If you don't do this, it's more likely to get the hair length wrong when not prompted, which isn't a huge deal.
|
51 |
+
- Add `alternate hairstyle` and/or `alternate hair length` on applicable images, which can be used to more easily change styles while prompting
|
52 |
+
- **Kazusa**: `short hair, colored inner hair` -- while I would usually prune these, they're really her only defining hairstyle traits
|
53 |
- Fixup hair ornaments
|
54 |
- Prune generic `hair ornament` in favor of more specificity
|
55 |
- `hairclip` / `black headband` / `hair flower` / `hair ribbon`, etc.
|
56 |
- Consolidate tags that have color variants (`headband` >> `black headband`)
|
57 |
+
- **Kazusa**: `hairclip`
|
58 |
- Consolidate outfits
|
59 |
- Only tag an item when it is actually visible. If it is only barely visible along the edge of an image, keep in mind it may be cropped during bucketing.
|
60 |
+
- Danbooru's wiki entry for a character often provides a good list of tags for a character's entire outfit.
|
61 |
+
- **Kazusa outfits**:
|
62 |
- School Uniform
|
63 |
- `black choker`
|
64 |
- `hooded jacket`
|
|
|
72 |
- `sneakers`
|
73 |
- Fixup sleeves
|
74 |
- ie. `long sleeves` / `puffy long sleeves` / `detached sleeves`
|
75 |
+
- You only need one, but pick one and be consistent. If sleeves aren't tagged the AI tends to add them inappropriately (such as when prompting for sleeveless outfits or nudity)
|
76 |
- Fixup collars
|
77 |
- ie. `detached collar` / `collared shirt` / `choker` / etc.
|
78 |
+
- Same deal as sleeves, they tend to appear when unwanted if not consistently tagged according to actual visibility
|
79 |
- Fixup clothing state
|
80 |
- ie. `open jacket` / `open shirt` / `partially undressed` / `off shoulder`
|
81 |
+
- The tagger is generally good at this but it can help to double-check for weird outfits
|
82 |
- Tag expressions
|
83 |
+
- This is tedious and the autotagger doesn't help you out much, but tagging these can really help the AI nail multiple iconic expressions for a character
|
84 |
- Start by searching for images without one of these, and add them.
|
85 |
- `open mouth`
|
86 |
- `closed mouth`
|
87 |
- `parted lips`
|
88 |
+
- Sometimes applies with `open mouth`
|
89 |
+
- Then proceed through each image and add one of these
|
90 |
+
- `smile` / `light smile` / `:d` / `grin` (exposed teeth only)
|
91 |
+
- `:o` / `:<` / `expressionless` / `serious`
|
92 |
- `wavy mouth` / `embarrassed`
|
93 |
+
- `pout` / `:t` / `tsundere`
|
94 |
+
- `nervous` / `nervous smile`
|
95 |
+
- `flustered` / `swirly eyes` / `@_@`
|
96 |
- `surprised` / `o_o` / `wide-eyed`
|
97 |
- `upset` / `annoyed` / `frustrated` / `v-shaped eyebrows`
|
98 |
- `naughty face` / `seductive smile`
|
99 |
+
- `smug` / `:3` / `smirk`
|
100 |
+
- `yelling` / `frown`
|
101 |
- `eyes closed` / `one eye closed`
|
102 |
+
- WD1.4 almost always gets these two
|
103 |
- Tag camera angles/composition
|
104 |
+
- Most of these aren't very high value, but `from x` can be helpful.
|
105 |
- `cowboy shot`
|
106 |
- `upper body`
|
107 |
- `full body`
|
108 |
- `portrait`
|
|
|
109 |
- `feet out of frame`
|
110 |
+
- `cropped torso` / `cropped legs`
|
111 |
- `from side` / `from above` / `from below` / `from behind`
|
112 |
+
- Tag iconic poses, actions, or props
|
113 |
+
- Props need to show up often in training data for this to be worth it.
|
114 |
+
- `v` / `peace sign` / `standing on one leg`
|
115 |
+
- `holding dango` / `weapon case` / `fashion magazine`
|
116 |
+
- **Kazusa**
|
117 |
- `mouth hold`
|
118 |
- `eating`
|
119 |
+
- `macaron`
|
120 |
- Flip through each image and use Hydrus's "related tags" feature to quickly identify important tags that might be missing.
|
121 |
+
- This feature looks at other images with similar tags to provide suggestions. Good for spotting things you or the tagger might have missed.
|