khanon
/

lora-training

Model card Files Files and versions Community

khanon commited on Feb 11, 2023

Commit

6506682

•

1 Parent(s): 862db6c

adds retrained Koharu LoRA

Browse files

Files changed (9) hide show

README.md +5 -0
koharu/README.md +40 -43
koharu/{00337-4289014929.png → chara-koharu-v3.png} +2 -2
koharu/{example-003-v2.png → chara-koharu-v3.safetensors} +2 -2
koharu/{example-001-v2.png → example-001-DefmixRed-v3.png} +2 -2
koharu/{example-002-v2.png → example-002-DefmixRed-v3.png} +2 -2
koharu/example-004-v2.png +0 -3
koharu/{lora_character_koharu_v2_180i6r-split_832_batch3_5e-5text_2e-4unet_3epoch.json → lora_chara_koharu_v3_183i4r.json} +22 -11
koharu/lora_character_koharu_v1_158i5r_768_batch3_5e-5text_1.5e-4unet_3epoch.json +0 -43

README.md CHANGED Viewed

@@ -32,6 +32,11 @@ Here you will find the various LoRAs I've trained, typically of Blue Archive cha
 ### Izuna
 [Available on old Mega.co.nz repository.](https://mega.nz/folder/SqYwQTRI#GN2SmGTBsV6S4q-L-V4VeA)
 ### Kokona
 [Sunohara Kokona / 春原ココナ / 스노하라 코코나 / 春原心奈](https://huggingface.co/khanon/lora-training/blob/main/kokona/README.md)

 ### Izuna
 [Available on old Mega.co.nz repository.](https://mega.nz/folder/SqYwQTRI#GN2SmGTBsV6S4q-L-V4VeA)
+### Koharu
+[Shimoe Koharu / 下江コハル / 시모에 코하루 / 下江小春](https://huggingface.co/khanon/lora-training/blob/main/koharu/README.md)
+[![Koharu](koharu/chara-koharu-v3.png)](https://huggingface.co/khanon/lora-training/blob/main/koharu/README.md)
 ### Kokona
 [Sunohara Kokona / 春原ココナ / 스노하라 코코나 / 春原心奈](https://huggingface.co/khanon/lora-training/blob/main/kokona/README.md)

koharu/README.md CHANGED Viewed

@@ -1,60 +1,57 @@
 # Shimoe Koharu (Blue Archive)
-Changed training methodology around for Koharu. It took way more time and effort due to the degree of manual tagging involved, but it turned out pretty well.
-I'll probably return to this one later to make further improvements now that I've got a much better handle on the impact of tagging and how to get the most out of larger datasets.  I don't expect to manual tag every future student, though.
 ## Usage
-Use any or all of these tags to summon Koharu:
-`koharu, 1girl, halo, pink eyes, ringed eyes, head wings, low wings, pink hair`
-Unlike previous LoRAs, the character's name does help this one somewhat.  You can probably omit her hair to save tokens.
-The vertical line running down her body appears consistently, but may not always reach past her chest because artists are inconsistent in how they draw it.  You can try to describe it literally: "vertical black line running past navel" or whatever.  Don't try `tattoo` unless you want womb tattoos.
 It does a decent, but not perfect job with her eyes. Adding some combination of `embarrassed`, `open mouth`, `swirly eyes` with varying degrees of emphasis can draw out her characteristic horny retard look.
 I tried to add the slit pupils expression and the model sorta gets it, but not very well.  You can prompt it with `slit pupils` and `flustered` but it generally creates abominations.
-For her normal Trinity outfit:
-`school uniform, off shoulder, hat, skirt`
-Some of her swimsuits are in there too.
-Weights from 0.8 - 1.05 should work well.
-### Important
-This LoRA may be more aggressive than others in forcing a close-up/portrait camera.  I believe this is because I scraped Booru tags for this one, and WD1.4 more reliably tags camera angles and image composition than human taggers.  You can mitigate this by always prompting for an angle or composition tag, like `above waist` or `cowboy shot` or `from above`.  You can combine them, too.
-Trying to prompt Koharu from behind or the side generally doesn't work very well -- it can render her back if you use `from behind` and `back focus`, but her wings will be attached to her stomach and her halo will be flipped,because the AI doesn't know how to generalize those traits to different angles and there's not enough training data for them.
 ## Training
-*All parameters are provided in the accompanying JSON files.*
-Koharu's training was handled substantially differently.
-- Trained on a heavily curated set of 183 images, most repeated 6 times. 1150 total steps.
-  - Dataset included a mixture of SFW and NSFW.
-  - Doubled the number of steps because the dataset was larger than usual.  I typically target 450 - 650.
-- New tagging methodology.  No WD1.4 tags; instead I scraped tags from Sankaku Complex using Hydrus and manually cleaned them up.
-  - Removed tons of shit tags
   - Made sure important traits were present and consitently described, and traits like `halo` were consistent with actual visibility
   - Pruned lots of redundant tags and simplified outfits.  There is no `black serafuku, long sleeves`, only Koharu's `school uniform`.
   - Added camera angles and image composition hints
   - Added facial expressions (particularly `embarrassed`) and unusual pupils when present
-- Different learning rate than usual.
-  - 5e-5 text encoder (typically 1e-5 ~ 2e-5)
-  - 2e-4 UNet (typically one order of magnitude faster than text)
-  - This was experimental -- human tags tend to be more varied, allowing for more expressiveness (WD1.4 did not do a good job with her) but potentially requiring more training. The dataset was also larger.
-- VAE removed. I usually train the dataset on the NAI VAE but after some tests, I think this was leading to oversaturated outputs and it does not play nicely with alternative VAEs.
-  - May offer a No VAE and a WD1.4 VAE in the future as these seem to present the best results across many configurations
-While I think the experimental things I tried out with this dataset worked out well enough to be called a success, tag cleanup took literal hours and I will probably not be able to put nearly so much effort into every character. I just really like Koharu.  I will probably retrain some old ones with at least the new hyperparameter methodologies, though.
-## To-do
-- More consistently tag NSFW/SFW/nudity
-- Add more image composition/camera angle tags
-- Find additional images with prominent swirly eyes
-- Improve tags for socks/shoes
-- Remove `halo` tag from images where it is just barely visible to force camera to pull further away
-- Un-fuck wings from side angle (folded wings tag?)
-- Add `looking away` / `facing away` to applicable images because it is impossible

 # Shimoe Koharu (Blue Archive)
+下江コハル (ブルーアーカイブ) / 시모에 코하루 (블루 아카이브) / 下江小春 (蔚藍檔案)
+Note: this is an older LoRA that I recently retrained.  I think the quality of the captions is lacking and as such this LoRA doesn't perform quite as well as it should.  I'll re-tag the dataset when I have time.
+[Download here.](chara-koharu-v3.safetensors)
+## Table of Contents
+- [Preview](#preview)
+- [Usage](#usage)
+- [Training](#training)
+- [Revisions](#revisions)
+## Preview
+![Koharu portrait](chara-koharu-v3.png)
+![Koharu preview](example-001-DefmixRed-v3.png)
+![Koharu preview 2](example-002-DefmixRed-v3.png)
 ## Usage
+Use any or all of the following tags to summon Koharu: `koharu, 1girl, halo, pink eyes, ringed eyes, head wings, low wings, pink hair, blue archive`
+- Hair and eye tags are optional.
+- The vertical line should appear automatically, but may not always reach past her chest because artists are inconsistent in how they draw it.  You can try to describe it literally: `vertical black line running past navel` or whatever.  Don't try `tattoo` unless you want womb tattoos.
+For her normal Trinity outfit: `school uniform, off shoulder, hat, skirt`
 It does a decent, but not perfect job with her eyes. Adding some combination of `embarrassed`, `open mouth`, `swirly eyes` with varying degrees of emphasis can draw out her characteristic horny retard look.
 I tried to add the slit pupils expression and the model sorta gets it, but not very well.  You can prompt it with `slit pupils` and `flustered` but it generally creates abominations.
+Some of her swimsuits are in the training data, too.
 ## Training
+*Exact parameters are provided in the accompanying JSON files.*
+- Trained on a set of 183 images; 170 normal, 13 slit pupils/flustered.
+  - 4 repeats for normal
+  - 5 repeats for flustered expression outfit
+  - 3 batch size, 7 epochs
+  - `(170*4 + 13*5) / 3 * 7` = 1739 steps
+- 832x832 training resolution
+- `constant_with_warmup` scheduler
+- Initially tagged using scraped Danbooru tags, then heavily edited.
+  - Removed many shit/inaccurate tags
   - Made sure important traits were present and consitently described, and traits like `halo` were consistent with actual visibility
   - Pruned lots of redundant tags and simplified outfits.  There is no `black serafuku, long sleeves`, only Koharu's `school uniform`.
   - Added camera angles and image composition hints
   - Added facial expressions (particularly `embarrassed`) and unusual pupils when present
+- Used network_dimension 128 (same as usual) / network alpha 128 (default)
+- Trained without VAE.
+- [Training dataset available here.](https://mega.nz/folder/Wi4jRZbJ#OHhH-qsltCEbks3GF2gqmg)
+## Revisions
+- v3 (2023-02-11)
+  - Re-trained with more recent parameters. No changes to dataset.
+  - Still overfit to her standard outfit.  Needs re-tagging.
+- v2 (2023-01-15)
+  - Initial release.
+  - [Old version can be downloaded here.](https://mega.nz/folder/Wi4jRZbJ#OHhH-qsltCEbks3GF2gqmg)

koharu/{00337-4289014929.png → chara-koharu-v3.png} RENAMED Viewed

File without changes

koharu/{example-003-v2.png → chara-koharu-v3.safetensors} RENAMED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1258a8c61ecad917c730f013cd6f57fa752a425f1e08b78d60f2958e1375b85a
-size 2267393

 version https://git-lfs.github.com/spec/v1
+oid sha256:290039bfc75cd18e269e4ffdd4b3bc31a4d39c7fbfb38d9507059133ade725ca
+size 151132730

koharu/{example-001-v2.png → example-001-DefmixRed-v3.png} RENAMED Viewed

File without changes

koharu/{example-002-v2.png → example-002-DefmixRed-v3.png} RENAMED Viewed

File without changes

koharu/example-004-v2.png DELETED Viewed

Git LFS Details

SHA256: 45159a620dba4e945b698d747a5fe7f7830a2df9092508c5c158be597bbb1c0c
Pointer size: 132 Bytes
Size of remote file: 1.62 MB

koharu/{lora_character_koharu_v2_180i6r-split_832_batch3_5e-5text_2e-4unet_3epoch.json → lora_chara_koharu_v3_183i4r.json} RENAMED Viewed

@@ -3,21 +3,22 @@
   "v2": false,
   "v_parameterization": false,
   "logging_dir": "",
-  "train_data_dir": "G:/sd/training/datasets/koharu",
-  "reg_data_dir": "G:/sd/training/datasets/regempty",
-  "output_dir": "G:/sd/repo/extensions/sd-webui-additional-networks/models/lora",
   "max_resolution": "832,832",
-  "lr_scheduler": "cosine_with_restarts",
   "lr_warmup": "5",
   "train_batch_size": 3,
-  "epoch": "3",
-  "save_every_n_epochs": "1",
   "mixed_precision": "fp16",
   "save_precision": "fp16",
   "seed": "31337",
   "num_cpu_threads_per_process": 32,
-  "cache_latent": true,
-  "caption_extention": ".txt",
   "enable_bucket": true,
   "gradient_checkpointing": false,
   "full_fp16": false,
@@ -30,8 +31,8 @@
   "save_state": false,
   "resume": "",
   "prior_loss_weight": 1.0,
-  "text_encoder_lr": "5e-5",
-  "unet_lr": "2e-4",
   "network_dim": 128,
   "lora_network_weights": "",
   "color_aug": false,
@@ -39,5 +40,15 @@
   "clip_skip": 2,
   "gradient_accumulation_steps": 1.0,
   "mem_eff_attn": false,
-  "output_name": "koharu-v2-NoVAE"
 }

   "v2": false,
   "v_parameterization": false,
   "logging_dir": "",
+  "train_data_dir": "G:/sd/training/datasets/koharu/dataset",
+  "reg_data_dir": "",
+  "output_dir": "G:/sd/lora/trained/koharu",
   "max_resolution": "832,832",
+  "learning_rate": "1e-5",
+  "lr_scheduler": "constant_with_warmup",
   "lr_warmup": "5",
   "train_batch_size": 3,
+  "epoch": "7",
+  "save_every_n_epochs": "6",
   "mixed_precision": "fp16",
   "save_precision": "fp16",
   "seed": "31337",
   "num_cpu_threads_per_process": 32,
+  "cache_latents": true,
+  "caption_extension": ".txt",
   "enable_bucket": true,
   "gradient_checkpointing": false,
   "full_fp16": false,
   "save_state": false,
   "resume": "",
   "prior_loss_weight": 1.0,
+  "text_encoder_lr": "1.5e-5",
+  "unet_lr": "1.5e-4",
   "network_dim": 128,
   "lora_network_weights": "",
   "color_aug": false,
   "clip_skip": 2,
   "gradient_accumulation_steps": 1.0,
   "mem_eff_attn": false,
+  "output_name": "chara-koharu-v1",
+  "model_list": "",
+  "max_token_length": "150",
+  "max_train_epochs": "",
+  "max_data_loader_n_workers": "",
+  "network_alpha": 128,
+  "training_comment": "Character: `koharu, 1girl, halo, pink eyes, ringed eyes, head wings, low wings, pink hair`\nStandard outfit: `school uniform, off shoulder, hat, skirt`\nExpression: `embarrassed, open mouth, swirly eyes, @_@`\n(170 normal * 4 repeats + 13 flustered * 5 repeats) / 3 batch size * 7 epochs = 1738 steps",
+  "keep_tokens": 2,
+  "lr_scheduler_num_cycles": "",
+  "lr_scheduler_power": "",
+  "persistent_data_loader_workers": true
 }

koharu/lora_character_koharu_v1_158i5r_768_batch3_5e-5text_1.5e-4unet_3epoch.json DELETED Viewed

@@ -1,43 +0,0 @@
-{
-  "pretrained_model_name_or_path": "G:/sd/repo/models/Stable-diffusion/nai-animefull-final-pruned.safetensors",
-  "v2": false,
-  "v_parameterization": false,
-  "logging_dir": "",
-  "train_data_dir": "G:/sd/training/datasets/koharu",
-  "reg_data_dir": "G:/sd/training/datasets/regempty",
-  "output_dir": "G:/sd/repo/extensions/sd-webui-additional-networks/models/lora",
-  "max_resolution": "768,768",
-  "lr_scheduler": "cosine_with_restarts",
-  "lr_warmup": "5",
-  "train_batch_size": 3,
-  "epoch": "3",
-  "save_every_n_epochs": "1",
-  "mixed_precision": "fp16",
-  "save_precision": "fp16",
-  "seed": "31337",
-  "num_cpu_threads_per_process": 32,
-  "cache_latent": true,
-  "caption_extention": ".txt",
-  "enable_bucket": true,
-  "gradient_checkpointing": false,
-  "full_fp16": false,
-  "no_token_padding": false,
-  "stop_text_encoder_training": 0,
-  "use_8bit_adam": true,
-  "xformers": true,
-  "save_model_as": "safetensors",
-  "shuffle_caption": true,
-  "save_state": false,
-  "resume": "",
-  "prior_loss_weight": 1.0,
-  "text_encoder_lr": "5e-5",
-  "unet_lr": "1.5e-4",
-  "network_dim": 128,
-  "lora_network_weights": "",
-  "color_aug": false,
-  "flip_aug": false,
-  "clip_skip": 2,
-  "gradient_accumulation_steps": 1.0,
-  "mem_eff_attn": false,
-  "output_name": "koharu-v1-NoVAE"
-}