File size: 8,654 Bytes
e2872da
 
 
1a87873
 
e2872da
 
 
 
 
 
 
 
162aa08
e2872da
 
 
1a87873
 
d40efd6
4420aac
e2872da
1a87873
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e2872da
 
3bc28d4
 
e2872da
 
 
 
162aa08
92a413c
162aa08
e2872da
8e49d81
e2872da
8e49d81
e2872da
 
 
 
 
 
 
 
 
73c5e5f
162aa08
e2872da
73c5e5f
58cd071
2383c03
e2872da
 
 
19afc43
e2872da
 
73c5e5f
d33311f
162aa08
 
d33311f
e2872da
162aa08
e2872da
 
162aa08
 
e2872da
 
 
162aa08
e2872da
 
 
 
 
 
 
 
162aa08
e2872da
 
162aa08
e2872da
 
 
 
162aa08
e2872da
162aa08
 
e2872da
162aa08
112bdb7
1a87873
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
---
license: apache-2.0
tags:
- Flux
- FluxPipeline
- text-to-image
- flux schnell
- image-generation
- flux-diffusers
- diffusers
- photo
- realism
pipeline_tag: text-to-image
library_name: diffusers
emoji: 🔜
language:
- en
base_model:
- black-forest-labs/FLUX.1-schnell
- mikeyandfriends/PixelWave_FLUX.1-schnell_01
instance_prompt: HST style autochrome photo
widget:
- text: >-
    HST style autochrome film photograph portrait of 1910 woman playing poker
    against a purple feathered dinosaur, the green-eyed woman has moderately
    blemished skin with visible lines and pores, she smiles, film grain,
    Kodachrome
  output:
    url: Historic3.png
- text: film photograph portrait of a woman
  output:
    url: Historic8.png
- text: >-
    film photograph portrait of blonde woman dancing with a purple feathered
    dinosaur, the green-eyed woman has moderately blemished skin with visible
    lines and pores, she smiles, film grain, Kodachrome
  output:
    url: Historic4.png
---
<Gallery />

# **Historic Color Soon® (by A.C.T. Soon®)**
Herein lives a versatile visual + textographic generative model especially catered to: <br>
- Quality generation at a low step-count (2 to 8, for most scenarios), with 4-step inference at around 768x768 routinely producing photorealistic outputs at a quality comparable to or even surpassing that of Flux v.1 Dev.  <br>
- Producing realistic images reminiscent of color film analog photography, exhibiting parallels to a broad spectrum of iconic instrumentalities and visual paradigms, from Autochrome-to-Kodachrome-to-Fujifilm-and-beyond. <br>
- Producing visuals with a vaguely "historical" or "lived-in" aesthetic character, striking chromaticity and luminosity dynamics, as well as textural/anatomical/skin details more reliably lifelike than other models at a comparable step-count/resource expenditure. <br>

![film photograph portrait of blonde woman playing poker with a dinosaur, the green-eyed woman has moderately blemished skin with visible lines and pores, she smiles, film grain](Historic4.png)


# **Foundations/Methods/Tools:**
- Primary Base Model: [FLUX.1-schnell](https://huggingface.co/black-forest-labs/FLUX.1-schnell) by Black Forest Labs.<br>
- Secondary Base Model: [Pixelwave Schnell V.1](https://huggingface.co/mikeyandfriends/PixelWave_FLUX.1-schnell_01/) by [HumbleMikey](https://civitai.com/user/humblemikey). The Safetensors & GGUF weights for it are found [here](https://huggingface.co/mikeyandfriends/PixelWave_FLUX.1-schnell_01/tree/main) at HF, and/or via the model's CivitAI page [here](https://civitai.com/models/141592?modelVersionId=778964)).<br>
- Finetuning (of Pixelwave Schnell V.1), mainly on around 500 color photographs taken during the 1900s and 1910s by Sergey Prokudin-Gorsky, who traveled and photographed widely in those years while pioneering and perfecting implementations of an early three-color-composite photography technique.<br> 
We urge you to explore the work of Prokudin-Gorsky for yourself, at the wonderfully organized online [archive at this link](https://prokudin-gorsky.org/), featuring many hundreds of high quality downloadable scans of composite color photo prints from the photographer's original glass plate negatives, available at this site alongside relatively recent restorations of a substantial portion of the images.<br>
The original glass-plate negatives, alongside a large archive of prints (and other relevant artifacts, documents, and exhibits) are currently held at and administrated by the Library of Congress in Washington, DC, USA.<br> 
- Full-rank LoRA fine-tuning of FLUX.1-schnell on 130 photographs from the same dataset, to create the first LoRA prefiguration of this model: the [Historic Color Schnell LoRA](https://huggingface.co/AlekseyCalvin/historic_color_schnell).<br> 
- Experiments with training five more LoRAs, using similar datasets, and targetting only select blocks in Schnell (and/or Dev).<br>   
- Merging the most successful among the latter results with Historic Color Schnell LoRA, using the 'flux_merge_lora.py' script from the sd3-branch /networks of [Kohya-ss's sd-scripts git](https://github.com/kohya-ss/sd-scripts/tree/sd3), whilst – wherever neccessary – converting the LoRAs (between ai-toolkit/Diffusers-adjacent and the Kohya/sd-scripts formats) using the 'convert_flux_lora.py' script at the same source.<br> 
- Merging the resulting LoRa into the checkpoint finetuned earlier from Pixelwave Schnell v.1.<br>  
- Ostris' training adapter for Schnell found here: [ostris/FLUX.1-schnell-training-adapter](https://huggingface.co/ostris/FLUX.1-schnell-training-adapter).<br>  

## Trigger Words:
You may use `HST` to bolster the vintage/autochrome effect. Though, in contrast with the Historic Color LoRA(s), the trigger does not appear to be quite as impactful towards the checkpoint.<br>

## Historical Note:
Prokudin-Gorsky's color photography technique would involve three photo-exposures, either simultaneous or sequential, using specialized color-spectrum filters (basically R.B.G.: red, blue, and green), rendering a subject/shot onto glass plates covered with light-emulsive mixture.<br> 
The photographer's focus on refining the developer and filter quality, in tandem with his incessant and wide-ranging experimentation, and his artful optimizations of glass plates (generally unwieldly, esp. for color, and by the 1910's already becoming outmoded for B&W on-location shoots, though elsewise extra reliable) ultimately led him to produce a color photography oeuvre of much greater fidelity and vividness than achieved by most of his contemporaries.<br>  
At the same time, the peculiarities of the photographer's method, coupled with his exceptionally hands-on execution thereof, would manifest in a range of idyosyncratic color, light, and motion artifacts common across the resulting prints.<br>  
Seldom marring the image as a whole, and less grave than the weaknesses of some contemporenously emerging autochrome techniques, the warm color hazes and flares framing many of Prokudin-Gorsky's prints may be seen as a kind of ephemeral signature.<br> 
Alongside some of the more subtle chromatic, textural, and (in some measure) figural characteristics of his work, these auras have reliably imprinted themselves into this Flux Schnell LoRA, the fourth in our series of historical adapters for Flux.<br>  

![film photograph portrait of blonde woman with a dinosaur, the green-eyed woman has moderately blemished skin with visible lines and pores, she smiles, film grain](Historic6.png)

## Download Model:
Weights for 'Historic Color SOON®' are available in Safetensors format in [a separate repo](/AlekseyCalvin/HistoricColorFLUXsoonr/tree/main).<br>
[Download](/AlekseyCalvin/HistoricColorFLUXsoonr/tree/main) them in the Files & versions tab.<br>
<br>
The model is available in 🧨 Diffusers format [within this repo](/AlekseyCalvin/HistoricColorSoonr_Schnell/tree/main). <br>

![film photograph portrait of a woman running with Pikachu, the woman has moderately blemished skin with visible lines and pores, she smiles, film grain](Historic7.png)


## Diffusers:<br>
To use `Historic Color SOON®` with the 🧨 diffusers python library, first install or upgrade diffusers:<br>
```shell
pip install -U diffusers
```
Then you can use `FluxPipeline` to run the model:
```python
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained("Shakker-Labs/AWPortrait-FL", torch_dtype=torch.bfloat16)
pipe.to("cuda")
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power

prompt = "HST style autochrome film photograph portrait of 1910 woman playing poker against a purple feathered dinosaur, the green-eyed woman has moderately blemished skin with visible lines and pores, she smiles, film grain, Kodachrome"
image = pipe(
    prompt,
    guidance_scale=1.2,
    num_inference_steps=4,
    max_sequence_length=256,
    generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("hstcolor1.png")
```
To learn more check out the [diffusers](https://huggingface.co/docs/diffusers/main/en/api/pipelines/flux) documentation.
<br>

Also, if you're into literature broadly and old modernist poetry specifically, check out my verse translations at [SILVER AGE POETS](https://www.SilverAgePoets.com/the-poets-and-their-stories)!

![hst style autochrome film photograph portrait of 1910 woman playing poker against a purple feathered dinosaur, the green-eyed woman has moderately blemished skin with visible lines and pores, she smiles, film grain, Kodachrome](Historic3.png)