kelseye commited on
Commit
334a6cc
·
verified ·
1 Parent(s): 34de431

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ assets/realism/value_control_0.1.jpg filter=lfs diff=lfs merge=lfs -text
37
+ assets/realism/value_control_0.3.jpg filter=lfs diff=lfs merge=lfs -text
38
+ assets/realism/value_control_0.5.jpg filter=lfs diff=lfs merge=lfs -text
.mdl ADDED
Binary file (62 Bytes). View file
 
README.md ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ # AttriCtrl Numerical Image Generation Control Model
5
+
6
+ ## Introduction
7
+
8
+ AttriCtrl enables fine-grained control of numerical image attributes during image generation.
9
+
10
+ For more details, please refer to our paper: [AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models](https://arxiv.org/abs/2508.02151)
11
+
12
+ ## Result Demonstration
13
+
14
+ ### Brightness
15
+
16
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
17
+ |-|-|-|-|-|
18
+ |![](./assets/brightness/value_control_0.1.jpg)|![](./assets/brightness/value_control_0.3.jpg)|![](./assets/brightness/value_control_0.5.jpg)|![](./assets/brightness/value_control_0.7.jpg)|![](./assets/brightness/value_control_0.9.jpg)|
19
+
20
+ ### Detail
21
+
22
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
23
+ |-|-|-|-|-|
24
+ |![](./assets/detail/value_control_0.1.jpg)|![](./assets/detail/value_control_0.3.jpg)|![](./assets/detail/value_control_0.5.jpg)|![](./assets/detail/value_control_0.7.jpg)|![](./assets/detail/value_control_0.9.jpg)|
25
+
26
+ ### Realism
27
+
28
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
29
+ |-|-|-|-|-|
30
+ |![](./assets/realism/value_control_0.1.jpg)|![](./assets/realism/value_control_0.3.jpg)|![](./assets/realism/value_control_0.5.jpg)|![](./assets/realism/value_control_0.7.jpg)|![](./assets/realism/value_control_0.9.jpg)|
31
+
32
+ ## Inference Code
33
+
34
+ ```shell
35
+ git clone https://github.com/modelscope/DiffSynth-Studio.git
36
+ cd DiffSynth-Studio
37
+ pip install -e .
38
+ ```
39
+
40
+ ```python
41
+ import torch
42
+ from diffsynth.pipelines.flux_image_new import FluxImagePipeline, ModelConfig
43
+ ```
44
+
45
+ ```python
46
+ pipe = FluxImagePipeline.from_pretrained(
47
+ torch_dtype=torch.bfloat16,
48
+ device="cuda",
49
+ model_configs=[
50
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="flux1-dev.safetensors"),
51
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder/model.safetensors"),
52
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder_2/"),
53
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="ae.safetensors"),
54
+ ModelConfig(model_id="DiffSynth-Studio/AttriCtrl-FLUX.1-Dev", origin_file_pattern="models/detail.safetensors")
55
+ ],
56
+ )
57
+
58
+ for i in [0.1, 0.3, 0.5, 0.7, 0.9]:
59
+ image = pipe(prompt="a cat on the beach", seed=2, value_controller_inputs=[i])
60
+ image.save(f"value_control_{i}.jpg")
61
+ ```
README_from_modelscope.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ frameworks:
3
+ - Pytorch
4
+ license: Apache License 2.0
5
+ tasks:
6
+ - text-to-image-synthesis
7
+
8
+ #model-type:
9
+ ##如 gpt、phi、llama、chatglm、baichuan 等
10
+ #- gpt
11
+
12
+ #domain:
13
+ ##如 nlp、cv、audio、multi-modal
14
+ #- nlp
15
+
16
+ #language:
17
+ ##语言代码列表 https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
18
+ #- cn
19
+
20
+ #metrics:
21
+ ##如 CIDEr、Blue、ROUGE 等
22
+ #- CIDEr
23
+
24
+ #tags:
25
+ ##各种自定义,包括 pretrained、fine-tuned、instruction-tuned、RL-tuned 等训练方法和其他
26
+ #- pretrained
27
+
28
+ #tools:
29
+ ##如 vllm、fastchat、llamacpp、AdaSeq 等
30
+ #- vllm
31
+ ---
32
+ # AttriCtrl 数值型生图控制模型
33
+
34
+ ## 简介
35
+
36
+ AttriCtrl 可以实现数值型图像指标的生图控制。
37
+
38
+ 更多细节请参考我们的论文: [AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models](https://arxiv.org/abs/2508.02151)
39
+
40
+ ## 效果展示
41
+
42
+ ### 亮度(Brightness)
43
+
44
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
45
+ |-|-|-|-|-|
46
+ |![](./assets/brightness/value_control_0.1.jpg)|![](./assets/brightness/value_control_0.3.jpg)|![](./assets/brightness/value_control_0.5.jpg)|![](./assets/brightness/value_control_0.7.jpg)|![](./assets/brightness/value_control_0.9.jpg)|
47
+
48
+ ### 细节(Detail)
49
+
50
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
51
+ |-|-|-|-|-|
52
+ |![](./assets/detail/value_control_0.1.jpg)|![](./assets/detail/value_control_0.3.jpg)|![](./assets/detail/value_control_0.5.jpg)|![](./assets/detail/value_control_0.7.jpg)|![](./assets/detail/value_control_0.9.jpg)|
53
+
54
+ ### 摄影感(Realism)
55
+
56
+ |scale = 0.1|scale = 0.3|scale = 0.5|scale = 0.7|scale = 0.9|
57
+ |-|-|-|-|-|
58
+ |![](./assets/realism/value_control_0.1.jpg)|![](./assets/realism/value_control_0.3.jpg)|![](./assets/realism/value_control_0.5.jpg)|![](./assets/realism/value_control_0.7.jpg)|![](./assets/realism/value_control_0.9.jpg)|
59
+
60
+ ## 推理代码
61
+
62
+ ```shell
63
+ git clone https://github.com/modelscope/DiffSynth-Studio.git
64
+ cd DiffSynth-Studio
65
+ pip install -e .
66
+ ```
67
+
68
+ ```python
69
+ import torch
70
+ from diffsynth.pipelines.flux_image_new import FluxImagePipeline, ModelConfig
71
+
72
+
73
+ pipe = FluxImagePipeline.from_pretrained(
74
+ torch_dtype=torch.bfloat16,
75
+ device="cuda",
76
+ model_configs=[
77
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="flux1-dev.safetensors"),
78
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder/model.safetensors"),
79
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder_2/"),
80
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="ae.safetensors"),
81
+ ModelConfig(model_id="DiffSynth-Studio/AttriCtrl-FLUX.1-Dev", origin_file_pattern="models/detail.safetensors")
82
+ ],
83
+ )
84
+
85
+ for i in [0.1, 0.3, 0.5, 0.7, 0.9]:
86
+ image = pipe(prompt="a cat on the beach", seed=2, value_controller_inputs=[i])
87
+ image.save(f"value_control_{i}.jpg")
88
+ ```
assets/brightness/value_control_0.1.jpg ADDED
assets/brightness/value_control_0.3.jpg ADDED
assets/brightness/value_control_0.5.jpg ADDED
assets/brightness/value_control_0.7.jpg ADDED
assets/brightness/value_control_0.9.jpg ADDED
assets/detail/value_control_0.1.jpg ADDED
assets/detail/value_control_0.3.jpg ADDED
assets/detail/value_control_0.5.jpg ADDED
assets/detail/value_control_0.7.jpg ADDED
assets/detail/value_control_0.9.jpg ADDED
assets/realism/value_control_0.1.jpg ADDED

Git LFS Details

  • SHA256: 21fc8da7d1803fabff627443ce495e9ff1532af8ff7d7cd48057fd7fe4b7b303
  • Pointer size: 131 Bytes
  • Size of remote file: 117 kB
assets/realism/value_control_0.3.jpg ADDED

Git LFS Details

  • SHA256: 65cd37ae98148d792668d3774c99fd9ecea0e4c64d28951c61dd9b2d39d3b44a
  • Pointer size: 131 Bytes
  • Size of remote file: 141 kB
assets/realism/value_control_0.5.jpg ADDED

Git LFS Details

  • SHA256: d407937a55a6ac47e89103c610a65cbee5b09c24d2e813ebc6316bc5215351d3
  • Pointer size: 131 Bytes
  • Size of remote file: 101 kB
assets/realism/value_control_0.7.jpg ADDED
assets/realism/value_control_0.9.jpg ADDED
configuration.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"framework":"Pytorch","task":"text-to-image-synthesis"}
models/brightness.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c32aba3e2c1f92200655f73d8f119554a8056fd178ee01a7730212daad9c8f4f
3
+ size 71860712
models/detail.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af65a4f651561e31850eeffba7c0b9123fd5908fc6be2dbc96d5561fa09c21bb
3
+ size 71860712
models/realism.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:121f22be6c8e31581a1c0df29754e4f289c7895ca163335f71b125e4b6d9bf63
3
+ size 71860712