YangYunjin commited on 3 days ago

Commit

5d87d58

1 Parent(s): 9850215

update

Browse files

Files changed (24) hide show

._.gitattributes +0 -0
._README.md +0 -0
._config.json +3 -0
._model-00001-of-00003.safetensors +3 -0
._model-00002-of-00003.safetensors +3 -0
._model-00003-of-00003.safetensors +3 -0
._model.safetensors.index.json +3 -0
._preprocessor_config.json +3 -0
._processor_config.json +3 -0
._special_tokens_map.json +3 -0
._tokenizer.json +3 -0
._tokenizer_config.json +3 -0
.gitattributes +2 -0
README.md +61 -3
config.json +3 -0
model-00001-of-00003.safetensors +3 -0
model-00002-of-00003.safetensors +3 -0
model-00003-of-00003.safetensors +3 -0
model.safetensors.index.json +3 -0
preprocessor_config.json +3 -0
processor_config.json +3 -0
special_tokens_map.json +3 -0
tokenizer.json +3 -0
tokenizer_config.json +3 -0

._.gitattributes ADDED Viewed

Binary file (4.1 kB). View file

._README.md ADDED Viewed

Binary file (4.1 kB). View file

._config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._model-00001-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._model-00002-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._model-00003-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._model.safetensors.index.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._processor_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._tokenizer.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

._tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926cd45db5af3c3dc3bcdddf4841166ab9e10fb905a2f773127db99deb44b88e
+size 4096

.gitattributes CHANGED Viewed

@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.json filter=lfs diff=lfs merge=lfs -text
+*.memmap filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -1,3 +1,61 @@
----
-license: apache-2.0
----

+---
+license: mit
+license_name: deepseek
+license_link: LICENSE
+pipeline_tag: any-to-any
+library_name: transformers
+tags:
+- muiltimodal
+- text-to-image
+- unified-model
+---
+## 1. Introduction
+Janus-Pro is a novel autoregressive framework that unifies multimodal understanding and generation.
+It addresses the limitations of previous approaches by decoupling visual encoding into separate pathways, while still utilizing a single, unified transformer architecture for processing. The decoupling not only alleviates the conflict between the visual encoder’s roles in understanding and generation, but also enhances the framework’s flexibility.
+Janus-Pro surpasses previous unified model and matches or exceeds the performance of task-specific models.
+The simplicity, high flexibility, and effectiveness of Janus-Pro make it a strong candidate for next-generation unified multimodal models.
+[**Github Repository**](https://github.com/deepseek-ai/Janus)
+<div align="center">
+<img alt="image" src="janus_pro_teaser1.png" style="width:90%;">
+</div>
+<div align="center">
+<img alt="image" src="janus_pro_teaser2.png" style="width:90%;">
+</div>
+### 2. Model Summary
+Janus-Pro is a unified understanding and generation MLLM, which decouples visual encoding for multimodal understanding and generation.
+Janus-Pro is constructed based on the DeepSeek-LLM-1.5b-base/DeepSeek-LLM-7b-base.
+For multimodal understanding, it uses the [SigLIP-L](https://huggingface.co/timm/ViT-L-16-SigLIP-384) as the vision encoder, which supports 384 x 384 image input. For image generation, Janus-Pro uses the tokenizer from [here](https://github.com/FoundationVision/LlamaGen) with a downsample rate of 16.
+## 3. Quick Start
+Please refer to [**Github Repository**](https://github.com/deepseek-ai/Janus)
+## 4. License
+This code repository is licensed under [the MIT License](https://github.com/deepseek-ai/DeepSeek-LLM/blob/HEAD/LICENSE-CODE). The use of Janus-Pro models is subject to [DeepSeek Model License](https://github.com/deepseek-ai/DeepSeek-LLM/blob/HEAD/LICENSE-MODEL).
+## 5. Citation
+```
+@article{chen2025janus,
+  title={Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling},
+  author={Chen, Xiaokang and Wu, Zhiyu and Liu, Xingchao and Pan, Zizheng and Liu, Wen and Xie, Zhenda and Yu, Xingkai and Ruan, Chong},
+  journal={arXiv preprint arXiv:2501.17811},
+  year={2025}
+}
+```
+## 6. Contact
+If you have any questions, please raise an issue or contact us at [service@deepseek.com](mailto:service@deepseek.com).

config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:561fdcd22965ed9fab979259426fcf9831823a8540b6cad8717765918b1c50fd
+size 1282

model-00001-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c5b52f0483b8569186115a6a1eba87446363dea1d1b3addef93a5b948f57cb3e
+size 4916851534

model-00002-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c72e2cb11b6407f00e4e4ae4133fdb3cfbaec053e5d4d6b2aab60cd5e15349b
+size 4947392496

model-00003-of-00003.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4fe889057c6f13bb1fe3a62eebdfb127f3acfaf874dc19299c45bea53745c7db
+size 4976742608

model.safetensors.index.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:34700500210eaed06cee767c5caca364277a2ec3b4ef9a539749ebbeaca986b8
+size 89033

preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4dae4dc1bda762bdc84b887b2d3339f21935a15ff716b3049490d96935f7f12a
+size 346

processor_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:19b96079e21e3cf0409f7252431100892f2ab2f377c69845f5090670fc319dd2
+size 334

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42cbf6a44df7b2beed050aeb017d5d2e43e3c507d492fed797be8b939748798e
+size 684

tokenizer.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42bcf2c54739affa70425520f9f8eb48e7409cf515541c74594de4c412b7d5ad
+size 7614107

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:520c2e09d31f3fdec13b5a521845dbda9b1b1235121d167183a9d632eb27c6a5
+size 107870