Add pipeline tag and library name (#2)

Browse files

- Add pipeline tag and library name (f0575ca2bc1ace3b44cc54d0c273609e54e22044)

Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show

README.md +20 -9

README.md CHANGED Viewed

@@ -1,7 +1,9 @@
 ---
-license: mit
 base_model:
 - inclusionAI/Ling-lite
 ---
 # Ming-Lite-Omni
@@ -115,7 +117,7 @@ Note: All models are evaluated based on 128 uniformly sampled frames.
 <div align="center">
 |     Model      | aishell1 | aishell2_android | aishell2_ios | cv15_zh  | fleurs_zh | wenetspeech_meeting | wenetspeech_net | librispeech_test_clean | librispeech_test_other | multilingual_librispeech | cv15_en  | fleurs_en |  voxpopuli_v1.0_en   |
-|:--------------:|:--------:|:----------------:|:------------:|:--------:|:---------:|:-------------------:|:---------------:|:----------------------:|:----------------------:|:------------------------:|:--------:|:---------:|:--------------------:|
 | Ming-lite-omni |   1.47   |     **2.55**     |   **2.52**   |   6.31   |   2.96    |        5.95         |      5.46       |          1.44          |          2.80          |         **4.15**         | **6.89** | **3.39**  |       **5.80**       |
 |  Qwen2.-Omni   |   1.18   |       2.75       |     2.63     | **5.20** |   3.00    |      **5.90**       |      7.70       |          1.80          |          3.40          |           7.56           |   7.60   |   4.10    |       **5.80**       |
 |  Qwen2-Audio   |   1.53   |       2.92       |     2.92     |   6.90   |   7.50    |        7.16         |      8.42       |          1.60          |          3.60          |           5.40           |   8.60   |   6.90    |         6.84         |
@@ -200,8 +202,6 @@ If you're in mainland China, we strongly recommend you to download our model fro
 Additional demonstration cases are available on our project [page](https://lucaria-academy.github.io/Ming-Omni/).
 ## Example Usage
 Please download our model following [Model Downloads](#model-downloads), then you can refer to the following codes to run Ming-lite-omni model.
@@ -275,19 +275,31 @@ messages = [
 To enable thinking before response, adding the following system prompt before your question:
 ```python
-cot_prompt = "SYSTEM: You are a helpful assistant. When the user asks a question, your response must include two parts: first, the reasoning process enclosed in <thinking>...</thinking> tags, then the final answer enclosed in <answer>...</answer> tags. The critical answer or key result should be placed within \\boxed{}.\n"
 # And your input message should be like this:
 messages = [
     {
         "role": "HUMAN",
         "content": [
             {"type": "image", "image": os.path.join(assets_path, "reasoning.png")},
-            {"type": "text", "text": cot_prompt + "In the rectangle $A B C D$ pictured, $M_{1}$ is the midpoint of $D C, M_{2}$ the midpoint of $A M_{1}, M_{3}$ the midpoint of $B M_{2}$ and $M_{4}$ the midpoint of $C M_{3}$. Determine the ratio of the area of the quadrilateral $M_{1} M_{2} M_{3} M_{4}$ to the area of the rectangle $A B C D$.\nChoices:\n(A) $\frac{7}{16}$\n(B) $\frac{3}{16}$\n(C) $\frac{7}{32}$\n(D) $\frac{9}{32}$\n(E) $\frac{1}{5}$"},
         ],
     },
 ]
 # Output:
-# \<think\>\nOkay, so I have this problem about a rectangle ABCD ... (thinking process omitted) ... So, the correct answer is C.\n\</think\>\n\<answer\>\\boxed{C}\</answer\>\n\n
 ```
 ```python
@@ -547,5 +559,4 @@ If you find our work helpful, feel free to give us a cite.
       archivePrefix = {arXiv},
       url = {https://arxiv.org/abs/2506.09344}
 }
-```

 ---
 base_model:
 - inclusionAI/Ling-lite
+license: mit
+pipeline_tag: any-to-any
+library_name: transformers
 ---
 # Ming-Lite-Omni
 <div align="center">
 |     Model      | aishell1 | aishell2_android | aishell2_ios | cv15_zh  | fleurs_zh | wenetspeech_meeting | wenetspeech_net | librispeech_test_clean | librispeech_test_other | multilingual_librispeech | cv15_en  | fleurs_en |  voxpopuli_v1.0_en   |
+|:--------------:|:--------:|:----------------:|:------------:|:--------:|:---------:|:-------------------:|:---------------:|:----------------------:|:----------------------:|:------------------------:|:--------:|:---------:|:--------------------:|\
 | Ming-lite-omni |   1.47   |     **2.55**     |   **2.52**   |   6.31   |   2.96    |        5.95         |      5.46       |          1.44          |          2.80          |         **4.15**         | **6.89** | **3.39**  |       **5.80**       |
 |  Qwen2.-Omni   |   1.18   |       2.75       |     2.63     | **5.20** |   3.00    |      **5.90**       |      7.70       |          1.80          |          3.40          |           7.56           |   7.60   |   4.10    |       **5.80**       |
 |  Qwen2-Audio   |   1.53   |       2.92       |     2.92     |   6.90   |   7.50    |        7.16         |      8.42       |          1.60          |          3.60          |           5.40           |   8.60   |   6.90    |         6.84         |
 Additional demonstration cases are available on our project [page](https://lucaria-academy.github.io/Ming-Omni/).
 ## Example Usage
 Please download our model following [Model Downloads](#model-downloads), then you can refer to the following codes to run Ming-lite-omni model.
 To enable thinking before response, adding the following system prompt before your question:
 ```python
+cot_prompt = "SYSTEM: You are a helpful assistant. When the user asks a question, your response must include two parts: first, the reasoning process enclosed in <thinking>...</thinking> tags, then the final answer enclosed in <answer>...</answer> tags. The critical answer or key result should be placed within \\boxed{}.
+"
 # And your input message should be like this:
 messages = [
     {
         "role": "HUMAN",
         "content": [
             {"type": "image", "image": os.path.join(assets_path, "reasoning.png")},
+            {"type": "text", "text": cot_prompt + "In the rectangle $A B C D$ pictured, $M_{1}$ is the midpoint of $D C, M_{2}$ the midpoint of $A M_{1}, M_{3}$ the midpoint of $B M_{2}$ and $M_{4}$ the midpoint of $C M_{3}$. Determine the ratio of the area of the quadrilateral $M_{1} M_{2} M_{3} M_{4}$ to the area of the rectangle $A B C D$.
+Choices:
+(A) $\\frac{7}{16}$
+(B) $\\frac{3}{16}$
+(C) $\\frac{7}{32}$
+(D) $\\frac{9}{32}$
+(E) $\\frac{1}{5}$"},
         ],
     },
 ]
 # Output:
+# \<think\>
+Okay, so I have this problem about a rectangle ABCD ... (thinking process omitted) ... So, the correct answer is C.
+\</think\>
+\<answer\>\\boxed{C}\</answer\>
 ```
 ```python
       archivePrefix = {arXiv},
       url = {https://arxiv.org/abs/2506.09344}
 }
+```