baidu/ERNIE-4.5-0.3B-Paddle

Text Generation


Update README.md

#1

by sunzhongkai588 - opened Jul 3

base: refs/heads/main ← from: refs/pr/1

Files changed (1)

README.md +3 -46

README.md CHANGED

@@ -32,6 +32,9 @@ library_name: PaddlePaddle
 # ERNIE-4.5-0.3B
 ## ERNIE 4.5 Highlights
 The advanced capabilities of the ERNIE 4.5 models, particularly the MoE-based A47B and A3B series, are underpinned by several key technical innovations:
@@ -88,52 +91,6 @@ python -m fastdeploy.entrypoints.openai.api_server \
        --max-num-seqs 32
 ```
-### Using `transformers` library
-The following contains a code snippet illustrating how to use the model generate content based on given inputs.
-```python
-from transformers import AutoModelForCausalLM, AutoTokenizer
-model_name = "baidu/ERNIE-4.5-0.3B-PT"
-# load the tokenizer and the model
-tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
-model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
-# prepare the model input
-prompt = "Give me a short introduction to large language model."
-messages = [
-    {"role": "user", "content": prompt}
-]
-text = tokenizer.apply_chat_template(
-    messages,
-    tokenize=False,
-    add_generation_prompt=True
-)
-model_inputs = tokenizer([text], add_special_tokens=False, return_tensors="pt").to(model.device)
-# conduct text completion
-generated_ids = model.generate(
-    model_inputs.input_ids,
-    max_new_tokens=1024
-)
-output_ids = generated_ids[0][len(model_inputs.input_ids[0]):].tolist()
-# decode the generated ids
-generate_text = tokenizer.decode(output_ids, skip_special_tokens=True).strip("\n")
-print("generate_text:", generate_text)
-```
-### vLLM inference
-vLLM is currently being adapted, priority can be given to using our forked repository [vllm](https://github.com/CSWYF3634076/vllm/tree/ernie). We are working with the community to fully support ERNIE4.5 models, stay tuned.
-```bash
-vllm serve baidu/ERNIE-4.5-0.3B-PT --trust-remote-code
-```
 ## License
 The ERNIE 4.5 models are provided under the Apache License 2.0. This license permits commercial use, subject to its terms and conditions. Copyright (c) 2025 Baidu, Inc. All Rights Reserved.

 # ERNIE-4.5-0.3B
+> [!NOTE]
+> Note: "**-Paddle**" models use [PaddlePaddle](https://github.com/PaddlePaddle/Paddle) weights, while "**-PT**" models use Transformer-style PyTorch weights.
 ## ERNIE 4.5 Highlights
 The advanced capabilities of the ERNIE 4.5 models, particularly the MoE-based A47B and A3B series, are underpinned by several key technical innovations: