Instructions to use MiniMaxAI/MiniMax-M2.7 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use MiniMaxAI/MiniMax-M2.7 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="MiniMaxAI/MiniMax-M2.7", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("MiniMaxAI/MiniMax-M2.7", trust_remote_code=True)
model = AutoModelForMultimodalLM.from_pretrained("MiniMaxAI/MiniMax-M2.7", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference
HuggingChat
Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use MiniMaxAI/MiniMax-M2.7 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MiniMaxAI/MiniMax-M2.7"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MiniMaxAI/MiniMax-M2.7",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/MiniMaxAI/MiniMax-M2.7

SGLang

How to use MiniMaxAI/MiniMax-M2.7 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MiniMaxAI/MiniMax-M2.7" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MiniMaxAI/MiniMax-M2.7",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MiniMaxAI/MiniMax-M2.7" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MiniMaxAI/MiniMax-M2.7",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use MiniMaxAI/MiniMax-M2.7 with Docker Model Runner:
```
docker model run hf.co/MiniMaxAI/MiniMax-M2.7
```

minimax 3 什么时候开源？

#33

by wpfnnnns - opened 11 days ago

Discussion

wpfnnnns

11 days ago

以上

ryanlee-dev

MiniMax org 11 days ago

预计下周

m00fa

11 days ago

will it fit 256GB unified or we can only dream ? <3 😀

celikburak

11 days ago

will it fit 256GB unified or we can only dream ? <3 😀

Probably gotta keep dreaming. Even if it fits, we'd need about 60-70 GB just for the KV cache to handle a 1M context.

TobDeBer

11 days ago

•

edited 11 days ago

It has sparser attention than M2.7. I expect KV cache size to drop 10x compared to the M2.7
https://filecdn.minimax.chat/public/m3-msa-arch.png

singulariti

11 days ago

will it fit 256GB unified or we can only dream ? <3 😀

It had 100T tokens of training data so my guess is 500b+ params

conanwsz

10 days ago

预计下周

特地注册个账号过来点赞👍

nic1122

4 days ago

周一是个好日子

wufg

4 days ago

已经下周了。哈哈

Keeperorowner

4 days ago

预计下周

时间到了哦，快开源

johhnypipo

3 days ago

will it be out this week?

darvec

3 days ago

时间差不多咯

crystech

2 days ago

waiting

Sidney000

2 days ago

waiting :)

celikburak

2 days ago

😶‍🌫️

Geximus

1 day ago

Can't write here :D
w8ing!

ziyidd

1 day ago

10天之期已到

mattpetters

1 day ago

im not movin from dis spot

nic1122

1 day ago

怎么回事啊

ryanlee-dev

MiniMax org 1 day ago

We are drafting a community-friendly license and plan to open-source M3 by this Friday evening. Thank you all for your support.
我们正在拟一个社区友好的 license，预计本周五晚上开源 M3. 感谢大家支持

ryanlee-dev changed discussion status to closed 1 day ago

traphix

1 day ago

I haven't seen MiniMax-M3 submit a model pull request to frameworks like Transformers or vLLM; how will the model be deployed after release?

celikburak

about 24 hours ago

We are drafting a community-friendly license and plan to open-source M3 by this Friday evening. Thank you all for your support.
我们正在拟一个社区友好的 license，预计本周五晚上开源 M3. 感谢大家支持

open-source? open-weights?

EclipseMist

about 17 hours ago

I haven't seen MiniMax-M3 submit a model pull request to frameworks like Transformers or vLLM; how will the model be deployed after release?

I was thinking for a minute maybe its because its the same architecture like m2.7 and m2.5 were then I remembered m3 is both multimodal and has a new sparse attention. So it has me wondering if we will get a massive delay in being able to properly run the model.

g-a-b-y

about 11 hours ago

I haven't seen MiniMax-M3 submit a model pull request to frameworks like Transformers or vLLM; how will the model be deployed after release?

I was thinking for a minute maybe its because its the same architecture like m2.7 and m2.5 were then I remembered m3 is both multimodal and has a new sparse attention. So it has me wondering if we will get a massive delay in being able to properly run the model.

I anticipate that full support will not be available for several weeks, especially for vLLM.

g-a-b-y

about 10 hours ago

I submitted a feature request to vLLM: https://github.com/vllm-project/vllm/issues/45360

g-a-b-y

about 3 hours ago

vLLM added support here: https://github.com/vllm-project/vllm/pull/45381

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment