Instructions to use MiniMaxAI/MiniMax-M3 with libraries and local apps.

Libraries

How to use MiniMaxAI/MiniMax-M3 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="MiniMaxAI/MiniMax-M3", trust_remote_code=True)
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM

processor = AutoProcessor.from_pretrained("MiniMaxAI/MiniMax-M3", trust_remote_code=True)
model = AutoModelForMultimodalLM.from_pretrained("MiniMaxAI/MiniMax-M3", trust_remote_code=True, device_map="auto")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use MiniMaxAI/MiniMax-M3 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MiniMaxAI/MiniMax-M3"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MiniMaxAI/MiniMax-M3",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'
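
The same request can also be sent from Python. The sketch below is a minimal example using only the standard library, assuming the vLLM server started above is listening on localhost:8000. The helper names `build_chat_payload` and `chat` are illustrative and are not part of vLLM.

```python
import json
import urllib.request

MODEL_ID = "MiniMaxAI/MiniMax-M3"
# Assumption: default address of the server started with `vllm serve`.
BASE_URL = "http://localhost:8000/v1"


def build_chat_payload(prompt, image_url=None, model=MODEL_ID):
    """Build an OpenAI-style chat completion body (hypothetical helper)."""
    content = [{"type": "text", "text": prompt}]
    if image_url is not None:
        # Images are passed by URL, mirroring the curl example.
        content.append({"type": "image_url", "image_url": {"url": image_url}})
    return {"model": model, "messages": [{"role": "user", "content": content}]}


def chat(payload, base_url=BASE_URL, timeout=120):
    """POST the payload to /chat/completions and return the reply text."""
    request = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # OpenAI-compatible servers return the reply under choices[0].message.content.
    return body["choices"][0]["message"]["content"]
```

With a server running, `print(chat(build_chat_payload("Describe this image in one sentence.", image_url="<image URL>")))` should print the model's reply. For a server on a different port, such as the SGLang examples that use port 30000, pass `base_url="http://localhost:30000/v1"`.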

Use Docker

docker model run hf.co/MiniMaxAI/MiniMax-M3

SGLang

How to use MiniMaxAI/MiniMax-M3 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MiniMaxAI/MiniMax-M3" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MiniMaxAI/MiniMax-M3",
		"messages": [
			{
				"role": "user",
				"content": [
					{
						"type": "text",
						"text": "Describe this image in one sentence."
					},
					{
						"type": "image_url",
						"image_url": {
							"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
						}
					}
				]
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MiniMaxAI/MiniMax-M3" \
        --host 0.0.0.0 \
        --port 30000

Docker Model Runner
How to use MiniMaxAI/MiniMax-M3 with Docker Model Runner:
```
docker model run hf.co/MiniMaxAI/MiniMax-M3
```

Thank you team for the improved license

by original-el8 - opened Jun 12

Discussion

original-el8

Jun 12

Thanks to the team for the improved license. It really seems like you took feedback from the 2.7 release and improved the license for M3. Well done!

ryanlee-dev

MiniMax org Jun 12

Enjoy

rattl

Jun 12

Coming here for the same purpose. Thanks for sharing a nice model, and keep it up!

original-el8

Jun 12

> Enjoy

Thanks @ryanlee-dev. I know you got a lot of grief in the previous release threads, and I think the community really appreciates you working through it with your legal team to land on a vastly improved, community-friendly license.

liyawei

MiniMax org Jun 12

great Job!

plaue

Jun 12

Very nice, thank you!

Voktolom

Jun 12

A big thank you. Thank you for being open to the model!

fredizzimo

Jun 12

The definition of commercial use is still ambiguous: the examples say one thing and the definition another. Under a strict interpretation, almost everything needs permission and must display the "Built with MiniMax M3" message. See https://gnu.support/software-freedom-fakers/MiniMax-M2-7-s-MIT-Style-License-Is-a-Misleading-Restriction-That-Bans-Commercial-Use-and-Fails-Free-Software-Standards-124111.html for more details and a lot of examples. But I have also heard many people claim that it only applies to service providers and to using the software as a service, and that using the model to make software, even commercial software, is allowed.

So, what's the correct interpretation? It would also be nice if that ambiguity was removed from the license text itself.

dct-cell

Jun 12

Very nice!

voves

Jun 12

Great Job! FP8 is fine on 8 B200, now just waiting for NVFP4 weights 👀

Monoclebear

Jun 12

I'll try to put it on MLX.

ryanlee-dev

MiniMax org Jun 13

Hope someone can help upload an NVFP4 version.

g-a-b-y

Jun 13

Usually Nvidia or RedHatAI will publish the NVFP4 weights 2-3 weeks after the official release.

g-a-b-y

Jun 13

Someone already made the request on their repo: https://github.com/NVIDIA/Model-Optimizer/issues/1708

hurler98

Jun 13

This isn't improved at all. Kimi allows all users to use the product; it just requires displaying "Made with Kimi" for commercial (>$20M) uses. Here, any provider will need explicit permission to serve it to users. This is effectively going to restrict the number of providers. Case in point: Kimi K2.6 has over two dozen providers on OpenRouter, while MiniMax 2.7 has fewer than a dozen.

The only (and non-negligible) improvement is for small and medium enterprises that want to deploy it for their employees. But with 400B parameters, they would most likely need to rely on a cloud provider, which might be prohibited from serving the model.

hurler98

Jun 13

Just to be clear, I am still very grateful to have one more semi-closed model, it's better than completely closed

voves

Jun 14

Luke Alonso dropped NVFP4, all aboard 🏃‍♂️
