🔮Unsloth Model Catalog

Unsloth LLMs directory for all our Dynamicarrow-up-right GGUF, 4-bit, 16-bit models on Hugging Face.

QwenDeepSeekGemmaLlamaMistralGLM

GGUFs let you run models in tools like Unsloth Studio✨, Ollama and llama.cpp. Instruct (4-bit) safetensors can be used for inference or fine-tuning via Unsloth.

Model
Variant
GGUF
Instruct (4-bit)

122B-A10B

397B-A17B

Edit-2511

80B-A3B-Thinking

30B-A3B-Instruct

30B-A3B-Thinking

235B-A22B-Instruct

235B-A22B-Thinking

30B-A3B-Instruct

30B-A3B-Thinking

235B-A22B-Instruct

4.6V-Flash

Kimi-K2

Thinking

DeepSeek models:

Model
Variant
GGUF
Instruct (4-bit)

Llama models:

Gemma models:

Qwen models:

Model
Variant
GGUF
Instruct (4-bit)

122B-A10B

397B-A17B

Edit-2511

Qwen3-Coder

30B-A3B

480B-A35B

30B-A3B-Instruct

30B-A3B-Thinking

235B-A22B-Thinking

235B-A22B-Instruct

235 B-A22B

Qwen 2.5 Omni

3 B

Qwen 2.5

0.5 B

Qwen 2.5 Coder (128 K)

0.5 B

QVQ (preview)

72 B

Qwen 2 (chat)

1.5 B

Qwen 2 VL

2 B

GLM models:

Model
Variant
GGUF
Instruct (4-bit)

Mistral models:

Model
Variant
GGUF
Instruct (4-bit)

Phi models:

Other (GLM, Orpheus, Smol, Llava etc.) models:

Model
Variant
GGUF
Instruct (4-bit)

GLM

4.5-Air

Grok 2

270B

Baidu-ERNIE

4.5-21B-A3B-Thinking

Hunyuan

A13B

Orpheus

0.1-ft (3B)

LLava

1.5 (7 B)

1.6 Mistral (7 B)

TinyLlama

Chat

Zephyr-SFT

7 B

Yi

6 B (v1.5)

6 B (v1.0)

34 B (chat)

34 B (base)

Last updated

Was this helpful?