⌘Ctrlk

Reddit Discord GitHub Newsletter

开始使用
新
模型
基础知识
博客

由 GitBook 提供支持

在本页

基础知识

🖥️推理与部署

了解如何保存您微调后的模型，以便在您喜欢的推理引擎中运行。

您也可以使用以下方式运行微调后的模型： Unsloth 的 2 倍更快推理.

llama.cpp - 保存为 GGUF

LM Studio

llama-server 与 OpenAI 端点

vLLM 引擎参数

上一页QwQ-32B 下一页GGUF & llama.cpp

最后更新于1个月前

这有帮助吗？

Community

Reddit r/unsloth
Twitter (X)
LinkedIn

Resources

Tutorials
Docker
Hugging Face

Company

About
Contact
Events

© Unsloth, 2026

这有帮助吗？