# Unsloth Docs

Unsloth lets you run and train AI models on your own local hardware.

Our docs will guide you through running & training your own model locally.

<a href="fine-tuning-for-beginners" class="button primary">Get started</a> <a href="https://github.com/unslothai/unsloth" class="button secondary">Our GitHub</a>

<table data-card-size="large" data-view="cards" data-full-width="false"><thead><tr><th></th><th></th><th data-hidden data-card-cover data-type="image">Cover image</th><th data-hidden data-card-target data-type="content-ref"></th></tr></thead><tbody><tr><td><h4>Google Gemma 4</h4></td><td>Run and train Google's new Gemma 4 models!</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FkEjWOJqBWCtIN9Cg6CdI%2FGemma%204%20landscape.png?alt=media&#x26;token=57d3f596-dae8-4eab-80e6-0847794ffc8d">Gemma 4 landscape.png</a></td><td><a href="../models/gemma-4">gemma-4</a></td></tr><tr><td><h4><strong>Introducing Unsloth Studio</strong></h4></td><td>A new open, no-code web UI to train and run LLMs.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FstfdTMsoBMmsbQsgQ1Ma%2Flandscape%20clip%20gemma.gif?alt=media&#x26;token=eec5f2f7-b97a-4c1c-ad01-5a041c3e4013">landscape clip gemma.gif</a></td><td><a href="../new/studio">studio</a></td></tr></tbody></table>

<table data-view="cards" data-full-width="false"><thead><tr><th></th><th></th><th data-hidden data-card-cover data-type="image">Cover image</th><th data-hidden data-card-target data-type="content-ref"></th></tr></thead><tbody><tr><td><strong>Qwen3.5</strong></td><td>New Qwen3.5 Small &#x26; Medium LLMs are here!</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2Fvw6yRxJDCeBl1CIsQkki%2Fqwen35.png?alt=media&#x26;token=28fe0357-351a-49e1-a176-bb21ecc8542a">qwen35.png</a></td><td><a href="../models/qwen3.5">qwen3.5</a></td></tr><tr><td><strong>NVIDIA Nemotron 3</strong></td><td>Run the new 4B and 120B models by NVIDIA.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FllPS7l6rpEr68mytlxXU%2Fnemotron%203%20logo.png?alt=media&#x26;token=7bd05673-6b97-41c2-b657-530b7e6e4e3c">nemotron 3 logo.png</a></td><td><a href="../models/nemotron-3">nemotron-3</a></td></tr><tr><td><strong>Faster MoE is here!</strong></td><td>Train MoE LLMs 12x faster with less VRAM.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2Fh9BrTJR8CZghHOe1Yrgj%2Ffaster%20moe%201920.png?alt=media&#x26;token=404e70ea-6aa1-4af0-a01c-7490d8147c4e">faster moe 1920.png</a></td><td><a href="../basics/faster-moe">faster-moe</a></td></tr><tr><td><strong>Claude Code &#x26; Codex</strong></td><td>Learn to run local LLMs via Claude &#x26; OpenAI.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FM3el6W6XCMc0iBEgdeov%2Fclaude%20code%20codex.png?alt=media&#x26;token=e45dbc05-9af6-40f7-bcf8-59b79ac44909">claude code codex.png</a></td><td><a href="../basics/claude-code">claude-code</a></td></tr><tr><td><strong>Qwen3-Coder-Next</strong></td><td>Run &#x26; fine-tune the new 80B coding 
model.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2F47HGDuvMBPAkh4vcaGMg%2Fqwen3-coder-next%20logo.png?alt=media&#x26;token=244ae539-fea4-40e8-9ee2-b6ec7fb44060">qwen3-coder-next logo.png</a></td><td><a href="../models/qwen3-coder-next">qwen3-coder-next</a></td></tr><tr><td><strong>GLM-4.7-Flash</strong></td><td>Run &#x26; fine-tune 30B model for agentic coding.</td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2F6PQZ23CoUdZs1EZCjtYn%2Fglm4.7flash.png?alt=media&#x26;token=d3dc776e-ef3e-4eb3-ad4e-bf45e7b5745a">glm4.7flash.png</a></td><td><a href="../models/glm-4.7-flash">glm-4.7-flash</a></td></tr></tbody></table>

{% columns %}
{% column width="50%" %}
{% content-ref url="fine-tuning-llms-guide" %}
[fine-tuning-llms-guide](https://unsloth.ai/docs/get-started/fine-tuning-llms-guide)
{% endcontent-ref %}

{% content-ref url="unsloth-notebooks" %}
[unsloth-notebooks](https://unsloth.ai/docs/get-started/unsloth-notebooks)
{% endcontent-ref %}
{% endcolumn %}

{% column width="50%" %}
{% content-ref url="unsloth-model-catalog" %}
[unsloth-model-catalog](https://unsloth.ai/docs/get-started/unsloth-model-catalog)
{% endcontent-ref %}

{% content-ref url="../models/tutorials" %}
[tutorials](https://unsloth.ai/docs/models/tutorials)
{% endcontent-ref %}
{% endcolumn %}
{% endcolumns %}

### 🦥 Why Unsloth?

* We collaborate directly with the teams behind [gpt-oss](https://docs.unsloth.ai/new/gpt-oss-how-to-run-and-fine-tune#unsloth-fixes-for-gpt-oss), [Qwen3](https://www.reddit.com/r/LocalLLaMA/comments/1kaodxu/qwen3_unsloth_dynamic_ggufs_128k_context_bug_fixes/), [Llama 4](https://github.com/ggml-org/llama.cpp/pull/12889), [Mistral](https://unsloth.ai/docs/models/tutorials/devstral-how-to-run-and-fine-tune), [Gemma 1-3](https://news.ycombinator.com/item?id=39671146) and [Phi-4](https://unsloth.ai/blog/phi4), where we’ve **fixed critical bugs** that greatly improved model accuracy. Andrej Karpathy, for example, has [praised our work](https://x.com/karpathy/status/1765473722985771335).
* Unsloth streamlines local training, inference, data preparation, and deployment.
* Unsloth supports inference and training for 500+ models: [vision](https://unsloth.ai/docs/basics/vision-fine-tuning), [TTS](https://unsloth.ai/docs/basics/text-to-speech-tts-fine-tuning), [embedding](https://unsloth.ai/docs/basics/embedding-finetuning), and [RL](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide).

### ⭐ Features

Unsloth lets you run and train models for text, [audio](https://unsloth.ai/docs/basics/text-to-speech-tts-fine-tuning), [embedding](https://unsloth.ai/docs/new/embedding-finetuning), [vision](https://unsloth.ai/docs/basics/vision-fine-tuning) and more. Unsloth provides many key features for both inference and training:

#### Inference

* Search, download, and run models in any format: GGUFs, LoRA adapters, safetensors.
* [Self-healing tool calling](https://unsloth.ai/docs/new/studio/chat#auto-healing-tool-calling), web search, and OpenAI-compatible APIs.
* [Auto inference parameter tuning](https://unsloth.ai/docs/new/studio/chat#auto-parameter-tuning) and editable chat templates.
* [Export or save](https://unsloth.ai/docs/new/studio/export) your model to GGUF, 16-bit safetensors, and more.
* [Compare outputs](https://unsloth.ai/docs/new/studio/chat#model-arena) of two different models side by side.

#### Training

* Train or run [RL](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide) on 500+ models \~2x faster with \~70% less VRAM, with no loss in accuracy.
* Supports full fine-tuning, pre-training, 4-bit, 16-bit and FP8 training.
* [Auto-create datasets](https://unsloth.ai/docs/new/studio/data-recipe) from PDF, CSV, DOCX files. Edit data in a visual node workflow.
* Observability: monitor training live, track loss and GPU usage, and customize graphs.
* Most efficient [**reinforcement learning**](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide) library, using 80% less VRAM for GRPO, [FP8](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide/fp8-reinforcement-learning) etc.
* [Multi-GPU](https://unsloth.ai/docs/basics/multi-gpu-training-with-unsloth) works but a much better version is coming!

### Quickstart

Unsloth supports MacOS, Linux, [Windows](https://unsloth.ai/docs/get-started/install/windows-installation), [NVIDIA](https://unsloth.ai/docs/get-started/install/pip-install), Intel, and CPU setups. See: [unsloth-requirements](https://unsloth.ai/docs/get-started/fine-tuning-for-beginners/unsloth-requirements "mention"). Run the same command again to update:

#### **MacOS, Linux, WSL:**

```bash
curl -fsSL https://unsloth.ai/install.sh | sh
```

#### **Windows PowerShell:**

```powershell
irm https://unsloth.ai/install.ps1 | iex
```

#### Docker

Use our official **Docker image**, [`unsloth/unsloth`](https://hub.docker.com/r/unsloth/unsloth), which currently works on Windows, WSL, and Linux. MacOS support is coming soon.

#### Launch Unsloth

```bash
unsloth studio -H 0.0.0.0 -p 8888
```

#### New Models

<table data-view="cards"><thead><tr><th></th><th data-hidden></th><th data-hidden data-card-cover data-type="image">Cover image</th><th data-hidden data-card-target data-type="content-ref"></th></tr></thead><tbody><tr><td><strong>Kimi K2.5</strong></td><td></td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FgcSsB0cPhjj8inDt1bqf%2Fkimi%20k25%20logo.png?alt=media&#x26;token=19aec00a-7e0f-4980-b2b7-98b65a23123e">kimi k25 logo.png</a></td><td><a href="../models/kimi-k2.5">kimi-k2.5</a></td></tr><tr><td><strong>MiniMax-M2.5</strong></td><td></td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2F0yrdjCKbV8qnqyTrQ1pZ%2Fminimax2.5%20logo.png?alt=media&#x26;token=183839fe-6750-4c95-b058-c991ec8a5dec">minimax2.5 logo.png</a></td><td><a href="../models/minimax-m25">minimax-m25</a></td></tr><tr><td><strong>GLM-5</strong></td><td></td><td><a href="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2FINbIdckdXPbplF51fU92%2Fglm%205%20logo.png?alt=media&#x26;token=53e4c484-c791-4ffe-a571-749e98d76b15">glm 5 logo.png</a></td><td><a href="../models/tutorials/glm-5">glm-5</a></td></tr></tbody></table>

### What is Fine-tuning and RL? Why?

[**Fine-tuning** an LLM](https://unsloth.ai/docs/get-started/fine-tuning-llms-guide) customizes its behavior, enhances domain knowledge, and optimizes performance for specific tasks. By fine-tuning a pre-trained model (e.g. Llama-3.1-8B) on a dataset, you can:

* **Update Knowledge**: Introduce new domain-specific information.
* **Customize Behavior**: Adjust the model’s tone, personality, or response style.
* **Optimize for Tasks**: Improve accuracy and relevance for specific use cases.
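In practice, a supervised fine-tuning dataset is a collection of instruction–response pairs that are rendered into a single text string per example before training. A minimal sketch of that formatting step (the template and field names here are illustrative assumptions, not a fixed Unsloth format):

```python
# Render instruction/response pairs into Alpaca-style training prompts.
# The template and the "instruction"/"response" field names are
# illustrative; real datasets may use different schemas.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n{response}"
)

def format_example(example: dict) -> str:
    """Turn one dataset row into the text the model is trained on."""
    return ALPACA_TEMPLATE.format(
        instruction=example["instruction"],
        response=example["response"],
    )

dataset = [
    {"instruction": "Summarize: Unsloth trains LLMs faster.",
     "response": "Unsloth speeds up LLM training."},
]
texts = [format_example(row) for row in dataset]
```

The trainer then learns to continue each prompt with its response, which is how the model picks up the new knowledge, tone, or task behavior described above.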

[**Reinforcement Learning (RL)**](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide) is where an "agent" learns to make decisions by interacting with an environment and receiving **feedback** in the form of **rewards** or **penalties**.

* **Action:** What the model generates (e.g. a sentence).
* **Reward:** A signal indicating how good or bad the model's action was (e.g. did the response follow instructions? was it helpful?).
* **Environment:** The scenario or task the model is working on (e.g. answering a user’s question).
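In many RL setups (e.g. GRPO-style training), the reward is simply a plain function that scores each generated completion. A minimal sketch of a hypothetical reward function — the scoring rules below are made up for illustration, not a recommended recipe:

```python
def instruction_following_reward(prompt: str, completion: str) -> float:
    """Score one completion; higher rewards steer the policy
    toward answers with these (illustrative) properties."""
    score = 0.0
    if completion.strip():                            # said something at all
        score += 1.0
    if len(completion) <= 400:                        # stayed concise
        score += 0.5
    if completion.strip().endswith((".", "!", "?")):  # finished a sentence
        score += 0.5
    return score

r = instruction_following_reward("What is 2+2?", "2 + 2 equals 4.")
```

During training, the policy is updated to make high-reward actions more likely, which is how feedback shapes the model's behavior over time.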

**Example fine-tuning or RL use-cases**:

* Teach an LLM to predict whether a headline impacts a company positively or negatively.
* Use historical customer interactions to produce more accurate, customized responses.
* Fine-tune an LLM on legal texts for contract analysis, case-law research, and compliance.

You can think of a fine-tuned model as a specialized agent designed to do specific tasks more effectively and efficiently. **Fine-tuning can replicate all of RAG's capabilities**, but not vice versa.

{% columns %}
{% column width="50%" %}
{% content-ref url="fine-tuning-for-beginners/faq-+-is-fine-tuning-right-for-me" %}
[faq-+-is-fine-tuning-right-for-me](https://unsloth.ai/docs/get-started/fine-tuning-for-beginners/faq-+-is-fine-tuning-right-for-me)
{% endcontent-ref %}

{% content-ref url="../basics/inference-and-deployment" %}
[inference-and-deployment](https://unsloth.ai/docs/basics/inference-and-deployment)
{% endcontent-ref %}
{% endcolumn %}

{% column width="50%" %}
{% content-ref url="reinforcement-learning-rl-guide" %}
[reinforcement-learning-rl-guide](https://unsloth.ai/docs/get-started/reinforcement-learning-rl-guide)
{% endcontent-ref %}

{% content-ref url="../basics/unsloth-dynamic-2.0-ggufs" %}
[unsloth-dynamic-2.0-ggufs](https://unsloth.ai/docs/basics/unsloth-dynamic-2.0-ggufs)
{% endcontent-ref %}
{% endcolumn %}
{% endcolumns %}

<figure><img src="https://3215535692-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FxhOjnexMCB3dmuQFQ2Zq%2Fuploads%2Fgit-blob-134302f2507d4313b9575917c9a43b0a0028856c%2Flarge%20sloth%20wave.png?alt=media" alt="" width="188"><figcaption></figcaption></figure>
