Easily run & train models locally.

Join our Discord Start for free

Latest News

Run GLM-5.2 locally!Jun 18, 2026

Run DeepSeek-V4Jul 5, 2026

DiffusionGemmaJun 12, 2026

Qwen3.6 MTP is out now!Apr 22, 2026

View more news

Run models locally

Unsloth Studio runs 100% offline on your Mac and Windows device. Run GGUF and Safetensors models with tool-calling, web search, and OpenAI compatible API.

Compare models side by side and upload images, docs, audio, code files and more.

Learn more

Run models locally on Mac, Windows, Linux

Train models with no-code and full observability.

No-code training

Auto-create datasets from PDF, CSV, JSON docs and start training with real-time observability.

Unsloth's custom kernels supports optimized training for LoRA, FP8, FFT, PT and 500+ models including text, vision, audio and embeddings.

Quickstart Learn more

Unlimited Tool calling + Web search

Unsloth Studio lets LLMs run unlimited web search and execute Bash and Python, not just JavaScript. It sandboxes programs like Claude Artifacts so models can test code, generate files + verify answers with real computation.

E.g. Qwen3.5-4B searched 20+ websites and cited sources, with web search happening inside its thinking trace.

Learn more

Data Recipes

Data Recipes transforms your docs into useable datasets via graph-node workflow. Upload unstructured or structured files like PDFs, CSV and JSON. Unsloth Data Recipes auto turns documents into your desired formats.

Quickstart Learn more

Export models

Export any model, including your fine-tuned models, to safetensors, or GGUF for use with llama.cpp, vLLM, Ollama, and more.

Learn more

Don’t believe us?

Why not try our fully free open source version? Finetune 2X faster on a single NVIDIA GPU for free on Google Colab or Kaggle Notebooks.

Get access now

We'll share monthly updates!

Subscribe now

Train your own custom model in 24 hrs, not 30 days.

30x faster than FA2 + 30% accuracy

90% less memory usage than FA2

audio, embedding, vision support

The details

We're making AI more accessible to everyone

Find out more

Unsloth makes everything greener

As hardware costs rise and performance gains plateau, we use our math and coding skills to make models train and run smarter + faster.

Want lightning fast inference? We’re working on it!

Don't forget to join our newsletter!

By registering you agree to unsloth's Terms of Service and Privacy Policy,

Subscribe now

MultiGPU Docs

Even better multiGPU in the works!

Don't forget to join our newsletter!

Pricing

Free
Freeware of our standard version of unsloth
Get started
Open-source
Supports Mistral, Gemma
Supports LLama 1, 2, 3
MultiGPU - coming soon
Supports 4 bit, 16 bit LoRA

unsloth Pro
2.5x faster training + 20% less VRAM
Contact us
2.5x number of GPUs faster than FA2
20% less memory than OSS
Enhanced MultiGPU support
Up to 8 GPUS support
For any usecase

unsloth Enterprise
Unlock 30x faster training + multi-node support + 30% accuracy
Contact us
32x number of GPUs faster than FA2
up to +30% accuracy
5x faster inference
Supports full training
All Pro plan features
Multi-node support
Customer support

Ready to use unsloth?

Get started for free

Easily run & train models locally.

Latest News

Run models locally

No-code training

Unlimited Tool calling + Web search

Data Recipes

Export models

Don’t believe us?

Train your own custom model in 24 hrs, not 30 days.

The details

We're making AI more accessible to everyone

Unsloth makes everything greener

As hardware costs rise and performance gains plateau, we use our math and coding skills to make models train and run smarter + faster.

Want lightning fast inference? We’re working on it!

Don't forget to join our newsletter!

Even better multiGPU in the works!

Don't forget to join our newsletter!

Pricing

FreeFreeware of our standard version of unslothGet startedOpen-sourceSupports Mistral, GemmaSupports LLama 1, 2, 3MultiGPU - coming soonSupports 4 bit, 16 bit LoRA

unsloth Pro2.5x faster training + 20% less VRAMContact us2.5x number of GPUs faster than FA220% less memory than OSSEnhanced MultiGPU supportUp to 8 GPUS supportFor any usecase

unsloth EnterpriseUnlock 30x faster training + multi-node support + 30% accuracyContact us32x number of GPUs faster than FA2up to +30% accuracy5x faster inferenceSupports full trainingAll Pro plan featuresMulti-node supportCustomer support

Ready to use unsloth?

Free
Freeware of our standard version of unsloth
Get started
Open-source
Supports Mistral, Gemma
Supports LLama 1, 2, 3
MultiGPU - coming soon
Supports 4 bit, 16 bit LoRA

unsloth Pro
2.5x faster training + 20% less VRAM
Contact us
2.5x number of GPUs faster than FA2
20% less memory than OSS
Enhanced MultiGPU support
Up to 8 GPUS support
For any usecase

unsloth Enterprise
Unlock 30x faster training + multi-node support + 30% accuracy
Contact us
32x number of GPUs faster than FA2
up to +30% accuracy
5x faster inference
Supports full training
All Pro plan features
Multi-node support
Customer support