AMD AI Reinforcement Learning Hackathon with Unsloth

Learn practical reinforcement learning techniques for AI models with Unsloth from Daniel Han, the creator of Unsloth.

You can view Unsloth's GitHub repository here: https://github.com/unslothai/unsloth

Here is the link to our AMD fine-tuning notebook, followed by a wget command to download it directly:

https://github.com/unslothai/notebooks/blob/main/nb/gpt_oss_(20B)_Reinforcement_Learning_2048_Game_BF16.ipynb

wget 'https://raw.githubusercontent.com/unslothai/notebooks/refs/heads/main/nb/gpt_oss_(20B)_Reinforcement_Learning_2048_Game_BF16.ipynb'

If you want to update Unsloth / Unsloth Zoo:

uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm7.0 --upgrade --force-reinstall

pip uninstall unsloth unsloth_zoo -y && \
    pip install git+https://github.com/unslothai/unsloth-zoo git+https://github.com/unslothai/unsloth --no-deps --force-reinstall --no-cache-dir
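After running the update commands above, it can help to confirm which versions actually ended up in the environment. The following is a minimal sketch using only the Python standard library; the package list and the `installed_versions` helper are assumptions made for illustration and are not part of Unsloth.

```python
from importlib import metadata

# Assumed distribution names for this sketch; adjust if your environment differs.
PACKAGES = ("torch", "unsloth", "unsloth_zoo", "bitsandbytes")


def installed_versions(packages=PACKAGES):
    """Return a dict mapping each package name to its installed version, or None."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed in the current environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Run it with the same Python interpreter that pip installed into; otherwise the reported versions may come from a different environment.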

For bitsandbytes:

pip install "unsloth[amd] @ git+https://github.com/unslothai/unsloth"

If you see this error:

error: Failed to install: bitsandbytes-1.33.7rc0-py3-none-manylinux_2_24_x86_64.whl (bitsandbytes==1.33.7rc0 (from https://github.com/bitsandbytes-foundation/bitsandbytes/releases/download/continuous-release_main/bitsandbytes-1.33.7.preview-py3-none-manylinux_2_24_x86_64.whl)) Caused by: Wheel version does not match filename (0.49.2.dev0 != 1.33.7rc0), which indicates a malformed wheel. If this is intentional, set UV_SKIP_WHEEL_FILENAME_CHECK=1.

Do NOT set UV_SKIP_WHEEL_FILENAME_CHECK. Instead, install ONLY with pip install "unsloth[amd] @ git+https://github.com/unslothai/unsloth" (NOT uv), because uv breaks the bitsandbytes installation. Where appropriate, add a check to pull requests to catch these cases.
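One possible shape for such a pull request check is sketched below. It scans text (a script, a notebook cell, or a diff) for lines that set UV_SKIP_WHEEL_FILENAME_CHECK or that install unsloth[amd] or bitsandbytes through uv. The function name `find_risky_install_lines` and the matching patterns are hypothetical choices for this sketch, not an existing Unsloth or uv tool, and plain `uv pip install` lines for other packages (such as torch) are deliberately not flagged.

```python
import re

# Hypothetical patterns for this sketch; adapt them to your repository.
SKIP_FLAG = "UV_SKIP_WHEEL_FILENAME_CHECK"
UV_INSTALL = re.compile(r"\buv\s+pip\s+install\b")
BNB_TARGETS = ("unsloth[amd]", "bitsandbytes")


def find_risky_install_lines(text):
    """Return (line_number, line) pairs for install lines this guide warns against."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        uses_skip_flag = SKIP_FLAG in line
        uv_installs_bnb = bool(UV_INSTALL.search(line)) and any(
            target in line for target in BNB_TARGETS
        )
        if uses_skip_flag or uv_installs_bnb:
            hits.append((number, line.strip()))
    return hits


if __name__ == "__main__":
    sample = (
        "uv pip install 'unsloth[amd] @ git+https://github.com/unslothai/unsloth'\n"
        "pip install 'unsloth[amd] @ git+https://github.com/unslothai/unsloth'\n"
        "export UV_SKIP_WHEEL_FILENAME_CHECK=1\n"
    )
    for number, line in find_risky_install_lines(sample):
        print(f"line {number}: {line}")
```

A CI job could call this on the changed files and fail when the returned list is non-empty. Simple substring matching like this can produce false positives, for example when a document quotes the error message itself.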

For AMD installation instructions, see our guide here:

AMD
