# Page Not Found

The URL `erste-schritte/reinforcement-learning-rl-guide/preference-dpo-orpo-and-kto` does not exist.

You might be looking for one of these pages:
- [Training zur Präferenzoptimierung - DPO, ORPO & KTO](https://unsloth.ai/docs/de/los-gehts/reinforcement-learning-rl-guide/preference-dpo-orpo-and-kto.md)
- [Reinforcement Learning GRPO mit 7x längerem Kontext](https://unsloth.ai/docs/de/los-gehts/reinforcement-learning-rl-guide/grpo-long-context.md)
- [Erweiterte Dokumentation zu Reinforcement Learning](https://unsloth.ai/docs/de/los-gehts/reinforcement-learning-rl-guide/advanced-rl-documentation.md)
- [Speichereffizientes RL](https://unsloth.ai/docs/de/los-gehts/reinforcement-learning-rl-guide/memory-efficient-rl.md)
- [Vision-Reinforcement-Learning (VLM RL)](https://unsloth.ai/docs/de/los-gehts/reinforcement-learning-rl-guide/vision-reinforcement-learning-vlm-rl.md)

## How to find the correct page

1. **Browse the full index**: [/sitemap.md](https://unsloth.ai/docs/sitemap.md) - Complete documentation index
2. **View the full content**: [/llms-full.txt](https://unsloth.ai/docs/llms-full.txt) - Full content export

## Tips for requesting documentation

- For markdown responses, append `.md` to URLs (e.g., `/docs/de/los-gehts/reinforcement-learning-rl-guide/preference-dpo-orpo-and-kto.md`)
- Use `Accept: text/markdown` header for content negotiation