# Export fine-tuned models to GGUF and deploy on Ollama
After training with Soup CLI, export your model to GGUF format and serve it locally with Ollama in three commands.
## 1. Export to GGUF

```bash
soup export --adapter ./runs/my-model/latest \
  --format gguf \
  --quant q4_k_m \
  --output ./my-model.gguf
```

Quantization levels:
| Quant | Size (7B) | Quality | Use case |
|---|---|---|---|
| q4_k_m | ~4.1 GB | Good | Default: best size/quality balance |
| q5_k_m | ~4.8 GB | Better | When you need higher accuracy |
| q8_0 | ~7.5 GB | Near-lossless | Benchmarks, eval |
| q2_k | ~2.6 GB | Lower | Tiny devices, Raspberry Pi |
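The sizes above follow from a simple estimate: parameter count times bits-per-weight, divided by 8. A rough sketch (the bits-per-weight figures here are my own approximations backed out of the table, not exact k-quant block layouts):

```python
# Rough GGUF file-size estimate: params x bits-per-weight / 8.
# Bits-per-weight values are approximations, not exact quant specs.
APPROX_BPW = {"q2_k": 3.0, "q4_k_m": 4.7, "q5_k_m": 5.5, "q8_0": 8.6}

def est_size_gb(n_params: float, quant: str) -> float:
    """Estimated file size in decimal gigabytes."""
    return n_params * APPROX_BPW[quant] / 8 / 1e9

for q in ("q2_k", "q4_k_m", "q5_k_m", "q8_0"):
    print(f"{q}: ~{est_size_gb(7e9, q):.1f} GB")
```

Actual files vary a little because GGUF stores some tensors (embeddings, norms) at higher precision, plus metadata.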
## 2. Deploy to Ollama

Soup CLI ships with Ollama integration (v0.18.0+):

```bash
soup deploy ollama \
  --model ./my-model.gguf \
  --name my-model \
  --template chat
```

This creates an Ollama Modelfile, imports the GGUF, and registers your model.
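For reference, the generated Modelfile might look something like this. This is a hand-written sketch using standard Ollama Modelfile directives; the exact template and parameters Soup CLI emits may differ:

```
FROM ./my-model.gguf

# Chat-style prompt template (Ollama's Go template syntax);
# the tag names here are placeholders and depend on your base model.
TEMPLATE """{{ if .System }}<|system|>{{ .System }}{{ end }}<|user|>{{ .Prompt }}<|assistant|>"""

PARAMETER temperature 0.7
PARAMETER stop "<|user|>"
```

If you ever need to import a GGUF by hand, the equivalent is `ollama create my-model -f Modelfile`.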
## 3. Chat

```bash
ollama run my-model
```

Or via the API:
```bash
curl http://localhost:11434/api/generate -d '{
  "model": "my-model",
  "prompt": "Hello!"
}'
```

## One-liner: train → export → deploy
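By default, `/api/generate` streams its reply as newline-delimited JSON objects, each carrying a `response` fragment, with a final record where `done` is `true`. A minimal Python sketch for reassembling the streamed text, demonstrated against hardcoded sample chunks rather than a live server:

```python
import json

def collect_response(lines):
    """Join the `response` fragments from Ollama's NDJSON stream."""
    parts = []
    for line in lines:
        chunk = json.loads(line)
        parts.append(chunk.get("response", ""))
        if chunk.get("done"):
            break
    return "".join(parts)

# Sample chunks in the shape the API streams back
sample = [
    '{"model": "my-model", "response": "Hello", "done": false}',
    '{"model": "my-model", "response": " there!", "done": false}',
    '{"model": "my-model", "response": "", "done": true}',
]
print(collect_response(sample))  # Hello there!
```

Passing `"stream": false` in the request body returns a single JSON object instead, if you'd rather skip the reassembly.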
```bash
soup train --config soup.yaml && \
  soup export --adapter ./runs/latest --format gguf --quant q4_k_m && \
  soup deploy ollama --model ./runs/latest/model.gguf --name my-model
```

## Other export formats
Soup CLI also supports:

- **ONNX** (`--format onnx`) for cross-platform inference
- **TensorRT-LLM** (`--format tensorrt`) for NVIDIA-optimized serving
- **AWQ / GPTQ** (`--format awq` or `--format gptq`) for quantized GPU inference
- **Hugging Face** (`--format hf`) for a merged checkpoint
## Related
- [Serving with vLLM and SGLang](/docs/serving)
- [Fine-tune Llama 3.1 with LoRA](/docs/fine-tune-llama-3-1-lora)