Phase 4: Qwen Fine-tuning
Fine-tune Qwen2.5-Coder-14B with LoRA on HuggingFace Skills A100 GPU. Runs 3 parallel sweep configs (conservative / standard / aggressive) then full training on top-2.
Status: ready
Key Metrics
| Metric | Value |
|---|---|
model | Qwen2.5-Coder-14B-Instruct |
hardware | A100 80GB (cloud) |
sweep configs | 3 |
lora r range | 16–64 |
Technologies
HF Skills APIPEFT LoRAbitsandbytesA100 80GB
Outputs
- Fine-tuned LoRA weights on HF Hub
Commands
python -m src.phase4_qwen_finetuning.scripts.launch_validation_sweep --config src/config/v6_config.yaml
python -m src.phase4_qwen_finetuning.scripts.monitor_jobs --config src/config/v6_config.yaml