mirror of
https://github.com/hiyouga/LLaMA-Factory.git
synced 2025-12-15 11:20:35 +08:00
support vllm
This commit is contained in:
@@ -1,3 +1,4 @@
|
||||
Usage:
|
||||
|
||||
- `merge.sh` -> `quantize.sh`
|
||||
- `merge.sh`: merge the lora weights
|
||||
- `quantize.sh`: quantize the model with AutoGPTQ (must after merge.sh, optional)
|
||||
|
||||
Reference in New Issue
Block a user