hoshi-hiyouga
5817cda37e
[misc] fix packing and eval plot (#7623)
2025-04-07 18:20:57 +08:00
hoshi-hiyouga
6e58115f98
[trainer] update config (#7174)
Former-commit-id: b4b89b4ff3bc03aa388569e253d62580755a77a5
2025-03-05 23:32:54 +08:00
hoshi-hiyouga
ca78ba964d
[model] add models (#7054)
* add qwen25vl awq models
* add moonlight
Former-commit-id: ec1a1bc1184d13188029e19c1d4e7de68707aaf6
2025-02-24 22:05:13 +08:00
hoshi-hiyouga
bbf334f823
disable valset by default (#6690)
Former-commit-id: 77bbf659053e1b205974eb6df69998fee0305d26
2025-01-17 21:09:30 +08:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False` to enhance security
Former-commit-id: 09437763267bc7081159a6878cee9652a2b1ddac
2024-12-17 12:25:12 +00:00
hiyouga
0d18cca0db
add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
codingma
1ccc6153c7
1. fix output_dir in llama3_lora_pretrain.yaml
2. add llava1_5.yaml for inference
Former-commit-id: 982a1cdd24dfa51535af3e49c7ea80fddc95b0ee
2024-07-13 13:16:22 +08:00
hiyouga
2105cf6000
update examples
Former-commit-id: 2f78b5d62a34ea4d157bbe91a253859d25c8a7fe
2024-06-28 01:17:07 +08:00
hiyouga
0926d81053
update examples
Former-commit-id: b6e008c152421db668c971b0828cbee6a80b16bc
2024-06-13 03:15:06 +08:00