hoshi-hiyouga
5a29f49fb1
[config] update args ( #7231 )
...
Former-commit-id: ed8b12e3cb
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
98ea0e8109
[misc] fix ds config ( #7205 )
...
Former-commit-id: db113f690e
2025-03-07 15:21:28 +08:00
hoshi-hiyouga
6e58115f98
[trainer] update config ( #7174 )
...
Former-commit-id: b4b89b4ff3
2025-03-05 23:32:54 +08:00
hoshi-hiyouga
ca78ba964d
[model] add models ( #7054 )
...
* add qwen25vl awq models
* add moonlight
Former-commit-id: ec1a1bc118
2025-02-24 22:05:13 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args ( #6976 )
...
Former-commit-id: 3da2cc2710
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option ( #6975 )
...
Former-commit-id: 7faecc0301
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
07aa7b71a3
[misc] update readme ( #6917 )
...
Former-commit-id: 499ea45d1f
2025-02-13 00:58:10 +08:00
Eric Tang
e515fe62de
[example] fix path to ray example ( #6906 )
...
Former-commit-id: ee5fe216dc
2025-02-13 00:29:32 +08:00
hoshi-hiyouga
197aa3baf4
[data] fix ollama template ( #6902 )
...
* fix ollama template
* add meta info
* use half precision
Former-commit-id: e1a7c1242c
2025-02-11 22:43:09 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile ( #6899 )
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
bbf334f823
disable valset by default ( #6690 )
...
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU ( #6601 )
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e
run style check
...
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918
drafting ray integration
...
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com >
Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
Yaser Afshar
76ebd62ac1
Add missing key to init_kwargs
...
Former-commit-id: 1c8ad22a5f
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hiyouga
ba901bc000
update assets
...
Former-commit-id: 7059055e89
2024-12-14 17:36:03 +00:00
hiyouga
6f1e450739
fix mrope
...
Former-commit-id: 2811814fc4
2024-12-12 15:08:17 +00:00
hiyouga
cf8cad8e7e
support qwen2vl train proj only
...
Former-commit-id: 99c62660c6
2024-12-05 10:37:42 +00:00
hiyouga
90fb5605c1
update examples
...
Former-commit-id: e5584dc7ba
2024-12-05 08:48:25 +00:00
hiyouga
235cdcacee
support batch infer in vllm
...
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
3f7c874594
update tests
...
Former-commit-id: 93d3b8f43f
2024-11-02 12:41:44 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
hiyouga
3aa6a3e45b
add e2e tests
...
Former-commit-id: 94d5b1bd8f
2024-09-05 21:52:28 +08:00
hiyouga
bfdcc6bacf
add rlhf-v dataset
...
Former-commit-id: 8e49940746
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc
remove visual_inputs, fix qlora
...
Former-commit-id: a025c3df61
2024-08-31 00:24:51 +08:00
hiyouga
c883542583
add examples
...
Former-commit-id: e08045a946
2024-08-30 21:43:19 +08:00
hiyouga
a83756b5e9
refactor mm training
...
Former-commit-id: 3382317e32
2024-08-30 02:14:31 +08:00
simonJJJ
8a09b1e732
initial-commit
...
Former-commit-id: aeb85f200b
2024-08-28 16:51:35 +08:00
hiyouga
f8c11bd540
update examples
...
Former-commit-id: 0a690ada6f
2024-08-09 20:13:46 +08:00
hiyouga
5eacd17090
add adam_mini to readme
...
Former-commit-id: e2a28f51c6
2024-08-09 20:02:03 +08:00
hiyouga
25b9cfa163
update scripts
...
Former-commit-id: 86f7099fa3
2024-08-09 19:16:23 +08:00
hiyouga
b5146facff
follow #5115
...
Former-commit-id: c87023d539
2024-08-09 18:03:00 +08:00
codingma
17c73b44da
fix eval_dataset in example
...
Former-commit-id: 823e7c122b
2024-08-07 18:24:19 +08:00
hiyouga
fae881b854
fix #4944
...
Former-commit-id: 1bbd49faae
2024-07-24 16:42:51 +08:00
hoshi-hiyouga
df1f0a1258
Update llama3_lora_eval.yaml
...
Former-commit-id: 91ba083f37
2024-07-15 22:55:12 +08:00
codingma
76046dfda8
1. change the task name format
...
2. delete split param in data_args.py
Former-commit-id: 645211dc01
2024-07-15 09:55:33 +08:00
hiyouga
14bc7b0551
fix up
...
Former-commit-id: 29ebcd75d5
2024-07-15 01:04:56 +08:00
hoshi-hiyouga
f9a4d96194
Update llava1_5.yaml
...
Former-commit-id: f618b80fa2
2024-07-13 20:30:06 +08:00
codingma
1ccc6153c7
1. fix output_dir in llama3_lora_pretrain.yaml
...
2. add llava1_5.yaml for inference
Former-commit-id: 982a1cdd24
2024-07-13 13:16:22 +08:00
hiyouga
d97bb11821
update pissa example
...
Former-commit-id: c9bb0757ec
2024-07-06 15:47:32 +08:00
hiyouga
2105cf6000
update examples
...
Former-commit-id: 2f78b5d62a
2024-06-28 01:17:07 +08:00
hiyouga
6e03536dca
update examples
...
Former-commit-id: d417e63f92
2024-06-27 00:53:33 +08:00
hiyouga
a225b5a70c
tiny fix about badam
...
Former-commit-id: 095fab58d3
2024-06-25 01:54:53 +08:00