hoshi-hiyouga
bbf334f823
disable valset by default ( #6690 )
...
Former-commit-id: 77bbf65905
2025-01-17 21:09:30 +08:00
hoshi-hiyouga
9ef85f8fc4
[optim] clean apollo ( #6645 )
...
* clean apollo code
* update readme
Former-commit-id: 7a04021d04
2025-01-15 01:42:50 +08:00
zhuHQ
763f9b9df0
[optim] add support to APOLLO ( #6617 )
...
Former-commit-id: d9189f9f0b
2025-01-15 00:24:56 +08:00
hoshi-hiyouga
d8cba9464f
[inference] fix stop token for object detection ( #6624 )
...
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
Former-commit-id: e3e2c8c689
2025-01-13 21:34:20 +08:00
codingma
089c7d5e51
add nf4 qlora support on Ascend NPU ( #6601 )
...
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn >
Former-commit-id: 03de5ac912
2025-01-13 19:43:36 +08:00
hiyouga
c89d17ab63
refactor mllm param logic
...
Former-commit-id: f6f630a1c9
2025-01-10 15:45:48 +00:00
hiyouga
b4174021d6
refactor ray integration, support save ckpt
...
Former-commit-id: d8cac6f546
2025-01-07 09:39:10 +00:00
Eric Tang
bba52e258e
run style check
...
Former-commit-id: 1e8e7be0a5
2025-01-07 08:55:44 +00:00
Kourosh Hakhamaneshi
1217240918
drafting ray integration
...
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com >
Former-commit-id: 163ddb680b
2025-01-07 08:55:44 +00:00
Yaser Afshar
76ebd62ac1
Add missing key to init_kwargs
...
Former-commit-id: 1c8ad22a5f
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
Former-commit-id: 0943776326
2024-12-17 12:25:12 +00:00
hiyouga
ba901bc000
update assets
...
Former-commit-id: 7059055e89
2024-12-14 17:36:03 +00:00
hiyouga
6f1e450739
fix mrope
...
Former-commit-id: 2811814fc4
2024-12-12 15:08:17 +00:00
hiyouga
cf8cad8e7e
support qwen2vl train proj only
...
Former-commit-id: 99c62660c6
2024-12-05 10:37:42 +00:00
hiyouga
90fb5605c1
update examples
...
Former-commit-id: e5584dc7ba
2024-12-05 08:48:25 +00:00
hiyouga
235cdcacee
support batch infer in vllm
...
Former-commit-id: 1324d158f9
2024-12-04 13:50:00 +00:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc
2024-11-10 21:28:18 +08:00
hiyouga
3f7c874594
update tests
...
Former-commit-id: 93d3b8f43f
2024-11-02 12:41:44 +08:00
hiyouga
0d8aa6e6ef
use pre-commit
...
Former-commit-id: 21db8ed2f4
2024-10-29 09:07:46 +00:00
hiyouga
3aa6a3e45b
add e2e tests
...
Former-commit-id: 94d5b1bd8f
2024-09-05 21:52:28 +08:00
hiyouga
bfdcc6bacf
add rlhf-v dataset
...
Former-commit-id: 8e49940746
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc
remove visual_inputs, fix qlora
...
Former-commit-id: a025c3df61
2024-08-31 00:24:51 +08:00
hiyouga
c883542583
add examples
...
Former-commit-id: e08045a946
2024-08-30 21:43:19 +08:00
hiyouga
a83756b5e9
refactor mm training
...
Former-commit-id: 3382317e32
2024-08-30 02:14:31 +08:00
simonJJJ
8a09b1e732
initial-commit
...
Former-commit-id: aeb85f200b
2024-08-28 16:51:35 +08:00
hiyouga
f8c11bd540
update examples
...
Former-commit-id: 0a690ada6f
2024-08-09 20:13:46 +08:00
hiyouga
5eacd17090
add adam_mini to readme
...
Former-commit-id: e2a28f51c6
2024-08-09 20:02:03 +08:00
hiyouga
25b9cfa163
update scripts
...
Former-commit-id: 86f7099fa3
2024-08-09 19:16:23 +08:00
hiyouga
b5146facff
follow #5115
...
Former-commit-id: c87023d539
2024-08-09 18:03:00 +08:00
codingma
17c73b44da
fix eval_dataset in example
...
Former-commit-id: 823e7c122b
2024-08-07 18:24:19 +08:00
hiyouga
fae881b854
fix #4944
...
Former-commit-id: 1bbd49faae
2024-07-24 16:42:51 +08:00
hoshi-hiyouga
df1f0a1258
Update llama3_lora_eval.yaml
...
Former-commit-id: 91ba083f37
2024-07-15 22:55:12 +08:00
codingma
76046dfda8
1. change the task name format
...
2. delete split param in data_args.py
Former-commit-id: 645211dc01
2024-07-15 09:55:33 +08:00
hiyouga
14bc7b0551
fix up
...
Former-commit-id: 29ebcd75d5
2024-07-15 01:04:56 +08:00
hoshi-hiyouga
f9a4d96194
Update llava1_5.yaml
...
Former-commit-id: f618b80fa2
2024-07-13 20:30:06 +08:00
codingma
1ccc6153c7
1. fix output_dir in llama3_lora_pretrain.yaml
...
2. add llava1_5.yaml for inference
Former-commit-id: 982a1cdd24
2024-07-13 13:16:22 +08:00
hiyouga
d97bb11821
update pissa example
...
Former-commit-id: c9bb0757ec
2024-07-06 15:47:32 +08:00
hiyouga
2105cf6000
update examples
...
Former-commit-id: 2f78b5d62a
2024-06-28 01:17:07 +08:00
hiyouga
6e03536dca
update examples
...
Former-commit-id: d417e63f92
2024-06-27 00:53:33 +08:00
hiyouga
a225b5a70c
tiny fix about badam
...
Former-commit-id: 095fab58d3
2024-06-25 01:54:53 +08:00
Jonery
bc1c082bc2
add example
...
Former-commit-id: 97c5235160
2024-06-18 13:50:26 +08:00
hiyouga
004f289074
tiny fix
...
Former-commit-id: 2bf2863a58
2024-06-17 17:47:25 +08:00
hiyouga
f25b8626bf
support pissa
...
Former-commit-id: 8c1046d78a
2024-06-16 01:08:12 +08:00
hiyouga
d4ce280fbc
Update README.md
...
Former-commit-id: 2d43b8bb49
2024-06-13 16:02:21 +08:00
hiyouga
f81a839197
update examples
...
Former-commit-id: 892e561c28
2024-06-13 03:26:10 +08:00
hiyouga
4c40171c55
Update llama3_full_sft_ds3.yaml
...
Former-commit-id: a19cdd39fe
2024-06-13 03:16:20 +08:00
hiyouga
0926d81053
update examples
...
Former-commit-id: b6e008c152
2024-06-13 03:15:06 +08:00
hiyouga
cceff9f520
lora modules: all by default
...
Former-commit-id: cae4737907
2024-06-06 03:53:28 +08:00
hiyouga
00b3fb4d14
update train hparams
...
Former-commit-id: dc4a00dd63
2024-06-06 01:49:20 +08:00
hiyouga
0eff6a66d5
tiny fix
...
Former-commit-id: 5a13b3baa6
2024-06-04 00:31:10 +08:00