hoshi-hiyouga
|
ed8b12e3cb
|
[config] update args (#7231)
|
2025-03-10 23:04:43 +08:00 |
|
hoshi-hiyouga
|
db113f690e
|
[misc] fix ds config (#7205)
|
2025-03-07 15:21:28 +08:00 |
|
hoshi-hiyouga
|
b4b89b4ff3
|
[trainer] update config (#7174)
|
2025-03-05 23:32:54 +08:00 |
|
hoshi-hiyouga
|
ec1a1bc118
|
[model] add models (#7054)
* add qwen25vl awq models
* add moonlight
|
2025-02-24 22:05:13 +08:00 |
|
hoshi-hiyouga
|
3da2cc2710
|
[data] update vlm args (#6976)
|
2025-02-18 02:12:51 +08:00 |
|
hoshi-hiyouga
|
7faecc0301
|
[data] add min resolution option (#6975)
|
2025-02-18 01:40:46 +08:00 |
|
hoshi-hiyouga
|
499ea45d1f
|
[misc] update readme (#6917)
|
2025-02-13 00:58:10 +08:00 |
|
Eric Tang
|
ee5fe216dc
|
[example] fix path to ray example (#6906)
|
2025-02-13 00:29:32 +08:00 |
|
hoshi-hiyouga
|
e1a7c1242c
|
[data] fix ollama template (#6902)
* fix ollama template
* add meta info
* use half precision
|
2025-02-11 22:43:09 +08:00 |
|
hoshi-hiyouga
|
9184a6e0ed
|
[misc] support export ollama modelfile (#6899)
* support export ollama modelfile
* update config
* add system and num ctx
|
2025-02-11 19:52:25 +08:00 |
|
hoshi-hiyouga
|
77bbf65905
|
disable valset by default (#6690)
|
2025-01-17 21:09:30 +08:00 |
|
hoshi-hiyouga
|
7a04021d04
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
d9189f9f0b
|
[optim] add support to APOLLO (#6617)
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
e3e2c8c689
|
[inference] fix stop token for object detection (#6624)
* fix stop token
* update minicpm data pipeline
* fix npu qlora examples
|
2025-01-13 21:34:20 +08:00 |
|
codingma
|
03de5ac912
|
add nf4 qlora support on Ascend NPU (#6601)
* add nf4 qlora support on Ascend NPU
* add transformers version check
* add python>=3.10 requirement description for npu
* tiny fix
---------
Co-authored-by: hoshi-hiyouga <hiyouga@buaa.edu.cn>
|
2025-01-13 19:43:36 +08:00 |
|
hiyouga
|
f6f630a1c9
|
refactor mllm param logic
|
2025-01-10 15:45:48 +00:00 |
|
hiyouga
|
d8cac6f546
|
refactor ray integration, support save ckpt
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
1e8e7be0a5
|
run style check
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
163ddb680b
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
|
2025-01-07 08:55:44 +00:00 |
|
Yaser Afshar
|
1c8ad22a5f
|
Add missing key to init_kwargs
|
2024-12-17 12:34:05 +00:00 |
|
Yaser Afshar
|
0943776326
|
Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
to enhance security
|
2024-12-17 12:25:12 +00:00 |
|
hiyouga
|
7059055e89
|
update assets
|
2024-12-14 17:36:03 +00:00 |
|
hiyouga
|
2811814fc4
|
fix mrope
|
2024-12-12 15:08:17 +00:00 |
|
hiyouga
|
99c62660c6
|
support qwen2vl train proj only
|
2024-12-05 10:37:42 +00:00 |
|
hiyouga
|
e5584dc7ba
|
update examples
|
2024-12-05 08:48:25 +00:00 |
|
hiyouga
|
1324d158f9
|
support batch infer in vllm
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
58ab4579dc
|
add vllm config
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
93d3b8f43f
|
update tests
|
2024-11-02 12:41:44 +08:00 |
|
hiyouga
|
21db8ed2f4
|
use pre-commit
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
94d5b1bd8f
|
add e2e tests
|
2024-09-05 21:52:28 +08:00 |
|
hiyouga
|
8e49940746
|
add rlhf-v dataset
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
a025c3df61
|
remove visual_inputs, fix qlora
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
e08045a946
|
add examples
|
2024-08-30 21:43:19 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
simonJJJ
|
aeb85f200b
|
initial-commit
|
2024-08-28 16:51:35 +08:00 |
|
hiyouga
|
0a690ada6f
|
update examples
|
2024-08-09 20:13:46 +08:00 |
|
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
86f7099fa3
|
update scripts
|
2024-08-09 19:16:23 +08:00 |
|
hiyouga
|
c87023d539
|
follow #5115
|
2024-08-09 18:03:00 +08:00 |
|
codingma
|
823e7c122b
|
fix eval_dataset in example
|
2024-08-07 18:24:19 +08:00 |
|
hiyouga
|
1bbd49faae
|
fix #4944
|
2024-07-24 16:42:51 +08:00 |
|
hoshi-hiyouga
|
91ba083f37
|
Update llama3_lora_eval.yaml
|
2024-07-15 22:55:12 +08:00 |
|
codingma
|
645211dc01
|
1. change the task name format
2. delete split param in data_args.py
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
f618b80fa2
|
Update llava1_5.yaml
|
2024-07-13 20:30:06 +08:00 |
|
codingma
|
982a1cdd24
|
1. fix output_dir in llama3_lora_pretrain.yaml
2. add llava1_5.yaml for inference
|
2024-07-13 13:16:22 +08:00 |
|
hiyouga
|
c9bb0757ec
|
update pissa example
|
2024-07-06 15:47:32 +08:00 |
|
hiyouga
|
2f78b5d62a
|
update examples
|
2024-06-28 01:17:07 +08:00 |
|
hiyouga
|
d417e63f92
|
update examples
|
2024-06-27 00:53:33 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|