hoshi-hiyouga
9ccfb97a2c
[misc] update format (#7277)
2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256)
2025-03-12 00:08:41 +08:00
hoshi-hiyouga
5a29f49fb1
[config] update args (#7231)
...
Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
113cc3d920
[misc] fix cli (#7204)
...
Former-commit-id: bd1722355944a3618f44a4ac3e8177c34b97dc75
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
1f4a0b11ba
[data] update vlm args (#6976)
...
Former-commit-id: 3da2cc2710c9b13ab450815a92fff14b03251984
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9
[data] add min resolution option (#6975)
...
Former-commit-id: 7faecc0301709326efa21e7a3fdb75fe0a9635c2
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
c6be9e242c
[misc] support export ollama modelfile (#6899)
...
* support export ollama modelfile
* update config
* add system and num ctx
Former-commit-id: 9184a6e0ed7ff5f632c848f861bfa448c4cd06fc
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
e8c1979b79
[model] add qwen2.5 vl models (#6779)
...
Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
1efe525df7
[model] support yarn (#6693)
...
Former-commit-id: 1f47b6186c267de86cbdbd47ba2adbf1f9db7f39
2025-01-18 13:56:09 +08:00
hiyouga
47c2d91933
support report custom args
...
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
Yaser Afshar
76ebd62ac1
Add missing key to init_kwargs
...
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb
Add trust_remote_code parameter and remove True
...
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False` to enhance security
Former-commit-id: 09437763267bc7081159a6878cee9652a2b1ddac
2024-12-17 12:25:12 +00:00
hiyouga
64bac4bc7e
support non-reentrant-gc & fix #6358
...
Former-commit-id: f319da6937964c5068864f7815efbf3847bd4ad8
2024-12-17 11:41:59 +00:00
hiyouga
e99031daa4
fix inputs
...
Former-commit-id: 446441fdb020b5a102480251cb8536dd8b3f8f99
2024-11-23 18:26:02 +00:00
hiyouga
0d18cca0db
add vllm config
...
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
huniu20
e8e98bb125
add om_hub_token argument
...
Former-commit-id: 7b91be33c9cd8473453716f0c4c6dec924304efc
2024-10-10 17:16:46 +08:00
hiyouga
4df090ff48
fix #5542
...
Former-commit-id: fe7ffccdb9a45b31e20ab7e88282a75b45504a97
2024-09-30 23:28:55 +08:00
hiyouga
0daee7cb39
support activation offloading via unsloth gc
...
Former-commit-id: fb72a3adb0916232cc9ac9f0c725c02d07b9354c
2024-09-08 01:22:19 +08:00
hiyouga
3aa6a3e45b
add e2e tests
...
Former-commit-id: 94d5b1bd8f49dabeb9e3c53d634cfb3c06b0241d
2024-09-05 21:52:28 +08:00
hiyouga
9df7a26e6b
video datasets
...
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
2024-09-05 02:04:17 +08:00
hiyouga
5ef58eb655
fix #5334
...
Former-commit-id: 59d2b31e968677263f005f57ae8a56fc758307a7
2024-09-03 19:09:42 +08:00
hiyouga
bfdcc6bacf
add rlhf-v dataset
...
Former-commit-id: 8e49940746c1a6ff910f07dbefbec14af9d0f3c6
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc
remove visual_inputs, fix qlora
...
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9
refactor mm training
...
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hiyouga
c765292093
support liger kernel
...
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
2024-08-27 11:20:14 +08:00
hiyouga
ff6fc666c1
update hparams
...
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
2024-07-03 23:18:58 +08:00
ancv
7f42932957
move efficient_packing from data_args to model_args
...
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
2024-07-02 18:37:55 +07:00
hiyouga
d2d9fa4abb
support HQQ/EETQ #4113
...
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
47651a94a3
fix #4410
...
Former-commit-id: fca893d73c3d7bbb87a816522f2e1568d3e9c612
2024-06-24 22:34:31 +08:00
stceum
9aa640f27b
Bug Fix: `off` is parsed as `False` in yaml file, changed to `disabled` to avoid this.
...
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
2024-06-24 20:39:31 +08:00
hiyouga
f25b8626bf
support pissa
...
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
96b82ccd4d
use fixture
...
Former-commit-id: 80a9e6bf94cf14fa63e6b6cdf7e1ce13722c8b5e
2024-06-15 20:06:17 +08:00
hiyouga
2946153cea
add license
...
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
a3f4925c2c
add test cases
...
Former-commit-id: b27269bd2b52fb9d43cde8a8b7f293099b0127a2
2024-06-15 04:05:54 +08:00
hiyouga
99ce085415
fix lint
...
Former-commit-id: 713fde4259233af645bade7790211064a07a2a6f
2024-06-13 00:48:44 +08:00
hiyouga
5834651c4a
fix #4198
...
Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c
2024-06-11 15:38:38 +08:00
hiyouga
4f0ce9be4e
reorganize adapter code
...
Former-commit-id: 54cd743ebfbd296ae9eaf10c33f59e127f451785
2024-06-08 00:47:23 +08:00
hoshi-hiyouga
d31c9c73c7
Merge pull request #4080 from MengqingCao/npu
...
Add npu option for model exporting
Former-commit-id: ca459f67eb319c262d5bab882e9cb3535a76eb69
2024-06-06 03:15:44 +08:00
hoshi-hiyouga
d9a372658a
Update model_args.py
...
Former-commit-id: af2c3cbee4372724cf00379d8494f5000dc8a3df
2024-06-06 03:14:23 +08:00
hiyouga
c439c959f7
add vllm_dtype arg #3387 #3717
...
Former-commit-id: 8fcc79e1e66a4e952710ac8c98685437d924361a
2024-06-06 02:53:27 +08:00
hiyouga
3fcb678d00
support train from scratch #4033 #4075
...
Former-commit-id: a12a506c3d2ba85975a5990c46d2e055cdfe0f2e
2024-06-06 02:43:19 +08:00
MengqingCao
15f6ab73a5
add npu for model export
...
Former-commit-id: 07045c876a4c6025ba56905b65cce6c9faedacb5
2024-06-05 07:06:40 +00:00
hiyouga
e4ce59243b
fix #4005 #4013
...
Former-commit-id: eed33862bc733361f3c28b3c95dc0eb4ea00884c
2024-06-03 19:12:29 +08:00
Xu Song
abe33220bf
Update model_args.py
...
Former-commit-id: dade2f083d398f83581d935f44d20dc45244c924
2024-05-31 14:35:48 +08:00
hiyouga
b88ecd71fd
fix full/freeze tuning for mllm
...
Former-commit-id: 08564838bd02651668845ed74e2e60561e5b6d8c
2024-05-27 20:37:57 +08:00
BUAADreamer
119af92620
Merge branch 'hiyouga:main' into main
...
Former-commit-id: 047a06a1e5336625acac14b3533ab3ef2b9905be
2024-05-24 09:50:00 +08:00
hiyouga
6d8ef03741
fix oom issues in export
...
Former-commit-id: 67ebc7b388c61b9d880c02d7fd217c29299fdf43
2024-05-23 23:32:45 +08:00
BUAADreamer
071d674065
support pretraining of llava
...
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hoshi-hiyouga
e4a2accf4a
Update model_args.py
...
Former-commit-id: 964672745389e35580a7010b0f010bd5ee08d542
2024-05-17 16:16:41 +08:00
juejuezi
20326affde
feat: pass the `max_lora_rank` parameter to vLLM backend
...
Former-commit-id: b20d62ba3ccc5c02529d19e22b7adcfe8b88c326
2024-05-17 16:07:39 +08:00