51 Commits

Author SHA1 Message Date
hoshi-hiyouga
9ccfb97a2c
[misc] update format (#7277) 2025-03-13 02:53:08 +08:00
hoshi-hiyouga
7c1640ed5f
[misc] upgrade format to py39 (#7256) 2025-03-12 00:08:41 +08:00
hoshi-hiyouga
5a29f49fb1 [config] update args (#7231)
Former-commit-id: ed8b12e3cbdaa85f5bde619081b86717a1f3c5fa
2025-03-10 23:04:43 +08:00
hoshi-hiyouga
113cc3d920 [misc] fix cli (#7204)
Former-commit-id: bd1722355944a3618f44a4ac3e8177c34b97dc75
2025-03-07 15:01:18 +08:00
hoshi-hiyouga
1f4a0b11ba [data] update vlm args (#6976)
Former-commit-id: 3da2cc2710c9b13ab450815a92fff14b03251984
2025-02-18 02:12:51 +08:00
hoshi-hiyouga
b1d31ff0f9 [data] add min resolution option (#6975)
Former-commit-id: 7faecc0301709326efa21e7a3fdb75fe0a9635c2
2025-02-18 01:40:46 +08:00
hoshi-hiyouga
c6be9e242c [misc] support export ollama modelfile (#6899)
* support export ollama modelfile

* update config

* add system and num ctx

Former-commit-id: 9184a6e0ed7ff5f632c848f861bfa448c4cd06fc
2025-02-11 19:52:25 +08:00
hoshi-hiyouga
e8c1979b79 [model] add qwen2.5 vl models (#6779)
Former-commit-id: 999c7c8fe0caf6b837a1bdc2c6a24fafec327cd8
2025-01-31 03:00:29 +08:00
hoshi-hiyouga
1efe525df7 [model] support yarn (#6693)
Former-commit-id: 1f47b6186c267de86cbdbd47ba2adbf1f9db7f39
2025-01-18 13:56:09 +08:00
hiyouga
47c2d91933 support report custom args
Former-commit-id: 5111cac6f8e7b77ef1ca1ff967734cfe1d6785f4
2024-12-21 21:42:45 +00:00
Yaser Afshar
76ebd62ac1 Add missing key to init_kwargs
Former-commit-id: 1c8ad22a5f167bf4e1c845e273583e5cb3a0214e
2024-12-17 12:34:05 +00:00
Yaser Afshar
fe4546a7bb Add trust_remote_code parameter and remove True
- Introduced a new model parameter `trust_remote_code`
- Set the default value of `trust_remote_code` to `False`
  to enhance security


Former-commit-id: 09437763267bc7081159a6878cee9652a2b1ddac
2024-12-17 12:25:12 +00:00
hiyouga
64bac4bc7e support non-reenterent-gc & fix #6358
Former-commit-id: f319da6937964c5068864f7815efbf3847bd4ad8
2024-12-17 11:41:59 +00:00
hiyouga
e99031daa4 fix inputs
Former-commit-id: 446441fdb020b5a102480251cb8536dd8b3f8f99
2024-11-23 18:26:02 +00:00
hiyouga
0d18cca0db add vllm config
Former-commit-id: 58ab4579dc81a1dcea2bf5938ba3f3116cecfc76
2024-11-10 21:28:18 +08:00
huniu20
e8e98bb125 add om_hub_token argument
Former-commit-id: 7b91be33c9cd8473453716f0c4c6dec924304efc
2024-10-10 17:16:46 +08:00
hiyouga
4df090ff48 fix #5542
Former-commit-id: fe7ffccdb9a45b31e20ab7e88282a75b45504a97
2024-09-30 23:28:55 +08:00
hiyouga
0daee7cb39 support activation offloading via unsloth gc
Former-commit-id: fb72a3adb0916232cc9ac9f0c725c02d07b9354c
2024-09-08 01:22:19 +08:00
hiyouga
3aa6a3e45b add e2e tests
Former-commit-id: 94d5b1bd8f49dabeb9e3c53d634cfb3c06b0241d
2024-09-05 21:52:28 +08:00
hiyouga
9df7a26e6b video datasets
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
2024-09-05 02:04:17 +08:00
hiyouga
5ef58eb655 fix #5334
Former-commit-id: 59d2b31e968677263f005f57ae8a56fc758307a7
2024-09-03 19:09:42 +08:00
hiyouga
bfdcc6bacf add rlhf-v dataset
Former-commit-id: 8e49940746c1a6ff910f07dbefbec14af9d0f3c6
2024-09-01 22:57:41 +08:00
hiyouga
f31e7e0dfc remove visual_inputs, fix qlora
Former-commit-id: a025c3df61db154bef13033518903bbf846f4fc8
2024-08-31 00:24:51 +08:00
hiyouga
a83756b5e9 refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
2024-08-30 02:14:31 +08:00
hiyouga
c765292093 support liger kernel
Former-commit-id: 72bc8f01111ad69b92a647b54b4af988515d9c34
2024-08-27 11:20:14 +08:00
hiyouga
ff6fc666c1 update hparams
Former-commit-id: 575a02a23d9b41d00ca6291d8a40b5bdb3cbeeec
2024-07-03 23:18:58 +08:00
ancv
7f42932957 move efficient_packing from data_args to model_args
Former-commit-id: e8e13b09423dd08a31a3bde8f85833c6e5d43ee5
2024-07-02 18:37:55 +07:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hiyouga
47651a94a3 fix #4410
Former-commit-id: fca893d73c3d7bbb87a816522f2e1568d3e9c612
2024-06-24 22:34:31 +08:00
stceum
9aa640f27b Bug Fix: off is parsed as False in yaml file, changed to disabled to avoid this.
Former-commit-id: 3ed063f281d1c2563df1b9eb3800543208c9dc16
2024-06-24 20:39:31 +08:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
96b82ccd4d use fixture
Former-commit-id: 80a9e6bf94cf14fa63e6b6cdf7e1ce13722c8b5e
2024-06-15 20:06:17 +08:00
hiyouga
2946153cea add license
Former-commit-id: d87108daa68bd40174b262be1ca65fe6e1b7ab56
2024-06-15 17:54:33 +08:00
hiyouga
a3f4925c2c add test cases
Former-commit-id: b27269bd2b52fb9d43cde8a8b7f293099b0127a2
2024-06-15 04:05:54 +08:00
hiyouga
99ce085415 fix lint
Former-commit-id: 713fde4259233af645bade7790211064a07a2a6f
2024-06-13 00:48:44 +08:00
hiyouga
5834651c4a fix #4198
Former-commit-id: 89f2bd8c8c035181927bd530a7ffc733407d674c
2024-06-11 15:38:38 +08:00
hiyouga
4f0ce9be4e reorganize adapter code
Former-commit-id: 54cd743ebfbd296ae9eaf10c33f59e127f451785
2024-06-08 00:47:23 +08:00
hoshi-hiyouga
d31c9c73c7 Merge pull request #4080 from MengqingCao/npu
Add npu option for model exporting

Former-commit-id: ca459f67eb319c262d5bab882e9cb3535a76eb69
2024-06-06 03:15:44 +08:00
hoshi-hiyouga
d9a372658a Update model_args.py
Former-commit-id: af2c3cbee4372724cf00379d8494f5000dc8a3df
2024-06-06 03:14:23 +08:00
hiyouga
c439c959f7 add vllm_dtype arg #3387 #3717
Former-commit-id: 8fcc79e1e66a4e952710ac8c98685437d924361a
2024-06-06 02:53:27 +08:00
hiyouga
3fcb678d00 support train from scratch #4033 #4075
Former-commit-id: a12a506c3d2ba85975a5990c46d2e055cdfe0f2e
2024-06-06 02:43:19 +08:00
MengqingCao
15f6ab73a5 add npu for model export
Former-commit-id: 07045c876a4c6025ba56905b65cce6c9faedacb5
2024-06-05 07:06:40 +00:00
hiyouga
e4ce59243b fix #4005 #4013
Former-commit-id: eed33862bc733361f3c28b3c95dc0eb4ea00884c
2024-06-03 19:12:29 +08:00
Xu Song
abe33220bf Update model_args.py
Former-commit-id: dade2f083d398f83581d935f44d20dc45244c924
2024-05-31 14:35:48 +08:00
hiyouga
b88ecd71fd fix full/freeze tuning for mllm
Former-commit-id: 08564838bd02651668845ed74e2e60561e5b6d8c
2024-05-27 20:37:57 +08:00
BUAADreamer
119af92620 Merge branch 'hiyouga:main' into main
Former-commit-id: 047a06a1e5336625acac14b3533ab3ef2b9905be
2024-05-24 09:50:00 +08:00
hiyouga
6d8ef03741 fix oom issues in export
Former-commit-id: 67ebc7b388c61b9d880c02d7fd217c29299fdf43
2024-05-23 23:32:45 +08:00
BUAADreamer
071d674065 support pretraining of llava
Former-commit-id: 29a6d5bdb8610be8f796eed65eede9ba7b503527
2024-05-21 08:57:14 +08:00
hoshi-hiyouga
e4a2accf4a Update model_args.py
Former-commit-id: 964672745389e35580a7010b0f010bd5ee08d542
2024-05-17 16:16:41 +08:00
juejuezi
20326affde feat: pass the max_lora_rank parameter to vLLM backend
Former-commit-id: b20d62ba3ccc5c02529d19e22b7adcfe8b88c326
2024-05-17 16:07:39 +08:00