hiyouga
|
938c4cb132
|
fix dpo trainer
Former-commit-id: 074745b1707f98e092749f57041d866c5d55bc04
|
2023-12-23 01:51:55 +08:00 |
|
hiyouga
|
bf872424da
|
llama board: add unsloth
Former-commit-id: 9a18a85639ad53d75fb25b6a71edd96fe95c5e59
|
2023-12-23 00:35:53 +08:00 |
|
hiyouga
|
f0d405f392
|
support unsloth
Former-commit-id: 7aad0b889d9a316fffd65f32a419078418fc0986
|
2023-12-23 00:14:33 +08:00 |
|
hoshi-hiyouga
|
940824e306
|
Merge pull request #1953 from ShaneTian/model-load-bf16
Fix slow model initialization in bfloat16 dtype.
Former-commit-id: 315b8367cb6edee55101964a076892c5d8d2783f
|
2023-12-22 17:29:54 +08:00 |
|
ShaneTian
|
3eaabe12aa
|
Fix slow model initialization in bfloat16 dtype.
Former-commit-id: d032daa4bd598dd0d71b43eb68a614de77a699a6
|
2023-12-22 16:27:28 +08:00 |
|
hiyouga
|
ce79528bb1
|
fix param type
Former-commit-id: ba69378841778410f8004385df3fd4c41e5fa573
|
2023-12-21 17:33:01 +08:00 |
|
hiyouga
|
fee0fef052
|
fix ds zero3 check
Former-commit-id: 083355fc051f5d25400eb80887ff5e0d15ce729b
|
2023-12-21 01:19:22 +08:00 |
|
hiyouga
|
de6be321b9
|
match version
Former-commit-id: af0194e6d9ba1c67a8592c840c01cee7991b0b0e
|
2023-12-20 22:17:35 +08:00 |
|
hoshi-hiyouga
|
1df1a44e9d
|
Merge pull request #1932 from ShaneTian/main
Update transformers to 4.36.2 to resolve multi-node saving bug.
Former-commit-id: ba4d32bf59e3862ff6d4037647e6893b82a7098e
|
2023-12-20 22:13:28 +08:00 |
|
ShaneTian
|
69cb0038f0
|
Update transformers to 4.36.2 to resolve bug when saving a checkpoint in the multi-node setting.
Former-commit-id: 390f0caf7ff28b75b51eb0f3bba079288e8b66d6
|
2023-12-20 22:00:41 +08:00 |
|
hiyouga
|
0a69665525
|
Update wechat.jpg
Former-commit-id: 7910dbae9238c06adcdbd6e2ff7712aec9e3373f
|
2023-12-20 19:24:37 +08:00 |
|
hiyouga
|
9337568e23
|
fix stop words
Former-commit-id: dec360d5aee58c11804b8e45dd8d4c375086887f
|
2023-12-20 19:06:43 +08:00 |
|
hiyouga
|
622d31e398
|
fix yi template #1895
Former-commit-id: 5af8841c4f6c97df522d2cf4e283d5ef0af21a18
|
2023-12-20 18:58:16 +08:00 |
|
hiyouga
|
23a875a8b1
|
improve quantization
Former-commit-id: 624cc212819b7cd16295c72084cd454b67cf89a6
|
2023-12-20 18:27:16 +08:00 |
|
hiyouga
|
c1233ab65f
|
add max_memory for gptq #1923
Former-commit-id: c4a3977ad7278b59a9b3dadcb446bb4c99da5c9d
|
2023-12-20 18:15:17 +08:00 |
|
hiyouga
|
82a79e9fdf
|
fix #1073 #1462 #1735 #1908
Former-commit-id: 31165a9822bd52130b33cd3439f887c26e0679dc
|
2023-12-20 17:15:40 +08:00 |
|
hiyouga
|
f64be8ee84
|
optimize data loading logic
Former-commit-id: ec1fe1daa98c61f62c753b22847de028b5c5cded
|
2023-12-20 16:15:41 +08:00 |
|
hiyouga
|
633624dc3c
|
fix #1909
Former-commit-id: c6abbbfe90dcb0e832f73f0c611fc32eaa7ea78d
|
2023-12-20 16:11:07 +08:00 |
|
hiyouga
|
a862ce636f
|
fix mixtral inference #1821
Former-commit-id: f86857bd9ef456e77ad79a584f1fa08a129e5270
|
2023-12-20 15:11:15 +08:00 |
|
hiyouga
|
e06b9c4fa1
|
fix #1900
Former-commit-id: 0c6ab7c75e671348f24d28309d450aecec0a157f
|
2023-12-19 17:21:46 +08:00 |
|
hiyouga
|
54e58adf09
|
update readme
Former-commit-id: edb7d177c210b800e2778bb5b638e12ac1078ef9
|
2023-12-18 22:29:45 +08:00 |
|
hiyouga
|
3981927d6b
|
add codegeex template
Former-commit-id: a67a440644687dc2262134c0f2895f3ae42cae19
|
2023-12-18 19:52:35 +08:00 |
|
hiyouga
|
51c636db54
|
add xverse-65B-2 model
Former-commit-id: 2df923540c3cbf3b06c74801ea66d3523718b84a
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
1af13cb737
|
add models
Former-commit-id: 709ac8870a17a96e786b32c75ad8c4e573148cee
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
5a199af387
|
fix tokenizer for Yi chat models #1617 #1875
Former-commit-id: 71a9c1617181b7df46cfb193464fb7e56e6399b1
|
2023-12-18 17:18:11 +08:00 |
|
hiyouga
|
dee19b11ba
|
update readme
Former-commit-id: 2b4e5f0d3239984f62c7eca6dc7b9e3bbc6f8c4e
|
2023-12-18 15:46:45 +08:00 |
|
hiyouga
|
16cc0321f2
|
fix llama board
Former-commit-id: c46879575f434b2b458bddae6db63b227db4202e
|
2023-12-16 22:17:37 +08:00 |
|
hiyouga
|
8154b4bdf6
|
fix #1742
Former-commit-id: 870426ff70c060213ac283b10a9b1f4bf71679ef
v0.4.0
|
2023-12-16 20:50:45 +08:00 |
|
hiyouga
|
397f6bb615
|
add xverse-65b-chat model
Former-commit-id: 7ae6919b9bb9ecc8d821eea47a03eacd9eb997ac
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
ff5f57bfbf
|
set version
Former-commit-id: 328ad06bd4da01d6534354eb6bb798b259a017b9
|
2023-12-16 20:17:51 +08:00 |
|
hiyouga
|
7c362509a6
|
add noisy mean initialization #1815
Former-commit-id: a66186b8724ffd0351a32593ab52d8a2312f339b
|
2023-12-16 19:47:51 +08:00 |
|
hiyouga
|
4e75ca1222
|
support dpo-ftx
Former-commit-id: b87c74289d523ef88611b376074199ffd03cf103
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
f0f9d253d8
|
support autogptq in llama board #246
Former-commit-id: 71389be37cb0f1a65db6e501e11ca14e615c1a24
|
2023-12-16 16:31:30 +08:00 |
|
hoshi-hiyouga
|
adafb38cb8
|
Merge pull request #1868 from yhyu13/improve_hfargparser
Improve logging for unknown args
Former-commit-id: 93f64ce9a85cd8ebff1bb88139c3176b07920020
|
2023-12-16 16:06:09 +08:00 |
|
yhyu13
|
cc91724507
|
Use llmtuner logger
Former-commit-id: fc70a92cb6e9c22bab9a0695f476ae80461c656f
|
2023-12-16 07:15:27 +00:00 |
|
yhyu13
|
362e3c913f
|
Improve logging for unknown args
Former-commit-id: 26817143ff86a853c011be11678235bcc803ccce
|
2023-12-16 05:16:29 +00:00 |
|
hiyouga
|
7db6fe4754
|
update tips
Former-commit-id: 3551171d49f0f6aa5f745d80f71939408c9bb3a7
|
2023-12-15 23:52:50 +08:00 |
|
hiyouga
|
9a88387b91
|
fix #1770
Former-commit-id: 439a26c27606dc617cfd073ef23256b8f6f7a4fb
|
2023-12-15 23:50:15 +08:00 |
|
hiyouga
|
7dbc670902
|
support quantization in export model
Former-commit-id: 3524aa1e58da94ab00e9a2024952ea1b4119b2af
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
2db4cfab40
|
update dc link
Former-commit-id: 87ef3f47b519677e6a7c81ff45584acb34339f3a
|
2023-12-15 22:11:31 +08:00 |
|
hoshi-hiyouga
|
68b1500c41
|
Merge pull request #1864 from hiyouga/dev
Refactor hyper-parameters of adapters and model loader
Former-commit-id: e2bd597b3c1492eec58575fbe5e634ae0b7c91d9
|
2023-12-15 22:06:56 +08:00 |
|
hiyouga
|
329576a5b4
|
fix bug
Former-commit-id: 00c77104f8f9675e1421f99d78de710b21cab047
|
2023-12-15 21:54:02 +08:00 |
|
hiyouga
|
f32c8614c2
|
fix bug
Former-commit-id: 9e509b99af95666ea27f5540bedd64a330357ad7
|
2023-12-15 21:49:26 +08:00 |
|
hiyouga
|
d24d2f0458
|
add configurer
Former-commit-id: 2740aa9cbbcfc6dcfef82915b7db4e0f8b2c1bae
|
2023-12-15 21:46:40 +08:00 |
|
hiyouga
|
bd03307bbd
|
refactor adapter hparam
Former-commit-id: 0716f5e470afffd2df5a815712b552a4b4797153
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
6a186d4386
|
add loftq
Former-commit-id: d4c351f1ec82b0864cc32c5602b03daddf9aeaba
|
2023-12-14 21:53:56 +08:00 |
|
hiyouga
|
e55e32efc4
|
fix valuehead model
Former-commit-id: bfdee1608f53a6334d8e73c48dbeb4160969d783
|
2023-12-14 20:15:20 +08:00 |
|
hoshi-hiyouga
|
3358416e82
|
Update wechat.jpg
Former-commit-id: bf2d9c8febb3a95cbb95322ddf03068938e4b4fb
|
2023-12-13 18:23:18 +08:00 |
|
hoshi-hiyouga
|
fa99ead86b
|
tiny fix
Former-commit-id: 81167cd19dac4926da9bd259f4e3cb064c22825c
|
2023-12-13 17:32:36 +08:00 |
|
hoshi-hiyouga
|
3f8d33695c
|
revert peft version
Former-commit-id: 9b0630f84fe8eaae7a5a3123626807408645521f
|
2023-12-13 10:49:45 +08:00 |
|