896 Commits

Author SHA1 Message Date
hiyouga
3981927d6b add codegeex template
Former-commit-id: a67a440644687dc2262134c0f2895f3ae42cae19
2023-12-18 19:52:35 +08:00
hiyouga
51c636db54 add xverse-65B-2 model
Former-commit-id: 2df923540c3cbf3b06c74801ea66d3523718b84a
2023-12-18 19:24:09 +08:00
hiyouga
1af13cb737 add models
Former-commit-id: 709ac8870a17a96e786b32c75ad8c4e573148cee
2023-12-18 19:09:31 +08:00
hiyouga
5a199af387 fix tokenizer for Yi chat models #1617 #1875
Former-commit-id: 71a9c1617181b7df46cfb193464fb7e56e6399b1
2023-12-18 17:18:11 +08:00
hiyouga
dee19b11ba update readme
Former-commit-id: 2b4e5f0d3239984f62c7eca6dc7b9e3bbc6f8c4e
2023-12-18 15:46:45 +08:00
hiyouga
16cc0321f2 fix llama board
Former-commit-id: c46879575f434b2b458bddae6db63b227db4202e
2023-12-16 22:17:37 +08:00
hiyouga
8154b4bdf6 fix #1742
Former-commit-id: 870426ff70c060213ac283b10a9b1f4bf71679ef
Tag: v0.4.0
2023-12-16 20:50:45 +08:00
hiyouga
397f6bb615 add xverse-65b-chat model
Former-commit-id: 7ae6919b9bb9ecc8d821eea47a03eacd9eb997ac
2023-12-16 20:21:29 +08:00
hiyouga
ff5f57bfbf set version
Former-commit-id: 328ad06bd4da01d6534354eb6bb798b259a017b9
2023-12-16 20:17:51 +08:00
hiyouga
7c362509a6 add noisy mean initialization #1815
Former-commit-id: a66186b8724ffd0351a32593ab52d8a2312f339b
2023-12-16 19:47:51 +08:00
hiyouga
4e75ca1222 support dpo-ftx
Former-commit-id: b87c74289d523ef88611b376074199ffd03cf103
2023-12-16 19:21:41 +08:00
hiyouga
f0f9d253d8 support autogptq in llama board #246
Former-commit-id: 71389be37cb0f1a65db6e501e11ca14e615c1a24
2023-12-16 16:31:30 +08:00
hoshi-hiyouga
adafb38cb8 Merge pull request #1868 from yhyu13/improve_hfargparser
Improve logging for unknown args

Former-commit-id: 93f64ce9a85cd8ebff1bb88139c3176b07920020
2023-12-16 16:06:09 +08:00
yhyu13
cc91724507 Use llmtuner logger
Former-commit-id: fc70a92cb6e9c22bab9a0695f476ae80461c656f
2023-12-16 07:15:27 +00:00
yhyu13
362e3c913f Improve logging for unknown args
Former-commit-id: 26817143ff86a853c011be11678235bcc803ccce
2023-12-16 05:16:29 +00:00
hiyouga
7db6fe4754 update tips
Former-commit-id: 3551171d49f0f6aa5f745d80f71939408c9bb3a7
2023-12-15 23:52:50 +08:00
hiyouga
9a88387b91 fix #1770
Former-commit-id: 439a26c27606dc617cfd073ef23256b8f6f7a4fb
2023-12-15 23:50:15 +08:00
hiyouga
7dbc670902 support quantization in export model
Former-commit-id: 3524aa1e58da94ab00e9a2024952ea1b4119b2af
2023-12-15 23:44:50 +08:00
hiyouga
2db4cfab40 update dc link
Former-commit-id: 87ef3f47b519677e6a7c81ff45584acb34339f3a
2023-12-15 22:11:31 +08:00
hoshi-hiyouga
68b1500c41 Merge pull request #1864 from hiyouga/dev
Refactor hyper-parameters of adapters and model loader

Former-commit-id: e2bd597b3c1492eec58575fbe5e634ae0b7c91d9
2023-12-15 22:06:56 +08:00
hiyouga
329576a5b4 fix bug
Former-commit-id: 00c77104f8f9675e1421f99d78de710b21cab047
2023-12-15 21:54:02 +08:00
hiyouga
f32c8614c2 fix bug
Former-commit-id: 9e509b99af95666ea27f5540bedd64a330357ad7
2023-12-15 21:49:26 +08:00
hiyouga
d24d2f0458 add configurer
Former-commit-id: 2740aa9cbbcfc6dcfef82915b7db4e0f8b2c1bae
2023-12-15 21:46:40 +08:00
hiyouga
bd03307bbd refactor adapter hparam
Former-commit-id: 0716f5e470afffd2df5a815712b552a4b4797153
2023-12-15 20:53:11 +08:00
hiyouga
6a186d4386 add loftq
Former-commit-id: d4c351f1ec82b0864cc32c5602b03daddf9aeaba
2023-12-14 21:53:56 +08:00
hiyouga
e55e32efc4 fix valuehead model
Former-commit-id: bfdee1608f53a6334d8e73c48dbeb4160969d783
2023-12-14 20:15:20 +08:00
hoshi-hiyouga
3358416e82 Update wechat.jpg
Former-commit-id: bf2d9c8febb3a95cbb95322ddf03068938e4b4fb
2023-12-13 18:23:18 +08:00
hoshi-hiyouga
fa99ead86b tiny fix
Former-commit-id: 81167cd19dac4926da9bd259f4e3cb064c22825c
2023-12-13 17:32:36 +08:00
hoshi-hiyouga
3f8d33695c revert peft version
Former-commit-id: 9b0630f84fe8eaae7a5a3123626807408645521f
2023-12-13 10:49:45 +08:00
hoshi-hiyouga
ad1860d35e update peft version
Former-commit-id: 573a12c86b6ccd9085d4cbeebcbdbfcc24e1990e
2023-12-13 10:23:51 +08:00
hoshi-hiyouga
657dff438c tiny fix
Former-commit-id: 6953096c9d8f85d56cc980a4bec3a052411fb4a0
2023-12-13 10:21:29 +08:00
hoshi-hiyouga
5b211cfbe9 fix #1819
Former-commit-id: 1fcd545c3dd78bd2113cde8ef788c5395de11c34
2023-12-13 10:14:01 +08:00
hiyouga
15b321da8e remove loftq
Former-commit-id: 3a8a50d4d42082b3bdce549653b398e49f2eb554
2023-12-13 01:53:46 +08:00
hiyouga
c9a2a7a3b3 fix sharegpt loading
Former-commit-id: 2c8e88f9c14e030f79778893eaeeb7abda97dd0b
2023-12-13 00:56:16 +08:00
hiyouga
f9ab303629 add model urls
Former-commit-id: 3552035d7eecff86943f02aa26693544fe295f49
2023-12-13 00:09:17 +08:00
hiyouga
fdaba1648a update readme
Former-commit-id: 28cc07868c5f65644245e37c1559e230aac18ed0
2023-12-12 23:30:29 +08:00
hiyouga
4c69025a83 support loftq
Former-commit-id: 6219dfbd9377528bce286c724ae2dd0090881095
2023-12-12 22:47:06 +08:00
hiyouga
b43a8dcfbf fix #1795
Former-commit-id: ada0e536c926fd0196d127c89cc817da1008c017
2023-12-12 19:58:34 +08:00
hiyouga
1a0bdd305c support system column #1765
Former-commit-id: 0a9c6e0146ebc71d5438c837463d6ab236e227c4
2023-12-12 19:45:59 +08:00
hiyouga
cefc0b2f03 fix modelscope data hub
Former-commit-id: d5b2c57a356539df9993e4774b856231eca8a6da
2023-12-12 18:33:06 +08:00
hoshi-hiyouga
0091af79b2 Merge pull request #1802 from tastelikefeet/feat/support_ms
Support ModelScope Datahub

Former-commit-id: 382319915c3d986e018c1346c638b518bb29a6a3
2023-12-12 17:58:37 +08:00
hoshi-hiyouga
b67085e13a Merge branch 'main' into feat/support_ms
Former-commit-id: 6382efec52f6be3daa5db0bd280a96162009fca1
2023-12-12 17:55:32 +08:00
hiyouga
e293be7423 fix webui
Former-commit-id: e6ddebd3ae670f1ceccd7cbe0ad8daee11070eba
2023-12-12 15:27:40 +08:00
xingjun.wang
e331e8c200 modify guanaco
Former-commit-id: e80a989d49366bf08f62d212d329a90a02d8167e
2023-12-12 15:00:37 +08:00
xingjun.wang
277790d868 update dataset info
Former-commit-id: 73b50a26b9c6282f28df87338fa4057759c38f69
2023-12-12 14:53:59 +08:00
xingjun.wang
6cb2c99e7d add use_streaming
Former-commit-id: adc98c86dad64f1a793017fa628b5cf19abbdd01
2023-12-12 14:23:05 +08:00
xingjun.wang
1bd75afae8 fix cache dir
Former-commit-id: 1909f0d11732bd99fadc6c1191e026137c6a7dff
2023-12-12 14:21:33 +08:00
xingjun.wang
c1974c91e5 add print info for test
Former-commit-id: 168321a4da7612620b9528860306f03bf65d019a
2023-12-12 14:14:40 +08:00
xingjun.wang
e17f2a3f7f update cache dir
Former-commit-id: edc82b923a3fb03c5af100b5357e10f0c18b4523
2023-12-12 13:08:18 +08:00
xingjun.wang
879209829e update args for MsDataset.load
Former-commit-id: 09533e95edc5fa65a38b2f04c6d88506196021b3
2023-12-12 13:02:54 +08:00