hiyouga | 5a199af387 | fix tokenizer for Yi chat models #1617 #1875; Former-commit-id: 71a9c1617181b7df46cfb193464fb7e56e6399b1 | 2023-12-18 17:18:11 +08:00
hiyouga | dee19b11ba | update readme; Former-commit-id: 2b4e5f0d3239984f62c7eca6dc7b9e3bbc6f8c4e | 2023-12-18 15:46:45 +08:00
hiyouga | 16cc0321f2 | fix llama board; Former-commit-id: c46879575f434b2b458bddae6db63b227db4202e | 2023-12-16 22:17:37 +08:00
hiyouga | 8154b4bdf6 | fix #1742; Former-commit-id: 870426ff70c060213ac283b10a9b1f4bf71679ef; v0.4.0 | 2023-12-16 20:50:45 +08:00
hiyouga | 397f6bb615 | add xverse-65b-chat model; Former-commit-id: 7ae6919b9bb9ecc8d821eea47a03eacd9eb997ac | 2023-12-16 20:21:29 +08:00
hiyouga | ff5f57bfbf | set version; Former-commit-id: 328ad06bd4da01d6534354eb6bb798b259a017b9 | 2023-12-16 20:17:51 +08:00
hiyouga | 7c362509a6 | add noisy mean initialization #1815; Former-commit-id: a66186b8724ffd0351a32593ab52d8a2312f339b | 2023-12-16 19:47:51 +08:00
hiyouga | 4e75ca1222 | support dpo-ftx; Former-commit-id: b87c74289d523ef88611b376074199ffd03cf103 | 2023-12-16 19:21:41 +08:00
hiyouga | f0f9d253d8 | support autogptq in llama board #246; Former-commit-id: 71389be37cb0f1a65db6e501e11ca14e615c1a24 | 2023-12-16 16:31:30 +08:00
hoshi-hiyouga | adafb38cb8 | Merge pull request #1868 from yhyu13/improve_hfargparser; Improve logging for unknown args; Former-commit-id: 93f64ce9a85cd8ebff1bb88139c3176b07920020 | 2023-12-16 16:06:09 +08:00
yhyu13 | cc91724507 | Use llmtuner logger; Former-commit-id: fc70a92cb6e9c22bab9a0695f476ae80461c656f | 2023-12-16 07:15:27 +00:00
yhyu13 | 362e3c913f | Improve logging for unknown args; Former-commit-id: 26817143ff86a853c011be11678235bcc803ccce | 2023-12-16 05:16:29 +00:00
hiyouga | 7db6fe4754 | update tips; Former-commit-id: 3551171d49f0f6aa5f745d80f71939408c9bb3a7 | 2023-12-15 23:52:50 +08:00
hiyouga | 9a88387b91 | fix #1770; Former-commit-id: 439a26c27606dc617cfd073ef23256b8f6f7a4fb | 2023-12-15 23:50:15 +08:00
hiyouga | 7dbc670902 | support quantization in export model; Former-commit-id: 3524aa1e58da94ab00e9a2024952ea1b4119b2af | 2023-12-15 23:44:50 +08:00
hiyouga | 2db4cfab40 | update dc link; Former-commit-id: 87ef3f47b519677e6a7c81ff45584acb34339f3a | 2023-12-15 22:11:31 +08:00
hoshi-hiyouga | 68b1500c41 | Merge pull request #1864 from hiyouga/dev; Refactor hyper-parameters of adapters and model loader; Former-commit-id: e2bd597b3c1492eec58575fbe5e634ae0b7c91d9 | 2023-12-15 22:06:56 +08:00
hiyouga | 329576a5b4 | fix bug; Former-commit-id: 00c77104f8f9675e1421f99d78de710b21cab047 | 2023-12-15 21:54:02 +08:00
hiyouga | f32c8614c2 | fix bug; Former-commit-id: 9e509b99af95666ea27f5540bedd64a330357ad7 | 2023-12-15 21:49:26 +08:00
hiyouga | d24d2f0458 | add configurer; Former-commit-id: 2740aa9cbbcfc6dcfef82915b7db4e0f8b2c1bae | 2023-12-15 21:46:40 +08:00
hiyouga | bd03307bbd | refactor adapter hparam; Former-commit-id: 0716f5e470afffd2df5a815712b552a4b4797153 | 2023-12-15 20:53:11 +08:00
hiyouga | 6a186d4386 | add loftq; Former-commit-id: d4c351f1ec82b0864cc32c5602b03daddf9aeaba | 2023-12-14 21:53:56 +08:00
hiyouga | e55e32efc4 | fix valuehead model; Former-commit-id: bfdee1608f53a6334d8e73c48dbeb4160969d783 | 2023-12-14 20:15:20 +08:00
hoshi-hiyouga | 3358416e82 | Update wechat.jpg; Former-commit-id: bf2d9c8febb3a95cbb95322ddf03068938e4b4fb | 2023-12-13 18:23:18 +08:00
hoshi-hiyouga | fa99ead86b | tiny fix; Former-commit-id: 81167cd19dac4926da9bd259f4e3cb064c22825c | 2023-12-13 17:32:36 +08:00
hoshi-hiyouga | 3f8d33695c | revert peft version; Former-commit-id: 9b0630f84fe8eaae7a5a3123626807408645521f | 2023-12-13 10:49:45 +08:00
hoshi-hiyouga | ad1860d35e | update peft version; Former-commit-id: 573a12c86b6ccd9085d4cbeebcbdbfcc24e1990e | 2023-12-13 10:23:51 +08:00
hoshi-hiyouga | 657dff438c | tiny fix; Former-commit-id: 6953096c9d8f85d56cc980a4bec3a052411fb4a0 | 2023-12-13 10:21:29 +08:00
hoshi-hiyouga | 5b211cfbe9 | fix #1819; Former-commit-id: 1fcd545c3dd78bd2113cde8ef788c5395de11c34 | 2023-12-13 10:14:01 +08:00
hiyouga | 15b321da8e | remove loftq; Former-commit-id: 3a8a50d4d42082b3bdce549653b398e49f2eb554 | 2023-12-13 01:53:46 +08:00
hiyouga | c9a2a7a3b3 | fix sharegpt loading; Former-commit-id: 2c8e88f9c14e030f79778893eaeeb7abda97dd0b | 2023-12-13 00:56:16 +08:00
hiyouga | f9ab303629 | add model urls; Former-commit-id: 3552035d7eecff86943f02aa26693544fe295f49 | 2023-12-13 00:09:17 +08:00
hiyouga | fdaba1648a | update readme; Former-commit-id: 28cc07868c5f65644245e37c1559e230aac18ed0 | 2023-12-12 23:30:29 +08:00
hiyouga | 4c69025a83 | support loftq; Former-commit-id: 6219dfbd9377528bce286c724ae2dd0090881095 | 2023-12-12 22:47:06 +08:00
hiyouga | b43a8dcfbf | fix #1795; Former-commit-id: ada0e536c926fd0196d127c89cc817da1008c017 | 2023-12-12 19:58:34 +08:00
hiyouga | 1a0bdd305c | support system column #1765; Former-commit-id: 0a9c6e0146ebc71d5438c837463d6ab236e227c4 | 2023-12-12 19:45:59 +08:00
hiyouga | cefc0b2f03 | fix modelscope data hub; Former-commit-id: d5b2c57a356539df9993e4774b856231eca8a6da | 2023-12-12 18:33:06 +08:00
hoshi-hiyouga | 0091af79b2 | Merge pull request #1802 from tastelikefeet/feat/support_ms; Support ModelScope Datahub; Former-commit-id: 382319915c3d986e018c1346c638b518bb29a6a3 | 2023-12-12 17:58:37 +08:00
hoshi-hiyouga | b67085e13a | Merge branch 'main' into feat/support_ms; Former-commit-id: 6382efec52f6be3daa5db0bd280a96162009fca1 | 2023-12-12 17:55:32 +08:00
hiyouga | e293be7423 | fix webui; Former-commit-id: e6ddebd3ae670f1ceccd7cbe0ad8daee11070eba | 2023-12-12 15:27:40 +08:00
xingjun.wang | e331e8c200 | modify guanaco; Former-commit-id: e80a989d49366bf08f62d212d329a90a02d8167e | 2023-12-12 15:00:37 +08:00
xingjun.wang | 277790d868 | update dataset info; Former-commit-id: 73b50a26b9c6282f28df87338fa4057759c38f69 | 2023-12-12 14:53:59 +08:00
xingjun.wang | 6cb2c99e7d | add use_streaming; Former-commit-id: adc98c86dad64f1a793017fa628b5cf19abbdd01 | 2023-12-12 14:23:05 +08:00
xingjun.wang | 1bd75afae8 | fix cache dir; Former-commit-id: 1909f0d11732bd99fadc6c1191e026137c6a7dff | 2023-12-12 14:21:33 +08:00
xingjun.wang | c1974c91e5 | add print info for test; Former-commit-id: 168321a4da7612620b9528860306f03bf65d019a | 2023-12-12 14:14:40 +08:00
xingjun.wang | e17f2a3f7f | update cache dir; Former-commit-id: edc82b923a3fb03c5af100b5357e10f0c18b4523 | 2023-12-12 13:08:18 +08:00
xingjun.wang | 879209829e | update args for MsDataset.load; Former-commit-id: 09533e95edc5fa65a38b2f04c6d88506196021b3 | 2023-12-12 13:02:54 +08:00
xingjun.wang | 9f17d36ccf | add new datasets; Former-commit-id: fe4acc66b0e2bd96c988315192beb161da2d51f8 | 2023-12-12 12:44:15 +08:00
xingjun.wang | 92fb73abd4 | add open orca; Former-commit-id: 0ce18a378255a1d075a38a364520ba7a1e56180f | 2023-12-12 12:34:04 +08:00
xingjun.wang | 6520aecef1 | update; Former-commit-id: cfba1009d0fc31b5933b558b249d89248f723d6b | 2023-12-12 12:03:23 +08:00