Commit Graph

1313 Commits

Author SHA1 Message Date
hiyouga
7c362509a6 add noisy mean initialization #1815
Former-commit-id: a66186b872
2023-12-16 19:47:51 +08:00
hiyouga
4e75ca1222 support dpo-ftx
Former-commit-id: b87c74289d
2023-12-16 19:21:41 +08:00
hiyouga
f0f9d253d8 support autogptq in llama board #246
Former-commit-id: 71389be37c
2023-12-16 16:31:30 +08:00
yhyu13
cc91724507 Use llmtuner logger
Former-commit-id: fc70a92cb6
2023-12-16 07:15:27 +00:00
yhyu13
362e3c913f Improve logging for unknown args
Former-commit-id: 26817143ff
2023-12-16 05:16:29 +00:00
hiyouga
7db6fe4754 update tips
Former-commit-id: 3551171d49
2023-12-15 23:52:50 +08:00
hiyouga
9a88387b91 fix #1770
Former-commit-id: 439a26c276
2023-12-15 23:50:15 +08:00
hiyouga
7dbc670902 support quantization in export model
Former-commit-id: 3524aa1e58
2023-12-15 23:44:50 +08:00
hiyouga
2db4cfab40 update dc link
Former-commit-id: 87ef3f47b5
2023-12-15 22:11:31 +08:00
hiyouga
329576a5b4 fix bug
Former-commit-id: 00c77104f8
2023-12-15 21:54:02 +08:00
hiyouga
f32c8614c2 fix bug
Former-commit-id: 9e509b99af
2023-12-15 21:49:26 +08:00
hiyouga
d24d2f0458 add configurer
Former-commit-id: 2740aa9cbb
2023-12-15 21:46:40 +08:00
hiyouga
bd03307bbd refactor adapter hparam
Former-commit-id: 0716f5e470
2023-12-15 20:53:11 +08:00
hiyouga
6a186d4386 add loftq
Former-commit-id: d4c351f1ec
2023-12-14 21:53:56 +08:00
hiyouga
e55e32efc4 fix valuehead model
Former-commit-id: bfdee1608f
2023-12-14 20:15:20 +08:00
hoshi-hiyouga
fa99ead86b tiny fix
Former-commit-id: 81167cd19d
2023-12-13 17:32:36 +08:00
hoshi-hiyouga
3f8d33695c revert peft version
Former-commit-id: 9b0630f84f
2023-12-13 10:49:45 +08:00
hoshi-hiyouga
ad1860d35e update peft version
Former-commit-id: 573a12c86b
2023-12-13 10:23:51 +08:00
hoshi-hiyouga
657dff438c tiny fix
Former-commit-id: 6953096c9d
2023-12-13 10:21:29 +08:00
hoshi-hiyouga
5b211cfbe9 fix #1819
Former-commit-id: 1fcd545c3d
2023-12-13 10:14:01 +08:00
hiyouga
15b321da8e remove loftq
Former-commit-id: 3a8a50d4d4
2023-12-13 01:53:46 +08:00
hiyouga
c9a2a7a3b3 fix sharegpt loading
Former-commit-id: 2c8e88f9c1
2023-12-13 00:56:16 +08:00
hiyouga
f9ab303629 add model urls
Former-commit-id: 3552035d7e
2023-12-13 00:09:17 +08:00
hiyouga
4c69025a83 support loftq
Former-commit-id: 6219dfbd93
2023-12-12 22:47:06 +08:00
hiyouga
b43a8dcfbf fix #1795
Former-commit-id: ada0e536c9
2023-12-12 19:58:34 +08:00
hiyouga
1a0bdd305c support system column #1765
Former-commit-id: 0a9c6e0146
2023-12-12 19:45:59 +08:00
hiyouga
cefc0b2f03 fix modelscope data hub
Former-commit-id: d5b2c57a35
2023-12-12 18:33:06 +08:00
hoshi-hiyouga
b67085e13a Merge branch 'main' into feat/support_ms
Former-commit-id: 6382efec52
2023-12-12 17:55:32 +08:00
hiyouga
e293be7423 fix webui
Former-commit-id: e6ddebd3ae
2023-12-12 15:27:40 +08:00
xingjun.wang
6cb2c99e7d add use_streaming
Former-commit-id: adc98c86da
2023-12-12 14:23:05 +08:00
xingjun.wang
1bd75afae8 fix cache dir
Former-commit-id: 1909f0d117
2023-12-12 14:21:33 +08:00
xingjun.wang
c1974c91e5 add print info for test
Former-commit-id: 168321a4da
2023-12-12 14:14:40 +08:00
xingjun.wang
e17f2a3f7f update cache dir
Former-commit-id: edc82b923a
2023-12-12 13:08:18 +08:00
xingjun.wang
879209829e update args for MsDataset.load
Former-commit-id: 09533e95ed
2023-12-12 13:02:54 +08:00
xingjun.wang
6520aecef1 update
Former-commit-id: cfba1009d0
2023-12-12 12:03:23 +08:00
xingjun.wang
1d65d24071 for test
Former-commit-id: 5b979147f0
2023-12-12 11:52:59 +08:00
xingjun.wang
2918743520 for test
Former-commit-id: 8a908a8c64
2023-12-12 11:47:59 +08:00
hiyouga
bd28dd0fe6 update readme
Former-commit-id: 8cace77808
2023-12-12 11:44:30 +08:00
hiyouga
b7d99ad5f4 support mixtral
Former-commit-id: 96380f5e18
2023-12-12 11:39:04 +08:00
hiyouga
e3e86340ec fix baichuan resize
Former-commit-id: f4657de7d5
2023-12-11 20:55:50 +08:00
hiyouga
2e42e38ff2 tiny fix
Former-commit-id: 0239d29fa0
2023-12-11 18:09:40 +08:00
hiyouga
9ead5a2d21 support resize embeddings #1786
Former-commit-id: 64744dde89
2023-12-11 17:50:02 +08:00
hiyouga
5819eb7121 use peft 0.7.0, fix #1561 #1764
Former-commit-id: 9ce1b0e2f2
2023-12-11 17:13:40 +08:00
hiyouga
b641e9e97e fix #1784
Former-commit-id: 28d5de7e78
2023-12-09 20:53:18 +08:00
yuze.zyz
c523613f0a support ms dataset
Former-commit-id: 9c2247d700
2023-12-08 18:00:57 +08:00
hiyouga
89cf856776 fix #1771 and temporarily fix #1764
Former-commit-id: d42c0b1d34
2023-12-08 16:26:20 +08:00
hiyouga
9b84a706af add models
Former-commit-id: e25f7bae16
2023-12-06 13:33:18 +08:00
hiyouga
027caabbb6 fix ppo trainer save logic
Former-commit-id: d3dccd0693
2023-12-04 19:00:19 +08:00
hiyouga
cd2b0a024b fix #1715
Former-commit-id: c9b166615c
2023-12-03 22:35:47 +08:00
hiyouga
16b7296ae1 release v0.3.3
Former-commit-id: 438dea679b
2023-12-03 21:59:45 +08:00