56 Commits

Author SHA1 Message Date
hiyouga
9898712a24 add orion models
Former-commit-id: 6fc2d5cc0375a183ec95f003e7a27567b2a71514
2024-01-22 21:26:53 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df
2024-01-20 20:15:56 +08:00
hiyouga
0a8f46882c update readme
Former-commit-id: 5608a0da8e24f499a04db9bfbd45e296d8011977
2024-01-18 14:30:48 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
2024-01-18 09:54:23 +08:00
hiyouga
7e16d27fca tiny fix
Former-commit-id: 5a207bb7230789ddefba932095de83002d01c005
2024-01-15 23:34:23 +08:00
hiyouga
21020e51ca support solar 10.7B #1907
Former-commit-id: bf73224f33c93647a6101d36bde5bf8ddfc91438
2024-01-14 00:30:30 +08:00
hiyouga
9771acfd75 support deepseek moe
Former-commit-id: ca3933dc5295bd8d9e5e37ce869ff8fb44761047
2024-01-14 00:14:49 +08:00
hiyouga
dad632091d fix phi modules
Former-commit-id: d1a73fe26ce2c12953c8eebd1d1118abc90dcf74
2024-01-13 23:12:47 +08:00
hiyouga
b29c4fb308 modify weight name
Former-commit-id: 919acc2b0be03ede08cc0018784e8874a920e300
2024-01-09 20:22:47 +08:00
hiyouga
61960189b2 fix #1789
Former-commit-id: 4571068e1e00dc234c9131185fe0924c726add84
2024-01-09 18:31:27 +08:00
hiyouga
4735cb96c1 add yuan model
Former-commit-id: c7ea17d6168d0a032960dbf51c12482f97529b1e
2023-12-29 13:50:24 +08:00
hiyouga
51c636db54 add xverse-65B-2 model
Former-commit-id: 2df923540c3cbf3b06c74801ea66d3523718b84a
2023-12-18 19:24:09 +08:00
hiyouga
1af13cb737 add models
Former-commit-id: 709ac8870a17a96e786b32c75ad8c4e573148cee
2023-12-18 19:09:31 +08:00
hiyouga
397f6bb615 add xverse-65b-chat model
Former-commit-id: 7ae6919b9bb9ecc8d821eea47a03eacd9eb997ac
2023-12-16 20:21:29 +08:00
hiyouga
f0f9d253d8 support autogptq in llama board #246
Former-commit-id: 71389be37cb0f1a65db6e501e11ca14e615c1a24
2023-12-16 16:31:30 +08:00
hiyouga
7dbc670902 support quantization in export model
Former-commit-id: 3524aa1e58da94ab00e9a2024952ea1b4119b2af
2023-12-15 23:44:50 +08:00
hiyouga
f9ab303629 add model urls
Former-commit-id: 3552035d7eecff86943f02aa26693544fe295f49
2023-12-13 00:09:17 +08:00
hiyouga
b7d99ad5f4 support mixtral
Former-commit-id: 96380f5e1887bb166be339e58ab8f65e464d4010
2023-12-12 11:39:04 +08:00
hiyouga
9b84a706af add models
Former-commit-id: e25f7bae16b7ea41a4a1fd1e8db1b961e55d0c5b
2023-12-06 13:33:18 +08:00
hiyouga
f8376b228a add xuanyuan models
Former-commit-id: 6e7af11b989e4cf97ffacbab4736e3434ff6c925
2023-12-02 00:35:29 +08:00
hiyouga
c60e79c12e patch modelscope
Former-commit-id: bd42c229b01a0bf3ceadb8cee5ad49a060cc2d13
2023-12-01 22:53:15 +08:00
yuze.zyz
fcd61657ee remove useless code
Former-commit-id: 5a2392f105704810e9ce96c13fcc8a555726f9b8
2023-12-01 17:28:23 +08:00
tastelikefeet
eb835b693d fix bug
Former-commit-id: d9e52957e272e8133f1b37cf20d193084425e09e
2023-12-01 17:27:00 +08:00
yuze.zyz
b2200409f5 add readme
Former-commit-id: 5aa6751e52b5c2e06727c50e60218226b146b7bf
2023-12-01 16:11:30 +08:00
tastelikefeet
63e12226a0 add model
Former-commit-id: 8ce4d11e38518b0b4657c7e64394d471cbb0bd6d
2023-12-01 15:06:17 +08:00
yuze.zyz
45925e4a9c fix
Former-commit-id: fb2204c183ae8c061ed6ec7f4f1bfbb0b4900c9b
2023-11-29 21:43:58 +08:00
yuze.zyz
e08e0e5814 support ms
Former-commit-id: d38a2e7341100902b6c761895b1fe6191c905d06
2023-11-29 20:36:55 +08:00
hiyouga
ae1048db6d fix #1659
Former-commit-id: 475a3fa0f4c09d4cfd55ec66271a6d3c9eb5f4d2
2023-11-28 20:52:28 +08:00
hiyouga
5f2943dc84 support Yi-34B-Chat models
Former-commit-id: ff1c289229ee382d3e76578bbb6a5e299b969ded
2023-11-23 19:31:49 +08:00
hiyouga
e4f97615f0 update ppo and demo in webui
Former-commit-id: 7537dd434f4c0f0bde06bd8c2ac69bf622772316
2023-11-16 14:55:26 +08:00
hiyouga
3e0b76650a update readme and constants
Former-commit-id: 1e19cf242a1f843b590feefbe24b2cc0a17712b5
2023-11-15 18:04:37 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
2023-11-15 16:29:09 +08:00
hiyouga
4a767e5593 release v0.2.2, fix #1478 #1466
Former-commit-id: 35cc1e28f675889c44f75a0a3194005c7f23631b
2023-11-13 23:09:05 +08:00
hiyouga
0fbaa42752 refactor constants
Former-commit-id: 3697a3dc9a0be8141951dfe65812844f66059517
2023-11-10 14:16:10 +08:00
hiyouga
3d40bdb600 upgrade peft, fix #1088 #1411
Former-commit-id: b2a60905f384ada92618bf21301fe96dac1c10bf
2023-11-07 16:13:36 +08:00
hiyouga
d48478ef88 update constants
Former-commit-id: f28a034a9b74630a56314446bd0f103c086bda60
2023-10-29 13:30:20 +08:00
hiyouga
d338ab3e19 fix #1068 #1074
Former-commit-id: d11a5454633be9f0600cbd1ab7a26c9c8fa5ed80
2023-09-28 14:39:16 +08:00
hiyouga
deb17942ab fix layer norm dtype
Former-commit-id: 84b7486885c600e5e65c5ba9095d56ecc2502977
2023-09-28 00:25:55 +08:00
hiyouga
5ee1bdecdc add MMLU and C-Eval script
Former-commit-id: 465ee8119aa489a41bee0b01b3c105a2f3dd137f
2023-09-23 00:34:17 +08:00
hiyouga
6a71361a54 remove PeftTrainer
Former-commit-id: b218c271edfb07006ddc34b1aca404088de6c528
2023-09-10 22:23:23 +08:00
hiyouga
51f662860d update baichuan2 template
Former-commit-id: 0531886e1f534217dc3c9c0775d29fcf77ff7f5f
2023-09-06 21:43:06 +08:00
hiyouga
f9aee17f9d add Baichuan2 models
Former-commit-id: 62ce65c6282d2bbcb765354acc2819cc3e983a46
2023-09-06 18:36:04 +08:00
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
2023-09-01 19:00:45 +08:00
codemayq
4b29d9d2b0 add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: c0e4d1e81b41c9a36291d8bee46d7d807c898c21
2023-08-23 18:54:23 +08:00
hiyouga
02a61b08b1 update webui
Former-commit-id: 9d0f6214b68a653c0a67632437b227ab8f589bed
2023-08-14 22:45:26 +08:00
codemayq
ee7da14f81 add template match and stage in webui
Former-commit-id: 79c68e552722079faf2ab0858870b481844d66ae
2023-08-14 20:42:59 +08:00
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8d3e379af08804003f1a522c1cd6ac4
2023-08-12 20:46:27 +08:00
codemayq
3ba1b81105 add sft script preview in webui
Former-commit-id: 6bc8e9866d482c945dd98f4e9ab205a7d7270755
2023-08-12 13:53:55 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474ef0648a97387daf5f623498b5e3ee6
2023-08-11 23:25:57 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
2023-08-11 03:02:53 +08:00