44 Commits

Author SHA1 Message Date
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
2023-11-15 16:29:09 +08:00
hiyouga
4a767e5593 release v0.2.2, fix #1478 #1466
Former-commit-id: 35cc1e28f675889c44f75a0a3194005c7f23631b
2023-11-13 23:09:05 +08:00
hiyouga
11ff5d1e43 support warning in webui
Former-commit-id: 9cde5e8af6e1a00d9ad084a847c16d016e2ef5ef
2023-11-02 17:57:04 +08:00
hiyouga
2af83198c7 fix webui
Former-commit-id: 11b55a3270983f0b84a8ab1068bc9af26ed80f8b
2023-10-22 17:24:56 +08:00
hiyouga
59fd3155a1 add new options in webui
Former-commit-id: f793ca0a2c9e7abab360658d0e65506c1f97e1ae
2023-10-22 17:17:58 +08:00
hiyouga
0622f8a3a3 fix #1228
Former-commit-id: cb0edd230251886dfbf2c8f0595297be04f5e5b3
2023-10-19 15:54:10 +08:00
hiyouga
eafde5b73f fix config, #1191
Former-commit-id: a6a04be2e6700050c64e59c00a86698f1098e7cc
2023-10-15 18:28:45 +08:00
hiyouga
e5e740b54d disable tqdm in webui mode
Former-commit-id: 0d63584c036a65cc81e5db504274bdf9808d1849
2023-10-15 16:18:25 +08:00
hiyouga
0503d45782 fix eval resuming in webui
Former-commit-id: 273745f9b9d117d4053afc1746108af95b0a51a4
2023-10-15 15:45:38 +08:00
hiyouga
3ae6229140 implement webui resuming training
Former-commit-id: accde3cd39ec7b09d96cf1865f8f51850693f5ce
2023-10-15 04:52:19 +08:00
hiyouga
eadb7c61af fix bugs in webui
Former-commit-id: fde05cacfc8e669909858587ce5e84380b2e35fb
2023-10-15 03:41:58 +08:00
hiyouga
a902ce4dc7 refactor webui
Former-commit-id: 7ed1fa6fe9025a179bbd2a23a0d50213f53ffba2
2023-10-15 03:06:21 +08:00
hiyouga
9ef9cb316b fix webui
Former-commit-id: b240b6792fdb734dd77ed54861fdde059feb1855
2023-10-13 16:27:59 +08:00
hiyouga
c9d1cd108d refactor model_dtype, fix PPO trainer
Former-commit-id: 2818af0b0967d7695f27658acac0b7e2c2728e5d
2023-10-11 23:16:01 +08:00
hiyouga
7082526df5 tiny fix
Former-commit-id: e1dcb8e4dc958a677bf484e27aec43b9710d7287
2023-10-10 17:41:13 +08:00
hiyouga
8de990b1e9 update webui #1086
Former-commit-id: b8dbec086eef655870f2922697e76733de427f5a
2023-10-09 14:50:14 +08:00
hiyouga
4581d09fa6 fix #944
Former-commit-id: 338b8664edea5ae65192ac657bb013581245ae15
2023-09-21 19:51:02 +08:00
hiyouga
c818a7ff60 support lora target auto find
Former-commit-id: bca1a247bcef51dced59655c8a14c197569367ca
2023-09-09 15:38:37 +08:00
hiyouga
9ed4bb63d4 change to right-padding, update reward score #803
Former-commit-id: 8ea32e4046d75ddfa9517669e9de9f48fea720c6
2023-09-08 20:04:31 +08:00
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
2023-09-01 19:00:45 +08:00
codemayq
433a96b8c2 fix quantization bit is ""
Former-commit-id: a7cc6c4140c23f3b41985a481af69964b87e0feb
2023-08-23 10:08:17 +08:00
codemayq
7b82ea708d fix quantization is ""
Former-commit-id: ec2047b064bdb6a6e084aa268ee3de454f33dd59
2023-08-23 10:04:03 +08:00
hiyouga
edc15c62fa fix system prompt
Former-commit-id: 7407d9daa16bf6b3cd5002e16b2c53e402d2bc39
2023-08-16 01:35:52 +08:00
hiyouga
02a61b08b1 update webui
Former-commit-id: 9d0f6214b68a653c0a67632437b227ab8f589bed
2023-08-14 22:45:26 +08:00
codemayq
4a4623cf2d auto match template when change model_name
Former-commit-id: 0bf892ff1a4b9253ee569a7b0dc5270762af13d3
2023-08-14 20:56:05 +08:00
codemayq
ee7da14f81 add template match and stage in webui
Former-commit-id: 79c68e552722079faf2ab0858870b481844d66ae
2023-08-14 20:42:59 +08:00
hiyouga
6c9b035c0e web UI integrating RLHF
Former-commit-id: ec94274ca155300aee27621c018dd1bbaf78194b
2023-08-14 10:48:47 +08:00
hiyouga
7984ae8b62 fix webui
Former-commit-id: d69b1388e61e5e867ec5c9a9a223677c5b5860ce
2023-08-12 23:52:07 +08:00
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8d3e379af08804003f1a522c1cd6ac4
2023-08-12 20:46:27 +08:00
codemayq
3ba1b81105 add sft script preview in webui
Former-commit-id: 6bc8e9866d482c945dd98f4e9ab205a7d7270755
2023-08-12 13:53:55 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474ef0648a97387daf5f623498b5e3ee6
2023-08-11 23:25:57 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
2023-08-11 03:02:53 +08:00
hiyouga
58e95776e1 fix webui val size
Former-commit-id: ad6e7c76c7b5590ea6ca55c4f76db2d3206b5987
2023-08-10 15:20:44 +08:00
hiyouga
28a807472b fix rm #420, fix template #426, fix #423
Former-commit-id: 39cd8b6989c9190d213e65467ec41f34ea04c5bc
2023-08-09 16:23:31 +08:00
hiyouga
4f714ba314 update webui
Former-commit-id: 3a720aac669708d17152d4e96c2018b5ccc27b75
2023-08-09 00:26:11 +08:00
hiyouga
4242897b78 modify code structure
Former-commit-id: 08f180e78862cad902b6cdbbd8c86e39b5cacf8a
2023-08-02 23:17:36 +08:00
hiyouga
e80b75b560 support streaming data, fix #284 #274 #268
Former-commit-id: 0411a4b3e122e7907441bc7a64b004948741a620
2023-07-31 23:33:00 +08:00
hiyouga
42c78bf591 Update runner.py
Former-commit-id: 1d1d8538c95784d6dc699160588e410cbb2cb1fd
2023-07-21 13:35:19 +08:00
hiyouga
f769c2d3fc update web UI, support rm predict #210
Former-commit-id: ed0e186a134de816d6a9278f4e47baa6250a52d1
2023-07-21 13:27:27 +08:00
hiyouga
94f2fd634f update UI, fix #212
Former-commit-id: 4d1641c1bff1b2b53c0e9b80b3e3ac7979223ccd
2023-07-20 22:09:06 +08:00
hiyouga
af37ac077c support dev set in web ui
Former-commit-id: fe2887ca1304e5b5cfd7fbd820a9a0c8dedd23ef
2023-07-18 20:40:49 +08:00
hiyouga
0b6f769971 update webUI, fix #179
Former-commit-id: 12d8a8633f1d8db8eb72223f69c074d98af16e01
2023-07-18 15:35:17 +08:00
hiyouga
4e1997a343 a monkey patch for lora_target
Former-commit-id: 262252d67bbe4ebcbb315b5d7a34f9a091f8af0c
2023-07-18 00:31:40 +08:00
hiyouga
091805d38e release v0.1.0
Former-commit-id: f8193e8009451cf569a28a10eb4bd88831844441
2023-07-18 00:18:25 +08:00