24 Commits

Author SHA1 Message Date
hiyouga
8ab5566dc0 support FlashAttention2
Former-commit-id: d8aa1404bee9842f3e4cd037ad8d66c85470ac37
2023-09-10 20:43:56 +08:00
hiyouga
85aa16f6c6 fix #850
Former-commit-id: 815b92e698562bfae6eb9a6fa1b612a05d43ed67
2023-09-10 14:22:03 +08:00
hiyouga
9ed4bb63d4 change to right-padding, update reward score #803
Former-commit-id: 8ea32e4046d75ddfa9517669e9de9f48fea720c6
2023-09-08 20:04:31 +08:00
hiyouga
f74b980650 fix baichuan templates
Former-commit-id: 85b1f6632a752029dabdaed87c58986deb3a6b1d
2023-09-07 18:54:14 +08:00
hiyouga
0d0232479f fix import error
Former-commit-id: 2de1a7610a78e41680970b9f308741f98df489fa
2023-08-23 20:45:03 +08:00
hiyouga
38080233a5 fix #649
Former-commit-id: 57146c101f3e8f688b016a44c85e8ad5d1b6f938
2023-08-23 20:21:15 +08:00
hiyouga
acaac6df9e Release v0.1.7
Former-commit-id: 9c9009f49fdff83a29b83d8f97eb5c99e2574256
2023-08-18 17:21:27 +08:00
hiyouga
fceca0bb6a update training resuming
Former-commit-id: 58f13e22da18babed0d2d4348474e07745da8fa5
2023-08-18 01:41:17 +08:00
hoshi-hiyouga
49d4ae3704 Merge branch 'main' into main
Former-commit-id: 725290324562e093565fae79a05341ebf64486d5
2023-08-18 01:37:23 +08:00
hiyouga
caf4a61e21 fix ChatGLM2 ppo #527 #528
Former-commit-id: 9f4c2adc9a9ca8e458d3868805e077182e0d336a
2023-08-18 00:34:59 +08:00
hiyouga
623a34b16f fix generation bug #532
Former-commit-id: be21fc83f9aed0af1e5a2f83f5d5eeb36f1d283c
2023-08-17 22:21:34 +08:00
hiyouga
a46f277477 fix streaming in pt stage #548 #549
Former-commit-id: b0ed0dec5e6788a0344c09a6cc58d1116265fd68
2023-08-17 17:59:26 +08:00
hiyouga
048f99354f fix generation
Former-commit-id: d9e62711a3349d7c6fd3512fb25c709bdfbb311a
2023-08-16 22:39:54 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
2023-08-11 03:02:53 +08:00
hiyouga
6404167ab7 support val set in streaming mode
Former-commit-id: d86ea314a197fd821770d895e988c48d46679047
2023-08-09 23:00:26 +08:00
niuba
53a89f53aa add last_checkpoint support
Former-commit-id: 2ec68d3398d86773c9076aae6b4e868ced0513d3
2023-08-09 16:39:27 +08:00
hiyouga
b43f37ca19 fix sft trainer
Former-commit-id: df946e6949c77179a5080b780109e22c297caef8
2023-08-09 16:35:03 +08:00
hiyouga
4242897b78 modify code structure
Former-commit-id: 08f180e78862cad902b6cdbbd8c86e39b5cacf8a
2023-08-02 23:17:36 +08:00
hiyouga
e80b75b560 support streaming data, fix #284 #274 #268
Former-commit-id: 0411a4b3e122e7907441bc7a64b004948741a620
2023-07-31 23:33:00 +08:00
hiyouga
64b4f71673 simplify code
Former-commit-id: 67a27730744b71795b10260d050501bfe2329c26
2023-07-20 15:08:57 +08:00
hiyouga
1a23cb2578 fix #196
Former-commit-id: 925a790bc9507c4d23af275a5abfb149959dbdcb
2023-07-19 17:35:38 +08:00
hiyouga
af37ac077c support dev set in web ui
Former-commit-id: fe2887ca1304e5b5cfd7fbd820a9a0c8dedd23ef
2023-07-18 20:40:49 +08:00
hiyouga
799524b37b fix #175
Former-commit-id: 85c2210452cc45470c228f17b2b0df09b47e9575
2023-07-17 18:07:17 +08:00
hiyouga
a696148d6b modity code structure
Former-commit-id: f75137661358f9070bc70c341dfa2cc5fd69cf94
2023-07-15 16:54:28 +08:00