anvie
|
202cd43027
|
add NEFTune optimization
Former-commit-id: 603e0298af64116ac07130fe6661a9ba823c186c
|
2023-10-21 13:24:10 +07:00 |
|
hiyouga
|
0c1e00574d
|
fix shift short attention
Former-commit-id: 9a49cce8e6f6b222f74a07bdab40efee6a77b0f1
|
2023-10-09 17:07:46 +08:00 |
|
hiyouga
|
dd6e9b3cc1
|
tiny fix
Former-commit-id: d24ea58c1a44b94227f4cb60f13fc1dd79997d01
|
2023-09-21 19:52:06 +08:00 |
|
hiyouga
|
f5689c0e6e
|
remove PeftTrainer
Former-commit-id: cc0cff3e991f194732d278e627648e528118a719
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
f326178f89
|
support FlashAttention2
Former-commit-id: 23e56c5554b948d4f08ad87849b261eafd2c7890
|
2023-09-10 20:43:56 +08:00 |
|
hiyouga
|
7947b2f6cd
|
fix #850
Former-commit-id: e5975c4c6b8bd47ec506b0d4a4703bee05495436
|
2023-09-10 14:22:03 +08:00 |
|
hiyouga
|
801d1fa7b9
|
change to right-padding, update reward score #803
Former-commit-id: baa90415bc8f5ebd423d001378b51c3a3a6c2ec7
|
2023-09-08 20:04:31 +08:00 |
|
hiyouga
|
47d9325873
|
fix baichuan templates
Former-commit-id: f48a49e835b32f3991cfad8874c7b9c78953809f
|
2023-09-07 18:54:14 +08:00 |
|
hiyouga
|
618e3ab83c
|
fix import error
Former-commit-id: b3207a974a45038591b8cbbcf20d1ca1142d6679
|
2023-08-23 20:45:03 +08:00 |
|
hiyouga
|
67c0d56bd9
|
fix #649
Former-commit-id: e6120a937ddb4f3c0b9bcb2466742f5cf4f77f8c
|
2023-08-23 20:21:15 +08:00 |
|
hiyouga
|
85abbe1f40
|
Release v0.1.7
Former-commit-id: 81abe8d6cabaa1ebe74dc32a5dc143389e4c9f31
|
2023-08-18 17:21:27 +08:00 |
|
hiyouga
|
c0e887c85a
|
update training resuming
Former-commit-id: 2ec75c31f609e65116ac3b621eeb7d8ccbf69135
|
2023-08-18 01:41:17 +08:00 |
|
hoshi-hiyouga
|
c856724d3f
|
Merge branch 'main' into main
Former-commit-id: 870d2c7bf74d0da5a927bef4b8b01d15cc66a3e9
|
2023-08-18 01:37:23 +08:00 |
|
hiyouga
|
a089d2665a
|
fix ChatGLM2 ppo #527 #528
Former-commit-id: 60d6ad64d7c9f6445b0df8de0153c3a311974198
|
2023-08-18 00:34:59 +08:00 |
|
hiyouga
|
bb7028f7e2
|
fix generation bug #532
Former-commit-id: c071121e67374e5f09798db57cfc8668617a36ae
|
2023-08-17 22:21:34 +08:00 |
|
hiyouga
|
4319b5172c
|
fix streaming in pt stage #548 #549
Former-commit-id: 050e992bee2a9293cc7399b578de807b5bf9bddc
|
2023-08-17 17:59:26 +08:00 |
|
hiyouga
|
22dec02b5f
|
fix generation
Former-commit-id: 66a0300d312ef91c24fcf80667fa3b0bb8e1a342
|
2023-08-16 22:39:54 +08:00 |
|
hiyouga
|
7ada4f5f6f
|
support DPO training (2305.18290)
Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
6eb4120464
|
support val set in streaming mode
Former-commit-id: faed15b58ed00b1e09bb091e7eee48f5ef7c508b
|
2023-08-09 23:00:26 +08:00 |
|
niuba
|
186d7f2d67
|
add last_checkpoint support
Former-commit-id: 9f1977e4de00b14a9d1b555c25bcaf12998d5046
|
2023-08-09 16:39:27 +08:00 |
|
hiyouga
|
3a821e5f0c
|
fix sft trainer
Former-commit-id: 08cc888b1569572d0cd20bcf3f07e20072a0311a
|
2023-08-09 16:35:03 +08:00 |
|
hiyouga
|
18f73169fd
|
modify code structure
Former-commit-id: 6369f9b1751e6f9bb709ba76a85f69cbe0823e5d
|
2023-08-02 23:17:36 +08:00 |
|
hiyouga
|
63123a9098
|
support streaming data, fix #284 #274 #268
Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197
|
2023-07-31 23:33:00 +08:00 |
|
hiyouga
|
23e3895033
|
simplify code
Former-commit-id: d3731754ab7c28ae81f60784e0e4213f279d93fe
|
2023-07-20 15:08:57 +08:00 |
|
hiyouga
|
6bc585e4be
|
fix #196
Former-commit-id: 85fd82926db345a590a7fb32c0e352a1d2f025c3
|
2023-07-19 17:35:38 +08:00 |
|
hiyouga
|
014288df8b
|
support dev set in web ui
Former-commit-id: fe1370561a9b027d9ebdef52733344f1e3683081
|
2023-07-18 20:40:49 +08:00 |
|
hiyouga
|
72058cc816
|
fix #175
Former-commit-id: fd557ebb5e3ef2ca330b4d97731af43f4a5a5fc5
|
2023-07-17 18:07:17 +08:00 |
|
hiyouga
|
a69b1b1c3a
|
modity code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|