hiyouga | f5ba2190fb | fix ppo train and dpo eval; Former-commit-id: ced863031836632cb5920e22ae6991f251372118 | 2023-11-07 22:48:51 +08:00
hiyouga | f23e5b602a | fix reward model loading; Former-commit-id: 9709ca501180a1afce32e9043aedb359762b437d | 2023-11-07 17:20:51 +08:00
hiyouga | 2eb65d21ac | upgrade peft, fix #1088 #1411; Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f | 2023-11-07 16:13:36 +08:00
hiyouga | f22886e2b6 | fix #1097; Former-commit-id: c5b8796322d9d48e815038f9fecf0ce39036a4ee | 2023-10-08 22:29:26 +08:00
hiyouga | 386d85ae72 | refactor finetuning Args; Former-commit-id: be425a70a4c8f051717cf1e4464dbd79dae4c0b5 | 2023-09-27 22:28:06 +08:00
hiyouga | e19a44c12b | fix #762 #814; Former-commit-id: 9a30ee5009040afbc524dbac0dad99904b2adf5f | 2023-09-12 16:10:10 +08:00
hiyouga | a09a7b650d | remove PeftTrainer; Former-commit-id: cc0cff3e991f194732d278e627648e528118a719 | 2023-09-10 22:23:23 +08:00
hiyouga | f91c5f2638 | fix lora target; Former-commit-id: d822e41e7ac7e310ee49e347fc45754284ce30b8 | 2023-09-09 17:04:45 +08:00
hiyouga | 7143c551ab | support lora target auto find; Former-commit-id: bce9984733d88bf013847eed523d1c75fdf0995e | 2023-09-09 15:38:37 +08:00
hiyouga | d5f1b99ac4 | Release v0.1.6; Former-commit-id: 43c8b3c3c8bfb2e32d17fb3e8b194938e37d54bd | 2023-08-11 23:25:57 +08:00
hiyouga | ca719a8697 | support DPO training (2305.18290); Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f | 2023-08-11 03:02:53 +08:00
hiyouga | dd3f3e9749 | support streaming data, fix #284 #274 #268; Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197 | 2023-07-31 23:33:00 +08:00
hiyouga | 6261fb362a | modify code structure; Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1 | 2023-07-15 16:54:28 +08:00