hiyouga
|
a3f4925c2c
|
add test cases
Former-commit-id: b27269bd2b
|
2024-06-15 04:05:54 +08:00 |
|
hiyouga
|
81ed4d8abf
|
fix #4209
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
Former-commit-id: cf9f2d6c42
|
2024-06-13 02:25:50 +08:00 |
|
hiyouga
|
ca9468ff04
|
tiny fix
Former-commit-id: f8d8690bf4
|
2024-06-07 05:19:21 +08:00 |
|
hiyouga
|
4f3c89a6eb
|
fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
Former-commit-id: 4489d73ac7
|
2024-06-07 05:14:19 +08:00 |
|
hiyouga
|
f76d427332
|
fix ppo in trl 0.8.6
Former-commit-id: 2702d7e952
|
2024-06-07 04:48:29 +08:00 |
|
hiyouga
|
8da149ba40
|
rename files
Former-commit-id: 74f96efef9
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
368695483d
|
fix ppo+zero3 #3108
Former-commit-id: 76c61905b2
|
2024-06-06 23:30:07 +08:00 |
|
hiyouga
|
e0aadd4b34
|
fix ppo dataset bug #4012
Former-commit-id: 149610c636
|
2024-06-06 19:03:20 +08:00 |
|
hiyouga
|
e898d8bbc4
|
update trainers
Former-commit-id: fad2591e31
|
2024-06-06 18:45:49 +08:00 |
|
hiyouga
|
468d0e7ed1
|
10x generate in ppo w/ zero3
https://github.com/huggingface/trl/pull/1483
Former-commit-id: 65cd8bdbdb
|
2024-05-29 00:23:23 +08:00 |
|
hiyouga
|
13d7b48efe
|
improve KTO impl., replace datasets
Former-commit-id: c450ee87a3
|
2024-05-18 03:44:56 +08:00 |
|
hiyouga
|
cae823ddf0
|
rename package
Former-commit-id: 308edbc426
|
2024-05-16 18:39:08 +08:00 |
|