hiyouga
|
4c89aca243
|
update readme
Former-commit-id: a1477208471039d3578980f929f1ca8c2a07aa96
|
2024-06-24 18:22:12 +08:00 |
|
hiyouga
|
05f3a3c944
|
tiny fix
Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
bb88536166
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
3ff9b87012
|
add test cases
Former-commit-id: 731176ff34cdf0cbf6b41c40c69f4ceb54c2daf6
|
2024-06-15 04:05:54 +08:00 |
|
hiyouga
|
103a507b39
|
fix #4209
DeepSpeed ZeRO3 has inflight param error when calling model.eval()
Former-commit-id: 4be013f18ea6a35b5a11db98db5f0670ffb41619
|
2024-06-13 02:25:50 +08:00 |
|
hiyouga
|
b0e5a76f4c
|
fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
Former-commit-id: 3a0f60f0aa072531e4ae5819ec00c8fa42aa0913
|
2024-06-07 05:14:19 +08:00 |
|
hiyouga
|
8692796c9b
|
fix ppo in trl 0.8.6
Former-commit-id: 5e0d66a0d80b4bd4a8506e2317209d8fb9d25ff6
|
2024-06-07 04:48:29 +08:00 |
|
hiyouga
|
fcb134e144
|
rename files
Former-commit-id: e1a8431770fc36c0c9ee7fed4abbc3d7fdcc5efd
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
35b5117a59
|
fix ppo+zero3 #3108
Former-commit-id: 33a93cc29e3e57bf001515000c0a70c112573dea
|
2024-06-06 23:30:07 +08:00 |
|
hiyouga
|
351b4efc6c
|
10x generate in ppo w/ zero3
https://github.com/huggingface/trl/pull/1483
Former-commit-id: 5dc43ba8b373d8803bc22d88b3d0d95ef8b9c7f8
|
2024-05-29 00:23:23 +08:00 |
|
hiyouga
|
dfa686b617
|
rename package
Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08
|
2024-05-16 18:39:08 +08:00 |
|