hiyouga
|
e6ba7ef3e6
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|
hiyouga
|
973cf8e980
|
tiny fix
Former-commit-id: 5dd2e5c3323f56420b5845a5ed28bcd9d4da5e41
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
884b49e662
|
add eval acc
Former-commit-id: 7ffde76fbfb6192e3aac31ccc098f31ce89181ae
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
89d9dd5aa5
|
fix #4579
Former-commit-id: 0fa298ff6a4febea36ea9f11c7594277a77e6e9b
|
2024-06-27 13:49:57 +08:00 |
|
hiyouga
|
9fd7a410bb
|
tiny fix about badam
Former-commit-id: 03f49267c7406e36aee35639f86e6e0383897090
|
2024-06-25 01:54:53 +08:00 |
|
Jonery
|
fa3150548e
|
Cleaner integration.
Former-commit-id: 26d4b05d424bd71f570195dd433258caf6465d92
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
12fcfc2b72
|
Support distributed BAdam.
Former-commit-id: bdcb986e37975911c190a74d3e60bb77aa2033bd
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
95ae30f678
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: 37834a7e79473ccf50ad7f67745b97c274c326d9
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
ba303fd1aa
|
adapt for badam with ds zero3
Former-commit-id: fff2a020ec8713022bd8145f4a7168168ea07ca4
|
2024-06-17 18:18:10 +08:00 |
|
hiyouga
|
32f45c9e91
|
support pissa
Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
05f3a3c944
|
tiny fix
Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
bb88536166
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
a30931fe0f
|
fix #4295
Former-commit-id: 08f657868f9d605b837c5d8c2946a25cc05c8735
|
2024-06-15 04:34:55 +08:00 |
|
hiyouga
|
49b58fd6af
|
fix #4221
Former-commit-id: 05a3be4853b941909e7d193c31e8d62c8c5f879b
|
2024-06-13 02:48:21 +08:00 |
|
hiyouga
|
0a75224f62
|
clean code
Former-commit-id: f54cafd5c7f0383370d1a2f357834a61a97397ce
|
2024-06-13 01:58:16 +08:00 |
|
hiyouga
|
b0e5a76f4c
|
fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
Former-commit-id: 3a0f60f0aa072531e4ae5819ec00c8fa42aa0913
|
2024-06-07 05:14:19 +08:00 |
|
hiyouga
|
fcb134e144
|
rename files
Former-commit-id: e1a8431770fc36c0c9ee7fed4abbc3d7fdcc5efd
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
dfa686b617
|
rename package
Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08
|
2024-05-16 18:39:08 +08:00 |
|