hiyouga | 66e0e651b9 | format style (Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1) | 2024-01-20 20:15:56 +08:00 |
hiyouga | a423274fd9 | support function calling (Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63) | 2024-01-18 09:54:23 +08:00 |
hiyouga | 4d6669c268 | fix #1789 (Former-commit-id: d86455f685fa531e651333e00b4fe54d895cf2e4) | 2024-01-09 18:31:27 +08:00 |
hiyouga | 53d7c5109f | fix ppo trainer (Former-commit-id: ca5b5823b03822ef899405d233a82396be997f44) | 2023-12-28 18:09:28 +08:00 |
hiyouga | 790a31404a | fix #1742 (Former-commit-id: efbb32afdcf0d6aa4ca26f54c95f76dbb84f77dc) | 2023-12-16 20:50:45 +08:00 |
hiyouga | 7b7bfea37d | fix ppo trainer save logic (Former-commit-id: 5e70c41e4e12a1109570b0ff56346fe212c028ed) | 2023-12-04 19:00:19 +08:00 |
hiyouga | 09f165d442 | fix bug (Former-commit-id: 2fd7a8fc3134af66193a5e8db8fea35025f82de9) | 2023-12-03 21:40:40 +08:00 |
hiyouga | 60aea7521b | ppo support rm server (Former-commit-id: 20b0edf16f5b42cb2c4a795674647afb68cb3a4a) | 2023-12-03 21:38:51 +08:00 |
hiyouga | 99ceee840e | fix #1597 (Former-commit-id: d77a3a79a0e854803a57af8ac6a7246691f69f70) | 2023-11-30 21:47:06 +08:00 |
hiyouga | 8ed68301e3 | fix #1668 (Former-commit-id: bccc71259e703ca1e1d88169e385a026c4efa92e) | 2023-11-30 21:02:00 +08:00 |
hiyouga | 0e6f4f981e | fix #1658 (Former-commit-id: 3126687c4820c34daa6a2e9e3bf9065ad59e92dc) | 2023-11-28 20:57:24 +08:00 |
hiyouga | 28258aecd2 | update ppo trainer (Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e) | 2023-11-20 21:39:15 +08:00 |
hiyouga | adf2730d1d | fix #1567 (Former-commit-id: 8c01ffe8d277d49a413571e0669f460c8d0802bf) | 2023-11-20 18:46:36 +08:00 |
hiyouga | de3a84ac59 | fix rlhf callback (Former-commit-id: f5485452d660caef56474cb7dc37abbe4f34599e) | 2023-11-16 03:26:19 +08:00 |
hiyouga | 7a3a0144a5 | support full-parameter PPO (Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff) | 2023-11-16 02:08:04 +08:00 |
hiyouga | 09a4474e7f | disentangle model from tuner and rename modules (Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947) | 2023-11-15 16:29:09 +08:00 |