hiyouga
|
e057c8de48
|
support mllm hf inference
|
2024-04-26 05:34:58 +08:00 |
|
hiyouga
|
ca793028c6
|
release v0.6.1
|
2024-03-29 11:36:08 +08:00 |
|
hiyouga
|
8d603f8820
|
fix #2982
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
8c77b10912
|
update trainers
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
9bec3c98a2
|
fix #2777 #2895
|
2024-03-20 17:59:45 +08:00 |
|
hiyouga
|
8664262cde
|
support layerwise galore
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
28f7862188
|
support galore
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
4e5fae2fac
|
fix #2649
|
2024-03-01 13:02:41 +08:00 |
|
stephen
|
42c23798f2
|
update project_kwargs for ppo config
|
2024-02-21 13:47:38 +08:00 |
|
hiyouga
|
638234ceee
|
format style
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
f6d6e00337
|
fix tests
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
898ec3696a
|
fix #2161
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
7df4f3ab20
|
implement rm server #1543
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
5021062493
|
update ppo trainer
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
48211e3799
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
99a3f06377
|
fix #1567
|
2023-11-20 18:46:36 +08:00 |
|
Yuchen Han
|
eeb5249d0b
|
Update workflow.py
|
2023-11-17 00:16:27 -08:00 |
|
hiyouga
|
35b91ea34c
|
fix import bug
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
ce78303600
|
support full-parameter PPO
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
|