13 Commits

Author SHA1 Message Date
hiyouga
1173441661 fix #2766
Former-commit-id: 412c52e325660e8b871ffd59f5564f84f46a143f
2024-03-09 21:35:24 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
2024-03-08 00:44:51 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f78621883917425fabe49f5473778111012127
2024-03-07 22:41:36 +08:00
hiyouga
96265ec154 support llama pro #2338 , add rslora
Former-commit-id: 7924ffc55d98e33bfbfbca303e46c8f476435673
2024-02-15 02:27:36 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df
2024-01-20 20:15:56 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
2024-01-18 09:54:23 +08:00
hiyouga
bd03307bbd refactor adapter hparam
Former-commit-id: 0716f5e470afffd2df5a815712b552a4b4797153
2023-12-15 20:53:11 +08:00
hiyouga
64eead3fb1 ppo support rm server
Former-commit-id: 747db4017291b0eb91946f57011bb31659056037
2023-12-03 21:38:51 +08:00
hiyouga
1cb390b9b2 implement rm server #1543
Former-commit-id: 7df4f3ab206fddb462f6ed865eaf04234fd72ed6
2023-12-03 20:52:54 +08:00
hiyouga
48d6d925f7 fix #1558
Former-commit-id: 1740131d63d32aefc0370441baf4716ddb5ebcfe
2023-11-19 14:15:47 +08:00
hiyouga
0ed0b8f9c5 fix bug in freeze tuning
Former-commit-id: ff52b1779c909819d0aef83d3f7ea663199cbe54
2023-11-16 14:25:11 +08:00
hiyouga
b71da932eb fix bug in PPO training
Former-commit-id: 856522a3df4bb9ddfaaa137119eceb9574873950
2023-11-16 02:32:54 +08:00
hiyouga
eb5a852dd5 fix import bug
Former-commit-id: 35b91ea34caade45dd51813b94da5177b852aa4c
2023-11-16 02:27:03 +08:00