| hiyouga | 8cace77808 | update readme | 2023-12-12 11:44:30 +08:00 |
| hiyouga | 96380f5e18 | support mixtral | 2023-12-12 11:39:04 +08:00 |
| hiyouga | f4657de7d5 | fix baichuan resize | 2023-12-11 20:55:50 +08:00 |
| hiyouga | 0239d29fa0 | tiny fix | 2023-12-11 18:09:40 +08:00 |
| hiyouga | 64744dde89 | support resize embeddings #1786 | 2023-12-11 17:50:02 +08:00 |
| hiyouga | 9ce1b0e2f2 | use peft 0.7.0, fix #1561 #1764 | 2023-12-11 17:13:40 +08:00 |
| hiyouga | d42c0b1d34 | fix #1771 and temporarily fix #1764 | 2023-12-08 16:26:20 +08:00 |
| hiyouga | c9b166615c | fix #1715 | 2023-12-03 22:35:47 +08:00 |
| hiyouga | 7df4f3ab20 | implement rm server #1543 | 2023-12-03 20:52:54 +08:00 |
| hiyouga | 03d05991f8 | fix #1707 #1710 | 2023-12-03 11:33:12 +08:00 |
| hiyouga | b69763ff92 | fix #1642 | 2023-12-02 00:37:53 +08:00 |
| hiyouga | f57445c7a0 | fix gptq training | 2023-12-02 00:27:15 +08:00 |
| hiyouga | a973ce6e89 | tiny fix | 2023-12-01 23:37:10 +08:00 |
| hiyouga | 01e6c539b0 | fix gptq model inference | 2023-12-01 23:34:14 +08:00 |
| hiyouga | bd42c229b0 | patch modelscope | 2023-12-01 22:53:15 +08:00 |
| yuze.zyz | 5aa6751e52 | add readme | 2023-12-01 16:11:30 +08:00 |
| yuze.zyz | fb2204c183 | fix | 2023-11-29 21:43:58 +08:00 |
| yuze.zyz | d38a2e7341 | support ms | 2023-11-29 20:36:55 +08:00 |
| hiyouga | 475a3fa0f4 | fix #1659 | 2023-11-28 20:52:28 +08:00 |
| hiyouga | 9ea9380145 | support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 | 2023-11-20 22:52:11 +08:00 |
| hiyouga | 5021062493 | update ppo trainer | 2023-11-20 21:39:15 +08:00 |
| hiyouga | 2a36fd5064 | fix value head model resuming | 2023-11-20 19:01:37 +08:00 |
| hiyouga | 99a3f06377 | fix #1567 | 2023-11-20 18:46:36 +08:00 |
| hiyouga | 211b2db5a8 | fix model card network issue | 2023-11-19 23:03:19 +08:00 |
| hiyouga | 1740131d63 | fix #1558 | 2023-11-19 14:15:47 +08:00 |
| hiyouga | ff6056405d | fix evaluator and cached_file in 4.31.0 | 2023-11-18 19:39:23 +08:00 |
| hiyouga | ccb0f58e22 | fix quantization | 2023-11-17 22:21:29 +08:00 |
| hiyouga | 1bbc1be95e | fix #1550 | 2023-11-17 17:23:13 +08:00 |
| hiyouga | 6efa38be46 | fix bug in web ui | 2023-11-16 15:21:24 +08:00 |
| hiyouga | ff52b1779c | fix bug in freeze tuning | 2023-11-16 14:25:11 +08:00 |
| hiyouga | 1817ffc86f | fix rlhf callback | 2023-11-16 03:26:19 +08:00 |
| hiyouga | 856522a3df | fix bug in PPO training | 2023-11-16 02:32:54 +08:00 |
| hiyouga | 35b91ea34c | fix import bug | 2023-11-16 02:27:03 +08:00 |
| hiyouga | ce78303600 | support full-parameter PPO | 2023-11-16 02:08:04 +08:00 |
| hiyouga | 4907452d95 | support multiple modules in freeze training #1514 | 2023-11-15 17:08:18 +08:00 |
| hiyouga | 4736344eb1 | disentangle model from tuner and rename modules | 2023-11-15 16:29:09 +08:00 |