Commit Graph

31 Commits

Author SHA1 Message Date
hiyouga
6219dfbd93 support loftq 2023-12-12 22:47:06 +08:00
hiyouga
8cace77808 update readme 2023-12-12 11:44:30 +08:00
hiyouga
7df4f3ab20 implement rm server #1543 2023-12-03 20:52:54 +08:00
hiyouga
475a3fa0f4 fix #1659 2023-11-28 20:52:28 +08:00
hiyouga
859a6ea942 support export size setting 2023-11-26 18:34:09 +08:00
hiyouga
5021062493 update ppo trainer 2023-11-20 21:39:15 +08:00
Yuchen Han
b24635d22b Update finetuning_args.py 2023-11-17 00:15:51 -08:00
hiyouga
1817ffc86f fix rlhf callback 2023-11-16 03:26:19 +08:00
hiyouga
856522a3df fix bug in PPO training 2023-11-16 02:32:54 +08:00
hiyouga
ce78303600 support full-parameter PPO 2023-11-16 02:08:04 +08:00
hiyouga
4907452d95 support multiple modules in freeze training #1514 2023-11-15 17:08:18 +08:00
hiyouga
442aefb925 refactor evaluation, upgrade trl to 074 2023-11-13 22:20:35 +08:00
hiyouga
415bca900e tiny fix 2023-11-09 17:20:49 +08:00
Yanqing
3684dffa14 Update finetuning_args.py
更新 chatglm/falcon/bloom 的 lora_target 的名称
2023-11-09 17:04:40 +08:00
hiyouga
01260d9754 fix ppo train and dpo eval 2023-11-07 22:48:51 +08:00
hiyouga
b2a60905f3 upgrade peft, fix #1088 #1411 2023-11-07 16:13:36 +08:00
hiyouga
7b4acf7265 reimplement neftune 2023-10-22 16:15:08 +08:00
anvie
57fb40aa04 add NEFTune optimization 2023-10-21 13:24:10 +07:00
hiyouga
ea82f8a82a refactor export, fix #1190 2023-10-15 16:01:48 +08:00
hiyouga
11bd271364 fix ppo args 2023-10-11 23:40:50 +08:00
hiyouga
620efe1d8d refactor finetuning Args 2023-09-27 22:28:06 +08:00
hiyouga
4318347d3f update template 2023-08-22 19:46:09 +08:00
hiyouga
53e33418d0 support ppo score norm (trl 0.5.1.dev required) 2023-08-18 12:02:42 +08:00
hiyouga
9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga
a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga
3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga
5453b93db0 update args spec 2023-08-07 15:23:35 +08:00
hiyouga
87f8f830e2 support Qwen-7B, fix InternLM-7B inference 2023-08-03 15:53:32 +08:00
hiyouga
8f7819fcaa fix #194 2023-07-19 17:07:33 +08:00
hiyouga
657cf0f55a create chat model 2023-07-15 19:26:20 +08:00
hiyouga
f751376613 modity code structure 2023-07-15 16:54:28 +08:00