Commit Graph

97 Commits

Author SHA1 Message Date
Chengcheng Pei
573e3183e6 1, log exceptions in details; 2, check processor is None before calling it. 2024-09-25 12:59:48 -07:00
hiyouga
c7e51ff187 fix #5411 2024-09-11 17:36:42 +08:00
hiyouga
90d6df6222 release v0.9.0 (real) 2024-09-09 01:00:25 +08:00
hiyouga
54c6905937 add docstrings, refactor logger 2024-09-08 00:56:56 +08:00
hoshi-hiyouga
e9bda48c6d fix #5366 2024-09-05 18:08:09 +08:00
hiyouga
1173f7fc1d fix ci 2024-09-05 03:02:59 +08:00
hiyouga
dabad5570b update get template 2024-09-04 22:36:20 +08:00
hiyouga
47ea97fb1b lazy image load 2024-09-04 02:27:08 +08:00
hiyouga
22959bcdd3 lint 2024-09-03 00:46:25 +08:00
hiyouga
a61c8c4890 fix #5324 2024-09-02 23:56:21 +08:00
hoshi-hiyouga
99fd9637bd fix trainer predict 2024-09-02 10:15:29 +08:00
hoshi-hiyouga
a6c6750e8a remove .cpu() 2024-09-02 10:10:53 +08:00
hiyouga
8e49940746 add rlhf-v dataset 2024-09-01 22:57:41 +08:00
hiyouga
55027282cd tiny fix 2024-09-01 21:15:44 +08:00
hiyouga
9967ccb3ae fix mixed mm inputs and rlhf-v 2024-09-01 20:52:47 +08:00
hiyouga
a025c3df61 remove visual_inputs, fix qlora 2024-08-31 00:24:51 +08:00
hiyouga
a244f143f4 optimize predict vram 2024-08-30 23:08:45 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
ad72f3e065 fix #5295 2024-08-29 20:30:18 +08:00
hiyouga
364b757e30 fix #5305 2024-08-29 20:16:01 +08:00
hiyouga
f6ae4e75dd tiny fix 2024-08-27 12:49:32 +08:00
hiyouga
c8b4c7fee5 tiny fix 2024-08-20 00:10:52 +08:00
liu-zichen
ddee718b31 fix lr not change 2024-08-13 16:33:34 +08:00
hiyouga
e2a28f51c6 add adam_mini to readme 2024-08-09 20:02:03 +08:00
moontidef
82bc15dc79 feat: add support for adammini 2024-08-07 10:08:22 +08:00
moontidef
40908a36fa fix: rename optimzer to optimizer 2024-08-07 10:05:01 +08:00
hiyouga
b7ca6c8dc1 fix #5048 2024-08-05 23:48:19 +08:00
codingma
2c1ca9f742 fix pissa save 2024-07-29 10:44:34 +08:00
hiyouga
608de799a2 add unittest 2024-07-19 01:06:27 +08:00
hiyouga
beec77a089 fix metrics #4786 2024-07-17 00:47:00 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hiyouga
fd8cc49008 fix #4820 2024-07-15 22:32:07 +08:00
hiyouga
29ebcd75d5 fix up 2024-07-15 01:04:56 +08:00
hoshi-hiyouga
15b399a82f Merge pull request #4691 from codemayq/feature-suppot-eval-dataset: add eval dataset support 2024-07-15 01:00:34 +08:00
hzhaoy
8bab99c582 tiny fix 2024-07-12 00:28:44 +08:00
hoshi-hiyouga
39cd89ce17 Update callbacks.py 2024-07-10 13:32:20 +08:00
-.-
cff89a2e89 fix src/llamafactory/train/callbacks.py 2024-07-10 12:05:51 +08:00
hiyouga
51942acee8 fix #4731 2024-07-10 11:32:36 +08:00
hiyouga
fb0c400116 fix ppo trainer 2024-07-10 11:05:45 +08:00
hiyouga
2f09520c0d fix #4742 2024-07-09 23:24:24 +08:00
codingma
76f3bbcfc0 1. add custom eval dataset support; 2. merge load dataset and split dataset function 2024-07-05 15:52:10 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hiyouga
8845e94f91 fix #4609: unwrap_model_for_generation(reward_model) is necessary for zero3 training 2024-07-03 19:45:51 +08:00
hiyouga
8b1172b910 tiny fix 2024-07-03 02:31:50 +08:00
hiyouga
71cdf8956e tiny fix 2024-07-02 23:06:13 +08:00
hiyouga
821bb6660e remove rlhf support for chatglm2&3 2024-07-02 23:03:17 +08:00
hiyouga
c13ae2df19 upcast logits 2024-07-02 22:32:05 +08:00
hiyouga
c47ab6c072 improve rlhf 2024-07-02 22:23:08 +08:00
hiyouga
4c296001c4 fix ppo callbacks 2024-07-02 17:34:56 +08:00
hiyouga
73280b7dc7 tiny fix 2024-07-01 05:43:17 +08:00