Commit Graph

52 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| hiyouga | 1800f8c72d | fix #6499 | 2025-01-02 11:28:54 +00:00 |
| hiyouga | 2719867982 | fix #6448 | 2024-12-27 16:54:39 +00:00 |
| hiyouga | 5111cac6f8 | support report custom args | 2024-12-21 21:42:45 +00:00 |
| hoshi-hiyouga | 947e22a4a3 | Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab; feat: add swanlab for experiment tracking and visualization. | 2024-12-21 14:09:33 +08:00 |
| hiyouga | d4c1fda1ad | fix #6391 | 2024-12-19 12:16:38 +00:00 |
| hiyouga | c7cedc7569 | support disable shuffling | 2024-12-19 08:53:21 +00:00 |
| hiyouga | 96f8f103e5 | add swanlab | 2024-12-19 07:12:31 +00:00 |
| hiyouga | eda76de32b | support control eos, fix #6345 | 2024-12-17 10:42:05 +00:00 |
| hiyouga | 142191e466 | fix #6348 | 2024-12-17 10:06:46 +00:00 |
| hiyouga | 2811814fc4 | fix mrope | 2024-12-12 15:08:17 +00:00 |
| hiyouga | 1324d158f9 | support batch infer in vllm | 2024-12-04 13:50:00 +00:00 |
| Ting | 40627c601e | code refactor | 2024-11-19 20:33:18 +08:00 |
| Ting | ef6e14550d | update | 2024-11-19 19:10:07 +08:00 |
| Ting | b9f00286d8 | support efficient tokens calculation on sft/dpo | 2024-11-19 17:15:47 +08:00 |
| hiyouga | 4270f7dfb9 | fix dpo metrics | 2024-11-02 20:59:01 +08:00 |
| hiyouga | c38aa29336 | support rank0 logger | 2024-11-02 18:31:04 +08:00 |
| hiyouga | 30567a1487 | fix incorrect loss value for vlms | 2024-10-30 08:56:46 +00:00 |
| hiyouga | 54c6905937 | add docstrings, refactor logger | 2024-09-08 00:56:56 +08:00 |
| hiyouga | dabad5570b | update get template | 2024-09-04 22:36:20 +08:00 |
| hiyouga | 47ea97fb1b | lazy image load | 2024-09-04 02:27:08 +08:00 |
| hiyouga | 22959bcdd3 | lint | 2024-09-03 00:46:25 +08:00 |
| hiyouga | a61c8c4890 | fix #5324 | 2024-09-02 23:56:21 +08:00 |
| hoshi-hiyouga | 99fd9637bd | fix trainer predict | 2024-09-02 10:15:29 +08:00 |
| hoshi-hiyouga | a6c6750e8a | remove .cpu() | 2024-09-02 10:10:53 +08:00 |
| hiyouga | a025c3df61 | remove visual_inputs, fix qlora | 2024-08-31 00:24:51 +08:00 |
| hiyouga | a244f143f4 | optimize predict vram | 2024-08-30 23:08:45 +08:00 |
| moontidef | 40908a36fa | fix: rename optimzer to optimizer | 2024-08-07 10:05:01 +08:00 |
| hiyouga | beec77a089 | fix metrics #4786 | 2024-07-17 00:47:00 +08:00 |
| hiyouga | d774b94f12 | support batch_eval_metrics, fix #4826 | 2024-07-17 00:33:00 +08:00 |
| hiyouga | fd8cc49008 | fix #4820 | 2024-07-15 22:32:07 +08:00 |
| hiyouga | 29ebcd75d5 | fix up | 2024-07-15 01:04:56 +08:00 |
| codingma | 76f3bbcfc0 | 1. add custom eval dataset support; 2. merge load dataset and split dataset function | 2024-07-05 15:52:10 +08:00 |
| hiyouga | 6fd6aa4530 | fix packing for eager/sdpa attn | 2024-07-04 01:52:43 +08:00 |
| hiyouga | c47ab6c072 | improve rlhf | 2024-07-02 22:23:08 +08:00 |
| hiyouga | 73280b7dc7 | tiny fix | 2024-07-01 05:43:17 +08:00 |
| hiyouga | 1856a08e87 | add eval acc | 2024-07-01 03:51:20 +08:00 |
| hiyouga | 8baf3b22b0 | refactor pissa, improve llamaboard | 2024-06-28 01:04:24 +08:00 |
| hzhaoy | 677c86594e | fix #4579 | 2024-06-27 13:49:57 +08:00 |
| hiyouga | 095fab58d3 | tiny fix about badam | 2024-06-25 01:54:53 +08:00 |
| Jonery | 5c2ff1b749 | Cleaner integration. | 2024-06-19 12:29:40 +08:00 |
| Jonery | 0f72aac8c9 | Support distributed BAdam. | 2024-06-18 12:27:47 +08:00 |
| Jonery | ea1f3ba5e0 | Merge remote-tracking branch 'upstream/main' | 2024-06-17 18:44:51 +08:00 |
| Jonery | 33b4372778 | adapt for badam with ds zero3 | 2024-06-17 18:18:10 +08:00 |
| hiyouga | 8c1046d78a | support pissa | 2024-06-16 01:08:12 +08:00 |
| hiyouga | 38b6b0f52e | tiny fix | 2024-06-16 01:06:41 +08:00 |
| hiyouga | d87108daa6 | add license | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 78589cf90c | fix #4295 | 2024-06-15 04:34:55 +08:00 |
| hiyouga | 6baafd4eb3 | fix #4221 | 2024-06-13 02:48:21 +08:00 |
| hiyouga | 2ed8270112 | clean code | 2024-06-13 01:58:16 +08:00 |
| hiyouga | 4489d73ac7 | fix ppo trainer save zero3 model; accelerator.get_state_dict(ds_model) should be called at all ranks | 2024-06-07 05:14:19 +08:00 |