| hiyouga | 1800f8c72d | fix #6499 | 2025-01-02 11:28:54 +00:00 |
| hiyouga | 2719867982 | fix #6448 | 2024-12-27 16:54:39 +00:00 |
| hiyouga | 5111cac6f8 | support report custom args | 2024-12-21 21:42:45 +00:00 |
| hoshi-hiyouga | 947e22a4a3 | Merge pull request #6401 from Zeyi-Lin/hiyouga/swanlab; feat: add swanlab for experiment tracking and visualization. | 2024-12-21 14:09:33 +08:00 |
| hiyouga | d4c1fda1ad | fix #6391 | 2024-12-19 12:16:38 +00:00 |
| hiyouga | c7cedc7569 | support disable shuffling | 2024-12-19 08:53:21 +00:00 |
| hiyouga | 96f8f103e5 | add swanlab | 2024-12-19 07:12:31 +00:00 |
| hiyouga | eda76de32b | support control eos, fix #6345 | 2024-12-17 10:42:05 +00:00 |
| hiyouga | 142191e466 | fix #6348 | 2024-12-17 10:06:46 +00:00 |
| hiyouga | 2811814fc4 | fix mrope | 2024-12-12 15:08:17 +00:00 |
| hiyouga | 1324d158f9 | support batch infer in vllm | 2024-12-04 13:50:00 +00:00 |
| Ting | 40627c601e | code refactor | 2024-11-19 20:33:18 +08:00 |
| Ting | ef6e14550d | update | 2024-11-19 19:10:07 +08:00 |
| Ting | b9f00286d8 | support efficient tokens calculation on sft/dpo | 2024-11-19 17:15:47 +08:00 |
| hiyouga | 4270f7dfb9 | fix dpo metrics | 2024-11-02 20:59:01 +08:00 |
| hiyouga | c38aa29336 | support rank0 logger | 2024-11-02 18:31:04 +08:00 |
| hiyouga | 30567a1487 | fix incorrect loss value for vlms | 2024-10-30 08:56:46 +00:00 |
| hiyouga | 54c6905937 | add docstrings, refactor logger | 2024-09-08 00:56:56 +08:00 |
| hiyouga | dabad5570b | update get template | 2024-09-04 22:36:20 +08:00 |
| hiyouga | 47ea97fb1b | lazy image load | 2024-09-04 02:27:08 +08:00 |
| hiyouga | 22959bcdd3 | lint | 2024-09-03 00:46:25 +08:00 |
| hiyouga | a61c8c4890 | fix #5324 | 2024-09-02 23:56:21 +08:00 |
| hoshi-hiyouga | 99fd9637bd | fix trainer predict | 2024-09-02 10:15:29 +08:00 |
| hoshi-hiyouga | a6c6750e8a | remove .cpu() | 2024-09-02 10:10:53 +08:00 |
| hiyouga | a025c3df61 | remove visual_inputs, fix qlora | 2024-08-31 00:24:51 +08:00 |
| hiyouga | a244f143f4 | optimize predict vram | 2024-08-30 23:08:45 +08:00 |
| moontidef | 40908a36fa | fix: rename optimzer to optimizer | 2024-08-07 10:05:01 +08:00 |
| hiyouga | beec77a089 | fix metrics #4786 | 2024-07-17 00:47:00 +08:00 |
| hiyouga | d774b94f12 | support batch_eval_metrics, fix #4826 | 2024-07-17 00:33:00 +08:00 |
| hiyouga | fd8cc49008 | fix #4820 | 2024-07-15 22:32:07 +08:00 |
| hiyouga | 29ebcd75d5 | fix up | 2024-07-15 01:04:56 +08:00 |
| codingma | 76f3bbcfc0 | 1. add custom eval dataset support; 2. merge load dataset and split dataset function | 2024-07-05 15:52:10 +08:00 |
| hiyouga | 6fd6aa4530 | fix packing for eager/sdpa attn | 2024-07-04 01:52:43 +08:00 |
| hiyouga | c47ab6c072 | improve rlhf | 2024-07-02 22:23:08 +08:00 |
| hiyouga | 73280b7dc7 | tiny fix | 2024-07-01 05:43:17 +08:00 |
| hiyouga | 1856a08e87 | add eval acc | 2024-07-01 03:51:20 +08:00 |
| hiyouga | 8baf3b22b0 | refactor pissa, improve llamaboard | 2024-06-28 01:04:24 +08:00 |
| hzhaoy | 677c86594e | fix #4579 | 2024-06-27 13:49:57 +08:00 |
| hiyouga | 095fab58d3 | tiny fix about badam | 2024-06-25 01:54:53 +08:00 |
| Jonery | 5c2ff1b749 | Cleaner integration. | 2024-06-19 12:29:40 +08:00 |
| Jonery | 0f72aac8c9 | Support distributed BAdam. | 2024-06-18 12:27:47 +08:00 |
| Jonery | ea1f3ba5e0 | Merge remote-tracking branch 'upstream/main' | 2024-06-17 18:44:51 +08:00 |
| Jonery | 33b4372778 | adapt for badam with ds zero3 | 2024-06-17 18:18:10 +08:00 |
| hiyouga | 8c1046d78a | support pissa | 2024-06-16 01:08:12 +08:00 |
| hiyouga | 38b6b0f52e | tiny fix | 2024-06-16 01:06:41 +08:00 |
| hiyouga | d87108daa6 | add license | 2024-06-15 17:54:33 +08:00 |
| hiyouga | 78589cf90c | fix #4295 | 2024-06-15 04:34:55 +08:00 |
| hiyouga | 6baafd4eb3 | fix #4221 | 2024-06-13 02:48:21 +08:00 |
| hiyouga | 2ed8270112 | clean code | 2024-06-13 01:58:16 +08:00 |
| hiyouga | 4489d73ac7 | fix ppo trainer save zero3 model; accelerator.get_state_dict(ds_model) should be called at all ranks | 2024-06-07 05:14:19 +08:00 |