Ting
|
ef6e14550d
|
update
|
2024-11-19 19:10:07 +08:00 |
|
Ting
|
b9f00286d8
|
support efficient tokens calculation on sft/dpo
|
2024-11-19 17:15:47 +08:00 |
|
hiyouga
|
4270f7dfb9
|
fix dpo metrics
|
2024-11-02 20:59:01 +08:00 |
|
hiyouga
|
c38aa29336
|
support rank0 logger
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
30567a1487
|
fix incorrect loss value for vlms
|
2024-10-30 08:56:46 +00:00 |
|
hiyouga
|
54c6905937
|
add docstrings, refactor logger
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
dabad5570b
|
update get template
|
2024-09-04 22:36:20 +08:00 |
|
hiyouga
|
47ea97fb1b
|
lazy image load
|
2024-09-04 02:27:08 +08:00 |
|
hiyouga
|
22959bcdd3
|
lint
|
2024-09-03 00:46:25 +08:00 |
|
hiyouga
|
a61c8c4890
|
fix #5324
|
2024-09-02 23:56:21 +08:00 |
|
hoshi-hiyouga
|
99fd9637bd
|
fix trainer predict
|
2024-09-02 10:15:29 +08:00 |
|
hoshi-hiyouga
|
a6c6750e8a
|
remove .cpu()
|
2024-09-02 10:10:53 +08:00 |
|
hiyouga
|
a025c3df61
|
remove visual_inputs, fix qlora
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
a244f143f4
|
optimize predict vram
|
2024-08-30 23:08:45 +08:00 |
|
moontidef
|
40908a36fa
|
fix: rename optimzer to optimizer
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
beec77a089
|
fix metrics #4786
|
2024-07-17 00:47:00 +08:00 |
|
hiyouga
|
d774b94f12
|
support batch_eval_metrics, fix #4826
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
fd8cc49008
|
fix #4820
|
2024-07-15 22:32:07 +08:00 |
|
hiyouga
|
29ebcd75d5
|
fix up
|
2024-07-15 01:04:56 +08:00 |
|
codingma
|
76f3bbcfc0
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
c47ab6c072
|
improve rlhf
|
2024-07-02 22:23:08 +08:00 |
|
hiyouga
|
73280b7dc7
|
tiny fix
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
1856a08e87
|
add eval acc
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
8baf3b22b0
|
refactor pissa, improve llamaboard
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
677c86594e
|
fix #4579
|
2024-06-27 13:49:57 +08:00 |
|
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
|
Jonery
|
5c2ff1b749
|
Cleaner integration.
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
0f72aac8c9
|
Support distributed BAdam.
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
ea1f3ba5e0
|
Merge remote-tracking branch 'upstream/main'
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
33b4372778
|
adapt for badam with ds zero3
|
2024-06-17 18:18:10 +08:00 |
|
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
38b6b0f52e
|
tiny fix
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
78589cf90c
|
fix #4295
|
2024-06-15 04:34:55 +08:00 |
|
hiyouga
|
6baafd4eb3
|
fix #4221
|
2024-06-13 02:48:21 +08:00 |
|
hiyouga
|
2ed8270112
|
clean code
|
2024-06-13 01:58:16 +08:00 |
|
hiyouga
|
4489d73ac7
|
fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
|
2024-06-07 05:14:19 +08:00 |
|
hiyouga
|
74f96efef9
|
rename files
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
308edbc426
|
rename package
|
2024-05-16 18:39:08 +08:00 |
|