moontidef
|
b0d32b2041
|
fix: rename optimzer to optimizer
Former-commit-id: 40908a36fa
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
3c7b10b1fa
|
fix metrics #4786
Former-commit-id: beec77a089
|
2024-07-17 00:47:00 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f12
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
84e6715423
|
fix #4820
Former-commit-id: fd8cc49008
|
2024-07-15 22:32:07 +08:00 |
|
hiyouga
|
14bc7b0551
|
fix up
Former-commit-id: 29ebcd75d5
|
2024-07-15 01:04:56 +08:00 |
|
codingma
|
74f0d02eb8
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 76f3bbcfc0
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
cc31014002
|
improve rlhf
Former-commit-id: c47ab6c072
|
2024-07-02 22:23:08 +08:00 |
|
hiyouga
|
2cf03017a0
|
tiny fix
Former-commit-id: 73280b7dc7
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
54e786346e
|
add eval acc
Former-commit-id: 1856a08e87
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
e1751f6398
|
fix #4579
Former-commit-id: 677c86594e
|
2024-06-27 13:49:57 +08:00 |
|
hiyouga
|
a225b5a70c
|
tiny fix about badam
Former-commit-id: 095fab58d3
|
2024-06-25 01:54:53 +08:00 |
|
Jonery
|
c779899f7b
|
Cleaner integration.
Former-commit-id: 5c2ff1b749
|
2024-06-19 12:29:40 +08:00 |
|
Jonery
|
3a5eacb4cf
|
Support distributed BAdam.
Former-commit-id: 0f72aac8c9
|
2024-06-18 12:27:47 +08:00 |
|
Jonery
|
5d59f6562a
|
Merge remote-tracking branch 'upstream/main'
Former-commit-id: ea1f3ba5e0
|
2024-06-17 18:44:51 +08:00 |
|
Jonery
|
756566342d
|
adapt for badam with ds zero3
Former-commit-id: 33b4372778
|
2024-06-17 18:18:10 +08:00 |
|
hiyouga
|
f25b8626bf
|
support pissa
Former-commit-id: 8c1046d78a
|
2024-06-16 01:08:12 +08:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52e
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
2946153cea
|
add license
Former-commit-id: d87108daa6
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
ab66ae8cd2
|
fix #4295
Former-commit-id: 78589cf90c
|
2024-06-15 04:34:55 +08:00 |
|
hiyouga
|
8fccaf20c5
|
fix #4221
Former-commit-id: 6baafd4eb3
|
2024-06-13 02:48:21 +08:00 |
|
hiyouga
|
833aa324c2
|
clean code
Former-commit-id: 2ed8270112
|
2024-06-13 01:58:16 +08:00 |
|
hiyouga
|
4f3c89a6eb
|
fix ppo trainer save zero3 model
accelerator.get_state_dict(ds_model) should be called at all ranks
Former-commit-id: 4489d73ac7
|
2024-06-07 05:14:19 +08:00 |
|
hiyouga
|
8da149ba40
|
rename files
Former-commit-id: 74f96efef9
|
2024-06-07 00:09:06 +08:00 |
|
hiyouga
|
cae823ddf0
|
rename package
Former-commit-id: 308edbc426
|
2024-05-16 18:39:08 +08:00 |
|