hiyouga
|
26d914b8fc
|
fix ci
Former-commit-id: 280c0f3f2cea4dfced797cc0e15f72b8b3a93542
|
2024-09-05 03:02:59 +08:00 |
|
hiyouga
|
af178cbcd1
|
update get template
Former-commit-id: 21ea0d0786f91c0bce79630963e66b815a6792a0
|
2024-09-04 22:36:20 +08:00 |
|
hiyouga
|
7056087e92
|
lazy image load
Former-commit-id: cdd733b575411e003bc5ffd6560dd8eff8aa09cf
|
2024-09-04 02:27:08 +08:00 |
|
hiyouga
|
2e1396cd6b
|
lint
Former-commit-id: d821d933e6cb982d648a69f85f6ad01d0560ed70
|
2024-09-03 00:46:25 +08:00 |
|
hiyouga
|
b5e9df5df8
|
fix #5324
Former-commit-id: f7aa06c9c0b18c28419ea5792410915d3f322cbf
|
2024-09-02 23:56:21 +08:00 |
|
hoshi-hiyouga
|
7367c6ec21
|
fix trainer predict
Former-commit-id: 2790790cd26c6743105555a60523b89f367ebce3
|
2024-09-02 10:15:29 +08:00 |
|
hoshi-hiyouga
|
6579ec8c4c
|
remove .cpu()
Former-commit-id: 35c57cc9dcba305d40282a9757ddc23968c210ac
|
2024-09-02 10:10:53 +08:00 |
|
hiyouga
|
60cf12727b
|
add rlhf-v dataset
Former-commit-id: 3fd18fc34a0c994a738504746abfd5548e002437
|
2024-09-01 22:57:41 +08:00 |
|
hiyouga
|
7621526d22
|
tiny fix
Former-commit-id: 8ccaae3871d8d1fe3ea4633d427aecb2ab3addec
|
2024-09-01 21:15:44 +08:00 |
|
hiyouga
|
7e4c5d4bb3
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 7c248fac20bf85d57a91132ce7a793c7f84e9218
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
2f6fc27c8b
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
d789b667d7
|
optimize predict vram
Former-commit-id: a577e44eee351b3ed8011a33ae01cd713354ff97
|
2024-08-30 23:08:45 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
7c6785d3df
|
fix #5295
Former-commit-id: c76873b0eb8225f6e6bfc7223c6012387dceb8ed
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
77341ee3c4
|
fix #5305
Former-commit-id: a710ebaf97c258c802f24e508d83f1f3f10edc6d
|
2024-08-29 20:16:01 +08:00 |
|
hiyouga
|
ca5a759f94
|
tiny fix
Former-commit-id: d2cede7023bbe28525ef8b4ad27247445d8c22e5
|
2024-08-27 12:49:32 +08:00 |
|
hiyouga
|
d111a324bc
|
tiny fix
Former-commit-id: 23961bdf6fdbcde64e7b943f699fdeb4ac024043
|
2024-08-20 00:10:52 +08:00 |
|
liu-zichen
|
75dfe259cf
|
fix lr not change
Former-commit-id: 387dd2d51b5d8cd666459040fdd16525b34720d9
|
2024-08-13 16:33:34 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
moontidef
|
8f42d7df56
|
feat: add support for adammini
Former-commit-id: a2d5fafb705ff44db1711e972490f0abebc2012b
|
2024-08-07 10:08:22 +08:00 |
|
moontidef
|
33a90b9026
|
fix: rename optimzer to optimizer
Former-commit-id: 186dc1fde822e6a603ac273538741ea3853f243e
|
2024-08-07 10:05:01 +08:00 |
|
hiyouga
|
13093963b1
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
75e80fa820
|
fix pissa save
Former-commit-id: 25a1dad7c8df79c15efecb8c6f871a13a327f57a
|
2024-07-29 10:44:34 +08:00 |
|
hiyouga
|
994b9089e9
|
add unittest
Former-commit-id: 8a1f0c5f922989e08a19c65de0b2c4afd2a5771f
|
2024-07-19 01:06:27 +08:00 |
|
hiyouga
|
341225a405
|
fix metrics #4786
Former-commit-id: 7d0c4bd394fc3cba197db1719f1164b9dd66ac21
|
2024-07-17 00:47:00 +08:00 |
|
hiyouga
|
8c93921952
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
1891b64072
|
fix #4820
Former-commit-id: 8c0f8357e1eebee32010fe715554f1136b68b4ba
|
2024-07-15 22:32:07 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
68365045b4
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 51eb379b44fad0336fc96c329ec98dc4528b9c2c
|
2024-07-15 01:00:34 +08:00 |
|
hzhaoy
|
137c87ff60
|
tiny fix
Former-commit-id: 48be67c41eb394d276b41ca22b28e1ef10af4920
|
2024-07-12 00:28:44 +08:00 |
|
hoshi-hiyouga
|
460a40756c
|
Update callbacks.py
Former-commit-id: 526376967deaad73b7ca11063a2e3f0c9a0add98
|
2024-07-10 13:32:20 +08:00 |
|
-.-
|
18057e14ef
|
fix src/llamafactory/train/callbacks.py
Former-commit-id: c79a21aeaa5462770790887a6826d335e1ded5a2
|
2024-07-10 12:05:51 +08:00 |
|
hiyouga
|
025c8fe302
|
fix #4731
Former-commit-id: 99e016ee552a551b52b6fcf3616cb57a5b927715
|
2024-07-10 11:32:36 +08:00 |
|
hiyouga
|
446129ca7a
|
fix ppo trainer
Former-commit-id: a03b2e5ef0d5d6b1b27753438745385d290cb211
|
2024-07-10 11:05:45 +08:00 |
|
hiyouga
|
834c4e8ad9
|
fix #4742
Former-commit-id: ae9cf84347878fcc462f35db941c14e1df104276
|
2024-07-09 23:24:24 +08:00 |
|
codingma
|
5f2bd04799
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
0b0e27c2f1
|
fix #4609
unwrap_model_for_generation(reward_model) is necessary for zero3 training
Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
|
2024-07-03 19:45:51 +08:00 |
|
hiyouga
|
a42671c2d7
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|
hiyouga
|
f17ab6ad92
|
tiny fix
Former-commit-id: 98c4a0af6b3e27ae393d2847f48a01d23d9c8780
|
2024-07-02 23:06:13 +08:00 |
|
hiyouga
|
ca548af2a2
|
remove rlhf support for chatglm2&3
Former-commit-id: bcbb5b71961b89719bffb0d202c431c82e6067cc
|
2024-07-02 23:03:17 +08:00 |
|
hiyouga
|
579997688f
|
upcast logits
Former-commit-id: df61660351c8af30591471807a20869a45bb055a
|
2024-07-02 22:32:05 +08:00 |
|
hiyouga
|
e6ba7ef3e6
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|
hiyouga
|
96a81ce89d
|
fix ppo callbacks
Former-commit-id: 54f1c67c2a802b1d8368a6d1837d4c9a729f2695
|
2024-07-02 17:34:56 +08:00 |
|
hiyouga
|
973cf8e980
|
tiny fix
Former-commit-id: 5dd2e5c3323f56420b5845a5ed28bcd9d4da5e41
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
884b49e662
|
add eval acc
Former-commit-id: 7ffde76fbfb6192e3aac31ccc098f31ce89181ae
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
3c4f8eaa55
|
loose gemma2 attention
Former-commit-id: a0b645017a2de3d58b6cbc71bd91ec96fc7a818b
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
42e7489713
|
add Gemma2 models
Former-commit-id: 8fc5a248ecfd6861cb90dac6c14fe89cdeaf8921
|
2024-06-28 01:26:50 +08:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hzhaoy
|
89d9dd5aa5
|
fix #4579
Former-commit-id: 0fa298ff6a4febea36ea9f11c7594277a77e6e9b
|
2024-06-27 13:49:57 +08:00 |
|