hiyouga
|
cedf58978e
|
support autogptq in llama board #246
Former-commit-id: fea01226703d1534b5cf511bcb6a49e73bc86ce1
|
2023-12-16 16:31:30 +08:00 |
|
yhyu13
|
963773a7df
|
Use llmtuner logger
Former-commit-id: ef5a560b4246e04e0ef2612e3520e05288e93707
|
2023-12-16 07:15:27 +00:00 |
|
yhyu13
|
c2cb100805
|
Improve logging for unknown args
Former-commit-id: 03e49d76ca91f7fcaf1c013740d5f6bfc11a2028
|
2023-12-16 05:16:29 +00:00 |
|
hiyouga
|
cf70644f8f
|
fix #1715
Former-commit-id: 3f9192dbbbafdc2171d2eb80282d5cae47565b7b
|
2023-12-03 22:35:47 +08:00 |
|
hiyouga
|
9bf0dcfe39
|
fix #1707 #1710
Former-commit-id: 243a596518ad69cf1eec20a082534b9e94353ce4
|
2023-12-03 11:33:12 +08:00 |
|
hiyouga
|
d46ce88519
|
fix gptq training
Former-commit-id: bec58e3dc575aa4247e563881a456328ee5ef496
|
2023-12-02 00:27:15 +08:00 |
|
hiyouga
|
7d758e2232
|
fix #1703
Former-commit-id: eee2e9abf6df345c5471e8ca7639293543ba720c
|
2023-12-01 22:55:41 +08:00 |
|
hiyouga
|
caf4fa46e0
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
hiyouga
|
eec19b9693
|
tiny fix
Former-commit-id: 37aa7099dff2a9a7b52e259dac92de41ce606946
|
2023-12-01 15:58:50 +08:00 |
|
billvsme
|
bd907d8dce
|
improve get_current_device
Former-commit-id: 2b07815e7fc8dc6ad0a7e9eccdd6681fbab35f3c
|
2023-11-30 22:40:35 +08:00 |
|
hiyouga
|
78e6ac0156
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
hiyouga
|
226156bdf1
|
fix #1558
Former-commit-id: 263b2b24c8a649b51fa5ae768a24e67def8e0e96
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
685d0c975a
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
5a206d54c9
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
c6dfbfa62c
|
refactor evaluation, upgrade trl to 074
Former-commit-id: ed09ebe2c1926ffdb0520b3866f7fd03a9aed046
|
2023-11-13 22:20:35 +08:00 |
|
hiyouga
|
f1a8fcf917
|
refactor model_dtype, fix PPO trainer
Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
34da72ffbb
|
support lora target auto find
Former-commit-id: bce9984733d88bf013847eed523d1c75fdf0995e
|
2023-09-09 15:38:37 +08:00 |
|
hiyouga
|
79cf2ebfe4
|
add Baichuan2 models
Former-commit-id: 36960025e9274b574f57e7a7bf453cd96956e922
|
2023-09-06 18:36:04 +08:00 |
|
hiyouga
|
a089d2665a
|
fix ChatGLM2 ppo #527 #528
Former-commit-id: 60d6ad64d7c9f6445b0df8de0153c3a311974198
|
2023-08-18 00:34:59 +08:00 |
|
hiyouga
|
bb7028f7e2
|
fix generation bug #532
Former-commit-id: c071121e67374e5f09798db57cfc8668617a36ae
|
2023-08-17 22:21:34 +08:00 |
|
hiyouga
|
0e383cf771
|
update webui
Former-commit-id: da30d0fb4abdb825f3383ddd106bb06a84695b7a
|
2023-08-14 22:45:26 +08:00 |
|
hiyouga
|
9644286027
|
fix unusual output of 8bit models #278 #391
Former-commit-id: 337ce5272b81f5561162beb08814b0e5abf23703
|
2023-08-12 00:25:29 +08:00 |
|
hiyouga
|
7ada4f5f6f
|
support DPO training (2305.18290)
Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
a9a886ed6c
|
tiny fix
Former-commit-id: 81ef7017a4c96441951adeff0276cc5ab76a3544
|
2023-08-03 17:42:28 +08:00 |
|
hiyouga
|
bf15c4a03c
|
support Qwen-7B, fix InternLM-7B inference
Former-commit-id: 25d2ca29ecb70cbfd5206333c667042a0c4d2e5a
|
2023-08-03 15:53:32 +08:00 |
|
hiyouga
|
32f2e48c5f
|
Fix #294
Former-commit-id: 09762d9849655f5e6c71b9472d55b42489dd944b
|
2023-08-01 18:13:03 +08:00 |
|
hiyouga
|
63123a9098
|
support streaming data, fix #284 #274 #268
Former-commit-id: 819cc1353599e5fa45658bc56dd0dbe4b258b197
|
2023-07-31 23:33:00 +08:00 |
|
hiyouga
|
c32b68bd1e
|
fix #268
Former-commit-id: 1eee0207fb370bb9e234e9bd3f9a0c47d7d01bc9
|
2023-07-28 17:02:26 +08:00 |
|
hiyouga
|
a69b1b1c3a
|
modity code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|