Commit Graph

139 Commits

Author SHA1 Message Date
hiyouga
91e62a098f set dev version 2024-06-11 00:50:53 +08:00
hiyouga
2b6ebd6b51 release v0.8.1 2024-06-11 00:44:26 +08:00
hiyouga
972ec9c668 fix llamafactory-cli env 2024-06-08 07:15:45 +08:00
hiyouga
3ac11e77cc set dev version 2024-06-08 06:46:09 +08:00
hiyouga
5aa4ce4756 release v0.8.0 2024-06-08 05:20:54 +08:00
hiyouga
06e5d136a4 add resume args in webui 2024-06-08 00:22:16 +08:00
hiyouga
f9e818d79c fix #4120 2024-06-07 04:18:05 +08:00
hiyouga
8e95648850 add qwen2 models 2024-06-07 00:22:57 +08:00
hiyouga
451b6693c0 fix torch gc 2024-06-06 20:30:25 +08:00
hiyouga
cae4737907 lora modules: all by default 2024-06-06 03:53:28 +08:00
hiyouga
c23cc63d3d add codestral 22B 2024-06-06 03:42:50 +08:00
hiyouga
7daf8366db lint 2024-06-06 03:33:44 +08:00
hoshi-hiyouga
f2580ad403 Merge pull request #4066 from injet-zhou/main
add throughput entry to training log
2024-06-06 03:32:04 +08:00
hiyouga
dc4a00dd63 update train hparams 2024-06-06 01:49:20 +08:00
hiyouga
d4908d5708 add llamafactory-cli env 2024-06-06 01:28:14 +08:00
hiyouga
67fe822324 fix #4090 2024-06-06 00:50:32 +08:00
hiyouga
f48f5e646e support glm-4 2024-06-05 15:16:38 +08:00
faddddeout
b2f0459542 add throughput entry to log 2024-06-04 11:04:29 +00:00
hiyouga
876bc92865 bump versions
transformers 4.37.2->4.41.2
datasets 2.14.3->2.16.0
accelerate 0.27.2->0.30.1
peft 0.10.0->0.11.1
trl 0.8.1->0.8.6
2024-06-03 18:29:38 +08:00
hiyouga
8070871732 better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
2024-05-29 23:55:38 +08:00
hiyouga
89ca832740 update readme 2024-05-29 18:39:11 +08:00
hzhaoy
0dd632fe9e add TeleChat-12B/TeleChat-12B-v2 models 2024-05-29 15:00:37 +08:00
hiyouga
7c016b22aa support DDP in webui 2024-05-28 19:24:22 +08:00
hiyouga
c1fdf81df6 tiny fix 2024-05-27 20:54:26 +08:00
hoshi-hiyouga
87ea0a8bcd Merge pull request #3921 from gusye1234/main
Add openchat-3.6-8B support
2024-05-27 20:52:37 +08:00
Jianbai Ye
cff815391f add openchat-3.6-8B support 2024-05-27 20:42:08 +08:00
hiyouga
e626e26446 support Aya23 2024-05-27 20:23:24 +08:00
hiyouga
efa4b196ca add phi-3 7b/14b, mistral v0.3 models 2024-05-27 18:20:16 +08:00
hiyouga
5581cb2e4e update readme 2024-05-27 18:14:02 +08:00
hiyouga
cb63b32986 support SimPO #3900 2024-05-26 23:46:33 +08:00
hiyouga
335501e228 fix #3847 2024-05-21 17:53:06 +08:00
hiyouga
2a67457e39 support paligemma 2024-05-21 00:01:22 +08:00
hiyouga
542229abb3 fix paligemma inference 2024-05-20 23:36:43 +08:00
hiyouga
8ee8ac6eba fix envs 2024-05-19 18:27:18 +08:00
hoshi-hiyouga
33a354548e Merge pull request #3785 from enji-zhou/feature/add_kto
add kto
2024-05-18 03:07:18 +08:00
hiyouga
8af9817605 add deepseek v2 lite model 2024-05-17 13:25:36 +08:00
enji.zhou
db1d5a4f51 add kto 2024-05-17 13:09:17 +08:00
hiyouga
d77bed4091 add falcon 11b 2024-05-17 00:08:33 +08:00
hiyouga
308edbc426 rename package 2024-05-16 18:39:08 +08:00