Commit Graph

1341 Commits

Author SHA1 Message Date
zhouwei
28ae947161 The training efficiency of the Ascend 910A has been significantly enhanced, leveraging the full computational power of the NPU (Neural Processing Unit) and the capabilities of torch_npu, a PyTorch library optimized for NPUs. This improvement has resulted in a remarkable tenfold increase in efficiency. 2024-05-06 13:29:59 +08:00
hoshi-hiyouga
a34f526f10 Merge pull request #3578 from pha123661/main
Fix badam example argument
2024-05-05 23:41:58 +08:00
Oscar
eeb415f6fa Fix badam example outdated argument 2024-05-05 23:35:19 +08:00
codingma
845d5acd03 update wechat 2024-05-05 15:31:47 +08:00
hiyouga
bd095eeb73 add version and help to cli 2024-05-05 02:44:35 +08:00
hiyouga
177604fb6b fix eval scripts 2024-05-05 00:53:07 +08:00
hiyouga
af596988b1 update webui 2024-05-05 00:17:54 +08:00
hiyouga
c1a53a0deb update scripts 2024-05-04 23:05:17 +08:00
hiyouga
25aeaae51b add avg ppl 2024-05-04 22:35:31 +08:00
hiyouga
76a077bdce update ppl script 2024-05-04 22:13:14 +08:00
hiyouga
3a666832c1 add cal_ppl script 2024-05-04 22:02:25 +08:00
hiyouga
57a39783d1 update readme 2024-05-04 17:01:21 +08:00
hiyouga
e984ba3167 remove empty stream response 2024-05-04 16:13:52 +08:00
hiyouga
941924fdbd fix async stream api response 2024-05-04 16:11:18 +08:00
hiyouga
ed8f8be752 update api and support abort eval in webui 2024-05-04 15:59:15 +08:00
hiyouga
d4283bb6bf update readme 2024-05-04 00:43:53 +08:00
hiyouga
9d2ce57345 update readme and webui launch 2024-05-04 00:43:02 +08:00
hiyouga
1409654cef update readme 2024-05-04 00:31:02 +08:00
hiyouga
24cc93ab15 fix eval in webui 2024-05-04 00:19:19 +08:00
hiyouga
510e64ee70 fix webui resume 2024-05-03 23:15:19 +08:00
hiyouga
3010154adb fix slow op in dpo/orpo trainer 2024-05-03 23:06:52 +08:00
hiyouga
9585838ebe fix callback log multigpu #3559 2024-05-03 21:24:27 +08:00
hiyouga
5e6f808e3c enable tqdm in webui 2024-05-03 04:42:50 +08:00
hiyouga
17d2e5147e fix gen_args 2024-05-03 04:24:50 +08:00
hiyouga
530f6b49bb fix colab gradio 2024-05-03 03:54:46 +08:00
hiyouga
245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga
39e964a97a Update prepare.sh 2024-05-02 17:16:02 +08:00
hiyouga
9433c8c215 fix badam configs 2024-05-02 02:47:04 +08:00
hoshi-hiyouga
f1c0eedeb3 Merge pull request #3487 from codemayq/main
support BAdam in WebUI
2024-05-02 02:38:01 +08:00
hoshi-hiyouga
dcd53cb89a Update train.py 2024-05-02 02:21:27 +08:00
hoshi-hiyouga
282b5d5b1f Merge pull request #3490 from khazic/main
Added the second sharegpt format
2024-05-02 02:15:23 +08:00
hoshi-hiyouga
d4d9180c40 Update README_zh.md 2024-05-02 02:14:55 +08:00
hoshi-hiyouga
b072ec9d1b Update README.md 2024-05-02 02:13:46 +08:00
codingma
b4a212f934 Merge branch 'hiyouga:main' into main 2024-04-30 10:02:41 +08:00
codingma
d27e6a46b4 update wechat 2024-04-30 09:40:04 +08:00
Lao
ce17eccf45 Update README_zh.md 2024-04-28 23:31:37 +08:00
khazic
288911fc7b Upgrade the second sharegpt format 2024-04-28 14:30:05 +08:00
khazic
d1ba32e4bb added the second sharegpt format 2024-04-28 14:27:45 +08:00
codingma
26f7170393 support BAdam in WebUI 2024-04-28 11:31:34 +08:00
codingma
e898fabbe3 Merge pull request #3484 from codemayq/main
update wechat
2024-04-28 08:40:08 +08:00
codingma
850f9b554f update wechat 2024-04-28 08:37:19 +08:00
hiyouga
32347901d4 fix setup 2024-04-28 03:49:13 +08:00
hiyouga
b3e33c703e fix llava rlhf 2024-04-28 03:01:49 +08:00
hiyouga
4dbbce21d5 add models to 0.7.0 2024-04-28 01:50:30 +08:00
hiyouga
5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hoshi-hiyouga
8f91420223 Merge pull request #3471 from BUAADreamer/main
add llava_150k en/zh mllm sft data
2024-04-26 23:36:41 +08:00
hoshi-hiyouga
456ad61ac5 Update dataset_info.json 2024-04-26 23:36:13 +08:00
hoshi-hiyouga
c29b257007 Update dataset_info.json 2024-04-26 23:34:34 +08:00
BUAADreamer
a177872010 add llava_150k en/zh mllm sft data 2024-04-26 23:18:58 +08:00
hiyouga
168f56683a release v0.7.0 2024-04-26 23:18:00 +08:00