Commit Graph

98 Commits

Author SHA1 Message Date
Zhangchi Feng
900631755b Merge branch 'hiyouga:main' into main 2024-09-27 18:14:39 +08:00
hoshi-hiyouga
8e5d12c2c4 add modelscope models 2024-09-26 11:22:48 +08:00
marko1616
885a0b77ab Chore: Support llama3.2. 2024-09-25 16:08:44 -04:00
hoshi-hiyouga
92ef62f502 add qwen2.5 models 2024-09-19 02:07:54 +08:00
hiyouga
0ded765784 set dev version 2024-09-11 18:56:37 +08:00
Zhangchi Feng
4643089a7d Merge branch 'hiyouga:main' into main 2024-09-10 13:20:24 +08:00
BUAADreamer
31259e7e0c support llava-next(video) 2024-09-10 12:31:53 +08:00
hiyouga
bdde35fd2e update accelerate ver for schedule_free optimizers 2024-09-09 22:51:08 +08:00
hiyouga
90d6df6222 release v0.9.0 (real) 2024-09-09 01:00:25 +08:00
hiyouga
653fe70acb fix constants 2024-09-08 23:52:30 +08:00
hiyouga
54b5c4b819 release v0.9.0 2024-09-08 23:43:35 +08:00
hiyouga
b6681d7198 support vllm 0.6.0 2024-09-08 02:26:20 +08:00
hiyouga
54c6905937 add docstrings, refactor logger 2024-09-08 00:56:56 +08:00
hoshi-hiyouga
1274356263 Merge pull request #5372 from LDLINGLINGLING/main
增加了对minicpm3.0的适配'
2024-09-05 21:35:42 +08:00
liudan
3d3fbaaff9 根据代码规范修改了代码 2024-09-05 20:17:55 +08:00
hiyouga
359ef8bb0e support Yi-Coder models 2024-09-05 03:12:24 +08:00
hiyouga
8cafc7b055 video datasets 2024-09-05 02:04:17 +08:00
liudan
d7ba97be48 增加了对minicpm3.0的适配' 2024-09-04 23:10:05 +08:00
hiyouga
9967ccb3ae fix mixed mm inputs and rlhf-v 2024-09-01 20:52:47 +08:00
hiyouga
3382317e32 refactor mm training 2024-08-30 02:14:31 +08:00
hiyouga
ad72f3e065 fix #5295 2024-08-29 20:30:18 +08:00
hiyouga
f6ae4e75dd tiny fix 2024-08-27 12:49:32 +08:00
hiyouga
c8b4c7fee5 tiny fix 2024-08-20 00:10:52 +08:00
hoshi-hiyouga
d39f4a62d3 Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
2024-08-19 23:51:39 +08:00
Ricardo
384ab8db84 _is_bf16_available judgment supports npu 2024-08-16 02:58:22 +00:00
Zxilly
dc36fcc3de fix: report correct device count for intel xpu 2024-08-15 08:30:43 +00:00
hiyouga
dc770efb14 add qwen2 math models 2024-08-09 20:20:35 +08:00
hiyouga
b7ca6c8dc1 fix #5048 2024-08-05 23:48:19 +08:00
codingma
dc09d454f2 support gemma-2-2b 2024-08-01 13:45:48 +08:00
hiyouga
1550fe7331 add mistral nemo model 2024-07-24 16:25:53 +08:00
hiyouga
26533c0604 add llama3.1 2024-07-24 16:20:11 +08:00
hiyouga
88c7fc1599 set dev version 2024-07-19 02:01:46 +08:00
hiyouga
bbd5a64423 release v0.8.3 2024-07-19 01:21:18 +08:00
hiyouga
d774b94f12 support batch_eval_metrics, fix #4826 2024-07-17 00:33:00 +08:00
hoshi-hiyouga
f84b007ebb Update packages.py 2024-07-07 15:48:29 +08:00
Lian Junhong
322663bf90 chore: Update vllm_engine.py to support vllm version >= 0.5.1 2024-07-07 15:08:12 +08:00
hiyouga
53b1002fb7 add codegeex4, internlm2.5 2024-07-06 16:16:47 +08:00
hiyouga
6fd6aa4530 fix packing for eager/sdpa attn 2024-07-04 01:52:43 +08:00
hoshi-hiyouga
87d9b2d005 Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
2024-07-04 01:18:54 +08:00
hiyouga
cce7083024 update packing 2024-07-04 01:10:55 +08:00
hiyouga
8a6a7b9c8a update arg name 2024-07-03 23:23:24 +08:00
hiyouga
c47ab6c072 improve rlhf 2024-07-02 22:23:08 +08:00
hzhaoy
57b7c00430 add TeleChat-1B 2024-07-02 17:49:04 +08:00
hoshi-hiyouga
e8e6af2651 Merge branch 'main' into main 2024-07-01 21:01:09 +08:00
hiyouga
d74244d568 fix #4398 #4592 2024-06-30 21:28:51 +08:00
hiyouga
6f63050e1b add Gemma2 models 2024-06-28 01:26:50 +08:00
hiyouga
8baf3b22b0 refactor pissa, improve llamaboard 2024-06-28 01:04:24 +08:00
hiyouga
ad144c2265 support HQQ/EETQ #4113 2024-06-27 00:29:42 +08:00
hiyouga
e507e60638 update readme 2024-06-24 18:22:12 +08:00
ancv
770f75dc83 move configure_packing to llamafactory.model.patcher and fix constants 2024-06-21 00:45:06 +07:00