hoshi-hiyouga
|
4d7bb69234
|
Update constants.py
|
2024-09-30 16:47:52 +08:00 |
|
shing100
|
3a9569647f
|
add Exaone3.0 template
|
2024-09-30 09:18:25 +09:00 |
|
hoshi-hiyouga
|
b257b91cd0
|
Update constants.py
|
2024-09-29 23:45:34 +08:00 |
|
BUAADreamer
|
bec1cb8d55
|
fix constants
|
2024-09-29 22:40:43 +08:00 |
|
BUAADreamer
|
485fc04716
|
fix constants
|
2024-09-29 22:00:01 +08:00 |
|
BUAADreamer
|
65a8923f5a
|
add more llava-next series template
|
2024-09-29 21:29:29 +08:00 |
|
BUAADreamer
|
6642cd501d
|
add llava-next/llava-next-video/video-llava
|
2024-09-28 00:57:03 +08:00 |
|
Zhangchi Feng
|
900631755b
|
Merge branch 'hiyouga:main' into main
|
2024-09-27 18:14:39 +08:00 |
|
hoshi-hiyouga
|
8e5d12c2c4
|
add modelscope models
|
2024-09-26 11:22:48 +08:00 |
|
marko1616
|
885a0b77ab
|
Chore: Support llama3.2.
|
2024-09-25 16:08:44 -04:00 |
|
hoshi-hiyouga
|
92ef62f502
|
add qwen2.5 models
|
2024-09-19 02:07:54 +08:00 |
|
BUAADreamer
|
31259e7e0c
|
support llava-next(video)
|
2024-09-10 12:31:53 +08:00 |
|
hiyouga
|
90d6df6222
|
release v0.9.0 (real)
|
2024-09-09 01:00:25 +08:00 |
|
hiyouga
|
653fe70acb
|
fix constants
|
2024-09-08 23:52:30 +08:00 |
|
hiyouga
|
54b5c4b819
|
release v0.9.0
|
2024-09-08 23:43:35 +08:00 |
|
hoshi-hiyouga
|
1274356263
|
Merge pull request #5372 from LDLINGLINGLING/main
增加了对minicpm3.0的适配'
|
2024-09-05 21:35:42 +08:00 |
|
liudan
|
3d3fbaaff9
|
根据代码规范修改了代码
|
2024-09-05 20:17:55 +08:00 |
|
hiyouga
|
359ef8bb0e
|
support Yi-Coder models
|
2024-09-05 03:12:24 +08:00 |
|
hiyouga
|
8cafc7b055
|
video datasets
|
2024-09-05 02:04:17 +08:00 |
|
liudan
|
d7ba97be48
|
增加了对minicpm3.0的适配'
|
2024-09-04 23:10:05 +08:00 |
|
hiyouga
|
3382317e32
|
refactor mm training
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
dc770efb14
|
add qwen2 math models
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
b7ca6c8dc1
|
fix #5048
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
dc09d454f2
|
support gemma-2-2b
|
2024-08-01 13:45:48 +08:00 |
|
hiyouga
|
1550fe7331
|
add mistral nemo model
|
2024-07-24 16:25:53 +08:00 |
|
hiyouga
|
26533c0604
|
add llama3.1
|
2024-07-24 16:20:11 +08:00 |
|
hiyouga
|
53b1002fb7
|
add codegeex4, internlm2.5
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
6fd6aa4530
|
fix packing for eager/sdpa attn
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
87d9b2d005
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
cce7083024
|
update packing
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
8a6a7b9c8a
|
update arg name
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
c47ab6c072
|
improve rlhf
|
2024-07-02 22:23:08 +08:00 |
|
hzhaoy
|
57b7c00430
|
add TeleChat-1B
|
2024-07-02 17:49:04 +08:00 |
|
hoshi-hiyouga
|
e8e6af2651
|
Merge branch 'main' into main
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
6f63050e1b
|
add Gemma2 models
|
2024-06-28 01:26:50 +08:00 |
|
hiyouga
|
e507e60638
|
update readme
|
2024-06-24 18:22:12 +08:00 |
|
ancv
|
770f75dc83
|
move configure_packing to llamafactory.model.patcher and fix constants
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
a233fbc258
|
add deepseek coder v2 #4346
|
2024-06-18 22:53:54 +08:00 |
|
ancv
|
238f5c3d99
|
update packing with sdpa and eager attention mode
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
572d8bbfdd
|
add minicpm #4227
|
2024-06-15 17:58:52 +08:00 |
|
hiyouga
|
d87108daa6
|
add license
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
06e5d136a4
|
add resume args in webui
|
2024-06-08 00:22:16 +08:00 |
|
hiyouga
|
8e95648850
|
add qwen2 models
|
2024-06-07 00:22:57 +08:00 |
|
hiyouga
|
cae4737907
|
lora modules: all by default
|
2024-06-06 03:53:28 +08:00 |
|
hiyouga
|
c23cc63d3d
|
add codestral 22B
|
2024-06-06 03:42:50 +08:00 |
|
hiyouga
|
f48f5e646e
|
support glm-4
|
2024-06-05 15:16:38 +08:00 |
|
hiyouga
|
8070871732
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
89ca832740
|
update readme
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
0dd632fe9e
|
add TeleChat-12B/TeleChat-12B-v2 models
|
2024-05-29 15:00:37 +08:00 |
|
hiyouga
|
c1fdf81df6
|
tiny fix
|
2024-05-27 20:54:26 +08:00 |
|