hiyouga
|
fb385b8c26
|
update examples
Former-commit-id: cce52351b5
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
431e9804ee
|
release v0.6.2
Former-commit-id: 9d4c949461
|
2024-04-11 20:08:51 +08:00 |
|
hiyouga
|
d5ca8763ea
|
fix #3225
Former-commit-id: a99f5ed0b6
|
2024-04-10 23:57:59 +08:00 |
|
hiyouga
|
3069f37021
|
tiny fix
Former-commit-id: 9a99fbc86d
|
2024-04-08 21:28:39 +08:00 |
|
hoshi-hiyouga
|
8682d033eb
|
Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B
Former-commit-id: 4c6c4a0d88
|
2024-04-08 20:56:51 +08:00 |
|
codingma
|
b5f0ac4c3f
|
add empty line
Former-commit-id: 7b76b4ca08
|
2024-04-07 18:28:08 +08:00 |
|
codingma
|
ed14f8bae7
|
rename template to breeze
Former-commit-id: 5a780e9eec
|
2024-04-07 11:39:54 +08:00 |
|
codingma
|
80aa1f70b6
|
support https://github.com/hiyouga/LLaMA-Factory/issues/3152
Former-commit-id: 2565a32bd9
|
2024-04-07 11:34:01 +08:00 |
|
sliderSun
|
7037dcbf38
|
fix spell error
Former-commit-id: 1d117b7bb6
|
2024-04-07 10:59:15 +08:00 |
|
sliderSun
|
1fbf190eda
|
support Qwen1.5-32B
Former-commit-id: 21650d467c
|
2024-04-07 10:56:03 +08:00 |
|
sliderSun
|
09107affda
|
support Qwen1.5-32B
Former-commit-id: 77044d9ef4
|
2024-04-07 10:26:13 +08:00 |
|
hiyouga
|
f334b89616
|
back to gradio 4.21 and fix chat
Former-commit-id: 4b920f24d3
|
2024-04-04 02:07:20 +08:00 |
|
hiyouga
|
54a4a8217a
|
fix bug in latest gradio
Former-commit-id: 5ddcecda50
|
2024-04-04 00:55:31 +08:00 |
|
hiyouga
|
bf5ffeeae0
|
simplify readme
Former-commit-id: 92dab8a90b
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
f4be51f356
|
add moe aux loss control #3085
Former-commit-id: b267aeb53f
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
8d987b7af7
|
add qwen1.5 moe
Former-commit-id: 54b7d34908
|
2024-04-01 21:49:40 +08:00 |
|
hiyouga
|
34f1de0574
|
fix #3077
Former-commit-id: aee634cd20
|
2024-04-01 21:35:18 +08:00 |
|
hiyouga
|
2f878bde11
|
support ORPO
Former-commit-id: 17bf8a2c3a
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
3bf6dde3a5
|
support save args in webui #2807 #3046
some ideas are borrowed from @marko1616
Former-commit-id: 7a086ed333
|
2024-03-30 23:09:12 +08:00 |
|
hiyouga
|
e4f3d583df
|
fix #2982
Former-commit-id: 8d603f8820
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
eac2a5b1d3
|
fix #3010
Former-commit-id: b19c14870d
|
2024-03-28 18:31:17 +08:00 |
|
hiyouga
|
84c3d509fa
|
fix #2936
Former-commit-id: 140ad4ad56
|
2024-03-24 00:43:21 +08:00 |
|
hiyouga
|
58aa576ae5
|
fix #2941
Former-commit-id: a1c8c98c5f
|
2024-03-24 00:28:44 +08:00 |
|
hiyouga
|
7999836fb6
|
support fsdp + qlora
Former-commit-id: 8408225162
|
2024-03-21 00:36:06 +08:00 |
|
hiyouga
|
096c31bfb6
|
patch for gemma cpt
Former-commit-id: 70a3052dd8
|
2024-03-12 21:21:54 +08:00 |
|
hiyouga
|
c28818c39f
|
fix plot issues
Former-commit-id: 60cc17f3a8
|
2024-03-12 18:41:35 +08:00 |
|
hiyouga
|
14ed926a2d
|
support olmo
Former-commit-id: b3247d6a16
|
2024-03-12 18:30:38 +08:00 |
|
hiyouga
|
868444e124
|
allow non-packing pretraining
Former-commit-id: bdb496644c
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
f373290012
|
add Yi-9B model
Former-commit-id: 57452a4aa1
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
2c010c72b8
|
support galore
Former-commit-id: 28f7862188
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
34533b2f35
|
support vllm
Former-commit-id: d07ad5cc1c
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
31c618f1f7
|
tiny fix
Former-commit-id: 0048a2021e
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
e887aface7
|
fix version checking
Former-commit-id: 3016e65657
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
0e58cd6422
|
fix sub-process error in thread
Former-commit-id: 9c10854b46
|
2024-03-03 15:04:35 +08:00 |
|
hiyouga
|
9ae1514a75
|
update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214
|
2024-03-03 01:01:46 +08:00 |
|
hiyouga
|
57f85add58
|
update chatglm3 template
Former-commit-id: 38d8b2cef8
|
2024-02-28 21:11:23 +08:00 |
|
hiyouga
|
5abbca70d3
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
3af5fea981
|
update readme
Former-commit-id: 3ba1054593
|
2024-02-26 17:25:47 +08:00 |
|
Rayrtfr
|
41e9dd2cf8
|
Support Atom Model
Former-commit-id: 6e0fba60b3
|
2024-02-26 10:44:10 +08:00 |
|
hiyouga
|
1845e03921
|
support gemma
Former-commit-id: c99e19641a
|
2024-02-21 23:27:36 +08:00 |
|
hiyouga
|
96265ec154
|
support llama pro #2338 , add rslora
Former-commit-id: 7924ffc55d
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
75adbfec79
|
add option to disable version check
Former-commit-id: 91d09a01ac
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
23dd337ac2
|
lint
Former-commit-id: 88a1bc9773
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
9debd64cef
|
add models
Former-commit-id: 85622ae757
|
2024-02-06 14:57:23 +08:00 |
|
hiyouga
|
dcfb9b5cfa
|
support qwen1.5
Former-commit-id: ccabb5b04a
|
2024-02-06 00:10:51 +08:00 |
|
hiyouga
|
b8a827faeb
|
fix #2320
Former-commit-id: 2bc30763e9
|
2024-01-24 16:19:18 +08:00 |
|
hiyouga
|
9898712a24
|
add orion models
Former-commit-id: 6fc2d5cc03
|
2024-01-22 21:26:53 +08:00 |
|
hiyouga
|
b27e91222c
|
format style
Former-commit-id: 638234ceee
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
2f7684a8ee
|
fix tests
Former-commit-id: f6d6e00337
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
69e8925249
|
support longlora for main branch
Former-commit-id: 38af076a75
|
2024-01-20 19:25:22 +08:00 |
|