codingma
|
2565a32bd9
|
support https://github.com/hiyouga/LLaMA-Factory/issues/3152
|
2024-04-07 11:34:01 +08:00 |
|
hiyouga
|
4b920f24d3
|
back to gradio 4.21 and fix chat
|
2024-04-04 02:07:20 +08:00 |
|
hiyouga
|
5ddcecda50
|
fix bug in latest gradio
|
2024-04-04 00:55:31 +08:00 |
|
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
b267aeb53f
|
add moe aux loss control #3085
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
54b7d34908
|
add qwen1.5 moe
|
2024-04-01 21:49:40 +08:00 |
|
hiyouga
|
aee634cd20
|
fix #3077
|
2024-04-01 21:35:18 +08:00 |
|
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
7a086ed333
|
support save args in webui #2807 #3046
some ideas are borrowed from @marko1616
|
2024-03-30 23:09:12 +08:00 |
|
hiyouga
|
8d603f8820
|
fix #2982
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
b19c14870d
|
fix #3010
|
2024-03-28 18:31:17 +08:00 |
|
hiyouga
|
140ad4ad56
|
fix #2936
|
2024-03-24 00:43:21 +08:00 |
|
hiyouga
|
a1c8c98c5f
|
fix #2941
|
2024-03-24 00:28:44 +08:00 |
|
hiyouga
|
8408225162
|
support fsdp + qlora
|
2024-03-21 00:36:06 +08:00 |
|
hiyouga
|
70a3052dd8
|
patch for gemma cpt
|
2024-03-12 21:21:54 +08:00 |
|
hiyouga
|
60cc17f3a8
|
fix plot issues
|
2024-03-12 18:41:35 +08:00 |
|
hiyouga
|
b3247d6a16
|
support olmo
|
2024-03-12 18:30:38 +08:00 |
|
hiyouga
|
bdb496644c
|
allow non-packing pretraining
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
57452a4aa1
|
add Yi-9B model
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
28f7862188
|
support galore
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
d07ad5cc1c
|
support vllm
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
0048a2021e
|
tiny fix
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
3016e65657
|
fix version checking
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
9c10854b46
|
fix sub-process error in thread
|
2024-03-03 15:04:35 +08:00 |
|
hiyouga
|
894d183214
|
update readme, add starcoder2, cosmopedia
|
2024-03-03 01:01:46 +08:00 |
|
hiyouga
|
38d8b2cef8
|
update chatglm3 template
|
2024-02-28 21:11:23 +08:00 |
|
hiyouga
|
cfefacaa37
|
support DoRA, AWQ, AQLM #2512
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
3ba1054593
|
update readme
|
2024-02-26 17:25:47 +08:00 |
|
Rayrtfr
|
6e0fba60b3
|
Support Atom Model
|
2024-02-26 10:44:10 +08:00 |
|
hiyouga
|
c99e19641a
|
support gemma
|
2024-02-21 23:27:36 +08:00 |
|
hiyouga
|
7924ffc55d
|
support llama pro #2338 , add rslora
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
91d09a01ac
|
add option to disable version check
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
88a1bc9773
|
lint
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
85622ae757
|
add models
|
2024-02-06 14:57:23 +08:00 |
|
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
|
hiyouga
|
2bc30763e9
|
fix #2320
|
2024-01-24 16:19:18 +08:00 |
|
hiyouga
|
6fc2d5cc03
|
add orion models
|
2024-01-22 21:26:53 +08:00 |
|
hiyouga
|
638234ceee
|
format style
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
f6d6e00337
|
fix tests
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
38af076a75
|
support longlora for main branch
|
2024-01-20 19:25:22 +08:00 |
|
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
|
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
|
hiyouga
|
bf73224f33
|
support solar 10.7B #1907
|
2024-01-14 00:30:30 +08:00 |
|
hiyouga
|
ca3933dc52
|
support deepseek moe
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
|
hiyouga
|
898ec3696a
|
fix #2161
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
1653c22438
|
improve web ui
|
2024-01-10 12:37:45 +08:00 |
|
hiyouga
|
919acc2b0b
|
modify weight name
|
2024-01-09 20:22:47 +08:00 |
|
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
|