hiyouga
|
7999836fb6
|
support fsdp + qlora
Former-commit-id: 8408225162
|
2024-03-21 00:36:06 +08:00 |
|
hiyouga
|
096c31bfb6
|
patch for gemma cpt
Former-commit-id: 70a3052dd8
|
2024-03-12 21:21:54 +08:00 |
|
hiyouga
|
c28818c39f
|
fix plot issues
Former-commit-id: 60cc17f3a8
|
2024-03-12 18:41:35 +08:00 |
|
hiyouga
|
14ed926a2d
|
support olmo
Former-commit-id: b3247d6a16
|
2024-03-12 18:30:38 +08:00 |
|
hiyouga
|
868444e124
|
allow non-packing pretraining
Former-commit-id: bdb496644c
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
f373290012
|
add Yi-9B model
Former-commit-id: 57452a4aa1
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
2c010c72b8
|
support galore
Former-commit-id: 28f7862188
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
34533b2f35
|
support vllm
Former-commit-id: d07ad5cc1c
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
31c618f1f7
|
tiny fix
Former-commit-id: 0048a2021e
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
e887aface7
|
fix version checking
Former-commit-id: 3016e65657
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
0e58cd6422
|
fix sub-process error in thread
Former-commit-id: 9c10854b46
|
2024-03-03 15:04:35 +08:00 |
|
hiyouga
|
9ae1514a75
|
update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214
|
2024-03-03 01:01:46 +08:00 |
|
hiyouga
|
57f85add58
|
update chatglm3 template
Former-commit-id: 38d8b2cef8
|
2024-02-28 21:11:23 +08:00 |
|
hiyouga
|
5abbca70d3
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
3af5fea981
|
update readme
Former-commit-id: 3ba1054593
|
2024-02-26 17:25:47 +08:00 |
|
Rayrtfr
|
41e9dd2cf8
|
Support Atom Model
Former-commit-id: 6e0fba60b3
|
2024-02-26 10:44:10 +08:00 |
|
hiyouga
|
1845e03921
|
support gemma
Former-commit-id: c99e19641a
|
2024-02-21 23:27:36 +08:00 |
|
hiyouga
|
96265ec154
|
support llama pro #2338 , add rslora
Former-commit-id: 7924ffc55d
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
75adbfec79
|
add option to disable version check
Former-commit-id: 91d09a01ac
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
23dd337ac2
|
lint
Former-commit-id: 88a1bc9773
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
9debd64cef
|
add models
Former-commit-id: 85622ae757
|
2024-02-06 14:57:23 +08:00 |
|
hiyouga
|
dcfb9b5cfa
|
support qwen1.5
Former-commit-id: ccabb5b04a
|
2024-02-06 00:10:51 +08:00 |
|
hiyouga
|
b8a827faeb
|
fix #2320
Former-commit-id: 2bc30763e9
|
2024-01-24 16:19:18 +08:00 |
|
hiyouga
|
9898712a24
|
add orion models
Former-commit-id: 6fc2d5cc03
|
2024-01-22 21:26:53 +08:00 |
|
hiyouga
|
b27e91222c
|
format style
Former-commit-id: 638234ceee
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
2f7684a8ee
|
fix tests
Former-commit-id: f6d6e00337
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
69e8925249
|
support longlora for main branch
Former-commit-id: 38af076a75
|
2024-01-20 19:25:22 +08:00 |
|
hiyouga
|
0a8f46882c
|
update readme
Former-commit-id: 5608a0da8e
|
2024-01-18 14:30:48 +08:00 |
|
hiyouga
|
4e3bfb799d
|
support function calling
Former-commit-id: d9f1cae351
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
7e16d27fca
|
tiny fix
Former-commit-id: 5a207bb723
|
2024-01-15 23:34:23 +08:00 |
|
hiyouga
|
21020e51ca
|
support solar 10.7B #1907
Former-commit-id: bf73224f33
|
2024-01-14 00:30:30 +08:00 |
|
hiyouga
|
9771acfd75
|
support deepseek moe
Former-commit-id: ca3933dc52
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
dad632091d
|
fix phi modules
Former-commit-id: d1a73fe26c
|
2024-01-13 23:12:47 +08:00 |
|
hiyouga
|
6378864390
|
fix #2161
Former-commit-id: 898ec3696a
|
2024-01-11 17:04:13 +08:00 |
|
hiyouga
|
bd6213331f
|
improve web ui
Former-commit-id: 1653c22438
|
2024-01-10 12:37:45 +08:00 |
|
hiyouga
|
b29c4fb308
|
modify weight name
Former-commit-id: 919acc2b0b
|
2024-01-09 20:22:47 +08:00 |
|
hiyouga
|
61960189b2
|
fix #1789
Former-commit-id: 4571068e1e
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
dabd40750c
|
fix #2090
Former-commit-id: cc275abe09
|
2024-01-04 23:05:08 +08:00 |
|
hiyouga
|
9a496950aa
|
fix #2067
Former-commit-id: 368b31f6b7
|
2024-01-04 22:53:03 +08:00 |
|
hiyouga
|
4735cb96c1
|
add yuan model
Former-commit-id: c7ea17d616
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
c1233ab65f
|
add max_memory for gptq #1923
Former-commit-id: c4a3977ad7
|
2023-12-20 18:15:17 +08:00 |
|
hiyouga
|
51c636db54
|
add xverse-65B-2 model
Former-commit-id: 2df923540c
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
1af13cb737
|
add models
Former-commit-id: 709ac8870a
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
397f6bb615
|
add xverse-65b-chat model
Former-commit-id: 7ae6919b9b
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
4e75ca1222
|
support dpo-ftx
Former-commit-id: b87c74289d
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
f0f9d253d8
|
support autogptq in llama board #246
Former-commit-id: 71389be37c
|
2023-12-16 16:31:30 +08:00 |
|
yhyu13
|
cc91724507
|
Use llmtuner logger
Former-commit-id: fc70a92cb6
|
2023-12-16 07:15:27 +00:00 |
|
yhyu13
|
362e3c913f
|
Improve logging for unknown args
Former-commit-id: 26817143ff
|
2023-12-16 05:16:29 +00:00 |
|
hiyouga
|
7dbc670902
|
support quantization in export model
Former-commit-id: 3524aa1e58
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
f9ab303629
|
add model urls
Former-commit-id: 3552035d7e
|
2023-12-13 00:09:17 +08:00 |
|