Commit Graph

199 Commits

Author / SHA1 Message / Former-commit-id / Date (one line each per commit)
hiyouga
7999836fb6 support fsdp + qlora
Former-commit-id: 8408225162
2024-03-21 00:36:06 +08:00
hiyouga
096c31bfb6 patch for gemma cpt
Former-commit-id: 70a3052dd8
2024-03-12 21:21:54 +08:00
hiyouga
c28818c39f fix plot issues
Former-commit-id: 60cc17f3a8
2024-03-12 18:41:35 +08:00
hiyouga
14ed926a2d support olmo
Former-commit-id: b3247d6a16
2024-03-12 18:30:38 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644c
2024-03-09 22:21:46 +08:00
hiyouga
f373290012 add Yi-9B model
Former-commit-id: 57452a4aa1
2024-03-07 23:11:57 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
31c618f1f7 tiny fix
Former-commit-id: 0048a2021e
2024-03-06 17:25:08 +08:00
hiyouga
e887aface7 fix version checking
Former-commit-id: 3016e65657
2024-03-06 14:51:51 +08:00
hiyouga
0e58cd6422 fix sub-process error in thread
Former-commit-id: 9c10854b46
2024-03-03 15:04:35 +08:00
hiyouga
9ae1514a75 update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214
2024-03-03 01:01:46 +08:00
hiyouga
57f85add58 update chatglm3 template
Former-commit-id: 38d8b2cef8
2024-02-28 21:11:23 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
2024-02-28 19:53:28 +08:00
hiyouga
3af5fea981 update readme
Former-commit-id: 3ba1054593
2024-02-26 17:25:47 +08:00
Rayrtfr
41e9dd2cf8 Support Atom Model
Former-commit-id: 6e0fba60b3
2024-02-26 10:44:10 +08:00
hiyouga
1845e03921 support gemma
Former-commit-id: c99e19641a
2024-02-21 23:27:36 +08:00
hiyouga
96265ec154 support llama pro #2338 , add rslora
Former-commit-id: 7924ffc55d
2024-02-15 02:27:36 +08:00
hiyouga
75adbfec79 add option to disable version check
Former-commit-id: 91d09a01ac
2024-02-10 22:31:23 +08:00
hiyouga
23dd337ac2 lint
Former-commit-id: 88a1bc9773
2024-02-07 01:10:04 +08:00
hiyouga
9debd64cef add models
Former-commit-id: 85622ae757
2024-02-06 14:57:23 +08:00
hiyouga
dcfb9b5cfa support qwen1.5
Former-commit-id: ccabb5b04a
2024-02-06 00:10:51 +08:00
hiyouga
b8a827faeb fix #2320
Former-commit-id: 2bc30763e9
2024-01-24 16:19:18 +08:00
hiyouga
9898712a24 add orion models
Former-commit-id: 6fc2d5cc03
2024-01-22 21:26:53 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee
2024-01-20 20:15:56 +08:00
hiyouga
2f7684a8ee fix tests
Former-commit-id: f6d6e00337
2024-01-20 19:58:04 +08:00
hiyouga
69e8925249 support longlora for main branch
Former-commit-id: 38af076a75
2024-01-20 19:25:22 +08:00
hiyouga
0a8f46882c update readme
Former-commit-id: 5608a0da8e
2024-01-18 14:30:48 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae351
2024-01-18 09:54:23 +08:00
hiyouga
7e16d27fca tiny fix
Former-commit-id: 5a207bb723
2024-01-15 23:34:23 +08:00
hiyouga
21020e51ca support solar 10.7B #1907
Former-commit-id: bf73224f33
2024-01-14 00:30:30 +08:00
hiyouga
9771acfd75 support deepseek moe
Former-commit-id: ca3933dc52
2024-01-14 00:14:49 +08:00
hiyouga
dad632091d fix phi modules
Former-commit-id: d1a73fe26c
2024-01-13 23:12:47 +08:00
hiyouga
6378864390 fix #2161
Former-commit-id: 898ec3696a
2024-01-11 17:04:13 +08:00
hiyouga
bd6213331f improve web ui
Former-commit-id: 1653c22438
2024-01-10 12:37:45 +08:00
hiyouga
b29c4fb308 modify weight name
Former-commit-id: 919acc2b0b
2024-01-09 20:22:47 +08:00
hiyouga
61960189b2 fix #1789
Former-commit-id: 4571068e1e
2024-01-09 18:31:27 +08:00
hiyouga
dabd40750c fix #2090
Former-commit-id: cc275abe09
2024-01-04 23:05:08 +08:00
hiyouga
9a496950aa fix #2067
Former-commit-id: 368b31f6b7
2024-01-04 22:53:03 +08:00
hiyouga
4735cb96c1 add yuan model
Former-commit-id: c7ea17d616
2023-12-29 13:50:24 +08:00
hiyouga
c1233ab65f add max_memory for gptq #1923
Former-commit-id: c4a3977ad7
2023-12-20 18:15:17 +08:00
hiyouga
51c636db54 add xverse-65B-2 model
Former-commit-id: 2df923540c
2023-12-18 19:24:09 +08:00
hiyouga
1af13cb737 add models
Former-commit-id: 709ac8870a
2023-12-18 19:09:31 +08:00
hiyouga
397f6bb615 add xverse-65b-chat model
Former-commit-id: 7ae6919b9b
2023-12-16 20:21:29 +08:00
hiyouga
4e75ca1222 support dpo-ftx
Former-commit-id: b87c74289d
2023-12-16 19:21:41 +08:00
hiyouga
f0f9d253d8 support autogptq in llama board #246
Former-commit-id: 71389be37c
2023-12-16 16:31:30 +08:00
yhyu13
cc91724507 Use llmtuner logger
Former-commit-id: fc70a92cb6
2023-12-16 07:15:27 +00:00
yhyu13
362e3c913f Improve logging for unknown args
Former-commit-id: 26817143ff
2023-12-16 05:16:29 +00:00
hiyouga
7dbc670902 support quantization in export model
Former-commit-id: 3524aa1e58
2023-12-15 23:44:50 +08:00
hiyouga
f9ab303629 add model urls
Former-commit-id: 3552035d7e
2023-12-13 00:09:17 +08:00