hiyouga | 9aa1a2fc17 | fix #2147 | 2024-01-12 03:30:56 +08:00
hiyouga | ed216bbc46 | fix qwen template | 2024-01-05 16:14:56 +08:00
hiyouga | ce2156eaa8 | fix #2014 | 2023-12-29 15:17:22 +08:00
hiyouga | e165354fac | fix export format | 2023-12-28 18:40:46 +08:00
hiyouga | 0bbf7118df | fix #1909 | 2023-12-23 14:42:20 +08:00
hiyouga | dec360d5ae | fix stop words | 2023-12-20 19:06:43 +08:00
hiyouga | 5af8841c4f | fix yi template #1895 | 2023-12-20 18:58:16 +08:00
hiyouga | c4a3977ad7 | add max_memory for gptq #1923 | 2023-12-20 18:15:17 +08:00
hiyouga | ec1fe1daa9 | optimize data loading logic | 2023-12-20 16:15:41 +08:00
hiyouga | c6abbbfe90 | fix #1909 | 2023-12-20 16:11:07 +08:00
hiyouga | a67a440644 | add codegeex template | 2023-12-18 19:52:35 +08:00
hiyouga | 3524aa1e58 | support quantization in export model | 2023-12-15 23:44:50 +08:00
hiyouga | 0716f5e470 | refactor adapter hparam | 2023-12-15 20:53:11 +08:00
hiyouga | 2c8e88f9c1 | fix sharegpt loading | 2023-12-13 00:56:16 +08:00
hiyouga | 0a9c6e0146 | support system column #1765 | 2023-12-12 19:45:59 +08:00
hiyouga | d5b2c57a35 | fix modelscope data hub | 2023-12-12 18:33:06 +08:00
hoshi-hiyouga | 6382efec52 | Merge branch 'main' into feat/support_ms | 2023-12-12 17:55:32 +08:00
xingjun.wang | adc98c86da | add use_streaming | 2023-12-12 14:23:05 +08:00
xingjun.wang | 1909f0d117 | fix cache dir | 2023-12-12 14:21:33 +08:00
xingjun.wang | 168321a4da | add print info for test | 2023-12-12 14:14:40 +08:00
xingjun.wang | edc82b923a | update cache dir | 2023-12-12 13:08:18 +08:00
xingjun.wang | 09533e95ed | update args for MsDataset.load | 2023-12-12 13:02:54 +08:00
xingjun.wang | cfba1009d0 | update | 2023-12-12 12:03:23 +08:00
xingjun.wang | 5b979147f0 | for test | 2023-12-12 11:52:59 +08:00
xingjun.wang | 8a908a8c64 | for test | 2023-12-12 11:47:59 +08:00
hiyouga | 96380f5e18 | support mixtral | 2023-12-12 11:39:04 +08:00
hiyouga | 28d5de7e78 | fix #1784 | 2023-12-09 20:53:18 +08:00
yuze.zyz | 9c2247d700 | support ms dataset | 2023-12-08 18:00:57 +08:00
hiyouga | e25f7bae16 | add models | 2023-12-06 13:33:18 +08:00
hiyouga | 6e7af11b98 | add xuanyuan models | 2023-12-02 00:35:29 +08:00
hiyouga | 509abe8864 | add models | 2023-11-30 19:16:13 +08:00
hiyouga | ff1c289229 | support Yi-34B-Chat models | 2023-11-23 19:31:49 +08:00
hiyouga | 9ea9380145 | support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 | 2023-11-20 22:52:11 +08:00
hiyouga | 00baaa990e | better data streaming | 2023-11-19 23:32:47 +08:00
hiyouga | bfb9433165 | fix Mistral template https://github.com/lm-sys/FastChat/pull/2547 | 2023-11-19 16:29:30 +08:00
hiyouga | ed9f7705ef | fix chatglm template | 2023-11-16 22:54:15 +08:00
hiyouga | 1e19cf242a | update readme and constants | 2023-11-15 18:04:37 +08:00
hiyouga | 4736344eb1 | disentangle model from tuner and rename modules | 2023-11-15 16:29:09 +08:00