125 Commits

Author SHA1 Message Date
hiyouga
c9a2a7a3b3 fix sharegpt loading
Former-commit-id: 2c8e88f9c14e030f79778893eaeeb7abda97dd0b
2023-12-13 00:56:16 +08:00
hiyouga
1a0bdd305c support system column #1765
Former-commit-id: 0a9c6e0146ebc71d5438c837463d6ab236e227c4
2023-12-12 19:45:59 +08:00
hiyouga
cefc0b2f03 fix modelscope data hub
Former-commit-id: d5b2c57a356539df9993e4774b856231eca8a6da
2023-12-12 18:33:06 +08:00
hoshi-hiyouga
b67085e13a Merge branch 'main' into feat/support_ms
Former-commit-id: 6382efec52f6be3daa5db0bd280a96162009fca1
2023-12-12 17:55:32 +08:00
xingjun.wang
6cb2c99e7d add use_streaming
Former-commit-id: adc98c86dad64f1a793017fa628b5cf19abbdd01
2023-12-12 14:23:05 +08:00
xingjun.wang
1bd75afae8 fix cache dir
Former-commit-id: 1909f0d11732bd99fadc6c1191e026137c6a7dff
2023-12-12 14:21:33 +08:00
xingjun.wang
c1974c91e5 add print info for test
Former-commit-id: 168321a4da7612620b9528860306f03bf65d019a
2023-12-12 14:14:40 +08:00
xingjun.wang
e17f2a3f7f update cache dir
Former-commit-id: edc82b923a3fb03c5af100b5357e10f0c18b4523
2023-12-12 13:08:18 +08:00
xingjun.wang
879209829e update args for MsDataset.load
Former-commit-id: 09533e95edc5fa65a38b2f04c6d88506196021b3
2023-12-12 13:02:54 +08:00
xingjun.wang
6520aecef1 update
Former-commit-id: cfba1009d0fc31b5933b558b249d89248f723d6b
2023-12-12 12:03:23 +08:00
xingjun.wang
1d65d24071 for test
Former-commit-id: 5b979147f093e86f44c4228ab34d04bdae94f89f
2023-12-12 11:52:59 +08:00
xingjun.wang
2918743520 for test
Former-commit-id: 8a908a8c644f4a961001cdd8388a3a7fea992c55
2023-12-12 11:47:59 +08:00
hiyouga
b7d99ad5f4 support mixtral
Former-commit-id: 96380f5e1887bb166be339e58ab8f65e464d4010
2023-12-12 11:39:04 +08:00
hiyouga
b641e9e97e fix #1784
Former-commit-id: 28d5de7e785f31b223a4646c9c1c770f43e187ec
2023-12-09 20:53:18 +08:00
yuze.zyz
c523613f0a support ms dataset
Former-commit-id: 9c2247d700763f480d88a5dd46480cb32cfc174e
2023-12-08 18:00:57 +08:00
hiyouga
9b84a706af add models
Former-commit-id: e25f7bae16b7ea41a4a1fd1e8db1b961e55d0c5b
2023-12-06 13:33:18 +08:00
hiyouga
f8376b228a add xuanyuan models
Former-commit-id: 6e7af11b989e4cf97ffacbab4736e3434ff6c925
2023-12-02 00:35:29 +08:00
hiyouga
1c43fb6a41 add models
Former-commit-id: 509abe8864ada29ac7fa0f636b662531c8dd3a33
2023-11-30 19:16:13 +08:00
hiyouga
5f2943dc84 support Yi-34B-Chat models
Former-commit-id: ff1c289229ee382d3e76578bbb6a5e299b969ded
2023-11-23 19:31:49 +08:00
hiyouga
4966bd7911 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: 9ea93801459b0d271d21a2d730c44abae9106c51
2023-11-20 22:52:11 +08:00
hiyouga
32545bd6d9 better data streaming
Former-commit-id: 00baaa990e099d6b75436eaa7a922a07646afa26
2023-11-19 23:32:47 +08:00
hiyouga
8d82d7e994 fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547

Former-commit-id: bfb94331657f385f4653ddcb8f7b57d1c052804d
2023-11-19 16:29:30 +08:00
hiyouga
5de45bf989 fix chatglm template
Former-commit-id: ed9f7705efbed0accf4dc5c9dfa9e3e7e15e1174
2023-11-16 22:54:15 +08:00
hiyouga
3e0b76650a update readme and constants
Former-commit-id: 1e19cf242a1f843b590feefbe24b2cc0a17712b5
2023-11-15 18:04:37 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
2023-11-15 16:29:09 +08:00