Commit Graph

122 Commits

Author SHA1 Message Date
hiyouga
cab0598fd0 add mixtral 8x22B models 2024-04-17 23:35:59 +08:00
hiyouga
b3ac14ffc4 add empty template 2024-04-16 03:10:02 +08:00
hoshi-hiyouga
3ccf0d0977 Update template.py 2024-04-15 22:58:01 +08:00
marko1616
2c89b38720 change default_system accroding to official template 2024-04-15 20:45:46 +08:00
marko1616
90c5dddf9a Revert "Add support for function call(Not strictly following origin)"
This reverts commit d7b9bbc8b9.
2024-04-15 20:27:09 +08:00
marko1616
d7b9bbc8b9 Add support for function call(Not strictly following origin) 2024-04-15 20:16:52 +08:00
marko1616
42806323f0 Typo fix 2024-04-13 07:52:11 +08:00
marko1616
6574a721d2 Add template&support(Not tested) 2024-04-13 04:31:33 +08:00
hiyouga
9d4c949461 release v0.6.2 2024-04-11 20:08:51 +08:00
hiyouga
9a99fbc86d tiny fix 2024-04-08 21:28:39 +08:00
codingma
34bdcba017 rename template to breeze 2024-04-07 18:27:20 +08:00
codingma
5a780e9eec rename template to breeze 2024-04-07 11:39:54 +08:00
codingma
2565a32bd9 support https://github.com/hiyouga/LLaMA-Factory/issues/3152 2024-04-07 11:34:01 +08:00
hiyouga
92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga
dd73a0c248 set dev version 2024-04-01 23:24:08 +08:00
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga
511f675402 fix #2961 2024-03-26 17:26:14 +08:00
hiyouga
70a3052dd8 patch for gemma cpt 2024-03-12 21:21:54 +08:00
hiyouga
b3247d6a16 support olmo 2024-03-12 18:30:38 +08:00
hiyouga
352693e2dc tiny fix 2024-03-11 00:17:18 +08:00
hiyouga
8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga
bdb496644c allow non-packing pretraining 2024-03-09 22:21:46 +08:00
hiyouga
d07ad5cc1c support vllm 2024-03-07 20:26:31 +08:00
hiyouga
921ee82267 fix chatglm3 template 2024-03-07 14:26:16 +08:00
hiyouga
9658c63cd9 fix add tokens 2024-03-06 15:04:02 +08:00
hiyouga
3016e65657 fix version checking 2024-03-06 14:51:51 +08:00
hiyouga
9e56eaf2d3 auto set chat template 2024-03-05 02:41:20 +08:00
hiyouga
24a79bd50f update readme 2024-03-04 19:29:26 +08:00
hiyouga
4cc2781efe fix #2629 2024-02-29 00:37:29 +08:00
hiyouga
38d8b2cef8 update chatglm3 template 2024-02-28 21:11:23 +08:00
hiyouga
3ba1054593 update readme 2024-02-26 17:25:47 +08:00
Rayrtfr
6e0fba60b3 Support Atom Model 2024-02-26 10:44:10 +08:00
hiyouga
354f13c01a fix data entry 2024-02-23 18:29:24 +08:00
hiyouga
6bf4c1274f fix gemma template 2024-02-23 13:49:53 +08:00
hiyouga
a87838ded1 fix template 2024-02-22 12:09:21 +08:00
hiyouga
c375a20230 fix template 2024-02-22 12:06:48 +08:00
hiyouga
c99e19641a support gemma 2024-02-21 23:27:36 +08:00
hiyouga
9aeb404a94 support lora for llama pro 2024-02-21 02:17:22 +08:00
hiyouga
02c8c55ce3 fix #2516 2024-02-20 20:44:24 +08:00
hiyouga
7924ffc55d support llama pro #2338 , add rslora 2024-02-15 02:27:36 +08:00
hiyouga
12b2066e34 fix #2471 2024-02-12 21:07:46 +08:00
hiyouga
a754f6e9ec update data/readme 2024-02-10 21:04:29 +08:00
hiyouga
a5fb2806cd update default template 2024-02-10 16:44:47 +08:00
hiyouga
7d2dc83c5e improve aligner 2024-02-10 16:39:19 +08:00
hoshi-hiyouga
388b705a8d Merge pull request #2462 from mnmueller/main
Enable Parsing of SlimOrca
2024-02-09 22:55:48 +08:00
hiyouga
54ea9684ed improve fix tokenizer 2024-02-09 14:53:14 +08:00
Mark Mueller
1d3598afa1 Slim Orca data parsing 2024-02-08 19:32:20 +01:00
Mark Mueller
6703d0546d Slim Orca data parsing 2024-02-08 17:56:18 +01:00
Mark Mueller
7f792dfede Slim Orca data parsing 2024-02-08 17:54:18 +01:00
Mark Mueller
8bd4182609 Slim Orca data parsing 2024-02-08 17:52:36 +01:00