157 Commits

Author SHA1 Message Date
hiyouga
17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga
511f675402 fix #2961 2024-03-26 17:26:14 +08:00
hiyouga
70a3052dd8 patch for gemma cpt 2024-03-12 21:21:54 +08:00
hiyouga
b3247d6a16 support olmo 2024-03-12 18:30:38 +08:00
hiyouga
352693e2dc tiny fix 2024-03-11 00:17:18 +08:00
hiyouga
8664262cde support layerwise galore 2024-03-10 00:24:11 +08:00
hiyouga
bdb496644c allow non-packing pretraining 2024-03-09 22:21:46 +08:00
hiyouga
d07ad5cc1c support vllm 2024-03-07 20:26:31 +08:00
hiyouga
921ee82267 fix chatglm3 template 2024-03-07 14:26:16 +08:00
hiyouga
9658c63cd9 fix add tokens 2024-03-06 15:04:02 +08:00
hiyouga
3016e65657 fix version checking 2024-03-06 14:51:51 +08:00
hiyouga
9e56eaf2d3 auto set chat template 2024-03-05 02:41:20 +08:00
hiyouga
24a79bd50f update readme 2024-03-04 19:29:26 +08:00
hiyouga
4cc2781efe fix #2629 2024-02-29 00:37:29 +08:00
hiyouga
38d8b2cef8 update chatglm3 template 2024-02-28 21:11:23 +08:00
hiyouga
3ba1054593 update readme 2024-02-26 17:25:47 +08:00
Rayrtfr
6e0fba60b3 Support Atom Model 2024-02-26 10:44:10 +08:00
hiyouga
354f13c01a fix data entry 2024-02-23 18:29:24 +08:00
hiyouga
6bf4c1274f fix gemma template 2024-02-23 13:49:53 +08:00
hiyouga
a87838ded1 fix template 2024-02-22 12:09:21 +08:00
hiyouga
c375a20230 fix template 2024-02-22 12:06:48 +08:00
hiyouga
c99e19641a support gemma 2024-02-21 23:27:36 +08:00
hiyouga
9aeb404a94 support lora for llama pro 2024-02-21 02:17:22 +08:00
hiyouga
02c8c55ce3 fix #2516 2024-02-20 20:44:24 +08:00
hiyouga
7924ffc55d support llama pro #2338, add rslora 2024-02-15 02:27:36 +08:00
hiyouga
12b2066e34 fix #2471 2024-02-12 21:07:46 +08:00
hiyouga
a754f6e9ec update data/readme 2024-02-10 21:04:29 +08:00
hiyouga
a5fb2806cd update default template 2024-02-10 16:44:47 +08:00
hiyouga
7d2dc83c5e improve aligner 2024-02-10 16:39:19 +08:00
hoshi-hiyouga
388b705a8d Merge pull request #2462 from mnmueller/main (Enable Parsing of SlimOrca) 2024-02-09 22:55:48 +08:00
hiyouga
54ea9684ed improve fix tokenizer 2024-02-09 14:53:14 +08:00
Mark Mueller
1d3598afa1 Slim Orca data parsing 2024-02-08 19:32:20 +01:00
Mark Mueller
6703d0546d Slim Orca data parsing 2024-02-08 17:56:18 +01:00
Mark Mueller
7f792dfede Slim Orca data parsing 2024-02-08 17:54:18 +01:00
Mark Mueller
8bd4182609 Slim Orca data parsing 2024-02-08 17:52:36 +01:00
Mark Mueller
36d7a75966 SlimOrca aligner 2024-02-08 08:28:32 -08:00
hoshi-hiyouga
d0daaa01f9 Merge pull request #2423 from mayflower/main (Support for german sft and dpo) 2024-02-07 15:58:20 +08:00
hiyouga
88a1bc9773 lint 2024-02-07 01:10:04 +08:00
hiyouga
85622ae757 add models 2024-02-06 14:57:23 +08:00
Johann-Peter Hartmann
1ecea9de63 Merge branch 'hiyouga:main' into main 2024-02-04 13:55:00 +00:00
hiyouga
3dc86c4af9 fix #2421 2024-02-04 21:02:55 +08:00
Johann-Peter Hartmann
63da6294dd Merge branch 'hiyouga:main' into main 2024-02-04 12:51:25 +00:00
hiyouga
db0ab4d601 fix reserved label len 2024-02-04 17:54:26 +08:00
hiyouga
51df865734 fix #2397 2024-02-03 23:45:31 +08:00
hiyouga
4ecadc3512 fix #2376 2024-02-03 23:14:31 +08:00
hiyouga
901faa16cc support minicpm #2404 2024-02-03 22:36:46 +08:00
Johann-Peter Hartmann
b0ffde6e98 add simple german chatml template chatml_de 2024-02-03 09:01:15 +01:00
Fallen Angel
3399c0d645 fix eos_token_id=0 bug (when eos_token_id=0, will never add eos_token) 2024-02-02 17:34:48 +08:00
hiyouga
b2fb0eca56 fix #2282 and update tool prompt 2024-01-22 22:27:30 +08:00
hiyouga
6fc2d5cc03 add orion models 2024-01-22 21:26:53 +08:00