| hiyouga | 17bf8a2c3a | support ORPO | 2024-03-31 18:29:50 +08:00 |
| hiyouga | 511f675402 | fix #2961 | 2024-03-26 17:26:14 +08:00 |
| hiyouga | 70a3052dd8 | patch for gemma cpt | 2024-03-12 21:21:54 +08:00 |
| hiyouga | b3247d6a16 | support olmo | 2024-03-12 18:30:38 +08:00 |
| hiyouga | 352693e2dc | tiny fix | 2024-03-11 00:17:18 +08:00 |
| hiyouga | 8664262cde | support layerwise galore | 2024-03-10 00:24:11 +08:00 |
| hiyouga | bdb496644c | allow non-packing pretraining | 2024-03-09 22:21:46 +08:00 |
| hiyouga | d07ad5cc1c | support vllm | 2024-03-07 20:26:31 +08:00 |
| hiyouga | 921ee82267 | fix chatglm3 template | 2024-03-07 14:26:16 +08:00 |
| hiyouga | 9658c63cd9 | fix add tokens | 2024-03-06 15:04:02 +08:00 |
| hiyouga | 3016e65657 | fix version checking | 2024-03-06 14:51:51 +08:00 |
| hiyouga | 9e56eaf2d3 | auto set chat template | 2024-03-05 02:41:20 +08:00 |
| hiyouga | 24a79bd50f | update readme | 2024-03-04 19:29:26 +08:00 |
| hiyouga | 4cc2781efe | fix #2629 | 2024-02-29 00:37:29 +08:00 |
| hiyouga | 38d8b2cef8 | update chatglm3 template | 2024-02-28 21:11:23 +08:00 |
| hiyouga | 3ba1054593 | update readme | 2024-02-26 17:25:47 +08:00 |
| Rayrtfr | 6e0fba60b3 | Support Atom Model | 2024-02-26 10:44:10 +08:00 |
| hiyouga | 354f13c01a | fix data entry | 2024-02-23 18:29:24 +08:00 |
| hiyouga | 6bf4c1274f | fix gemma template | 2024-02-23 13:49:53 +08:00 |
| hiyouga | a87838ded1 | fix template | 2024-02-22 12:09:21 +08:00 |
| hiyouga | c375a20230 | fix template | 2024-02-22 12:06:48 +08:00 |
| hiyouga | c99e19641a | support gemma | 2024-02-21 23:27:36 +08:00 |
| hiyouga | 9aeb404a94 | support lora for llama pro | 2024-02-21 02:17:22 +08:00 |
| hiyouga | 02c8c55ce3 | fix #2516 | 2024-02-20 20:44:24 +08:00 |
| hiyouga | 7924ffc55d | support llama pro #2338 , add rslora | 2024-02-15 02:27:36 +08:00 |
| hiyouga | 12b2066e34 | fix #2471 | 2024-02-12 21:07:46 +08:00 |
| hiyouga | a754f6e9ec | update data/readme | 2024-02-10 21:04:29 +08:00 |
| hiyouga | a5fb2806cd | update default template | 2024-02-10 16:44:47 +08:00 |
| hiyouga | 7d2dc83c5e | improve aligner | 2024-02-10 16:39:19 +08:00 |
| hoshi-hiyouga | 388b705a8d | Merge pull request #2462 from mnmueller/main: Enable Parsing of SlimOrca | 2024-02-09 22:55:48 +08:00 |
| hiyouga | 54ea9684ed | improve fix tokenizer | 2024-02-09 14:53:14 +08:00 |
| Mark Mueller | 1d3598afa1 | Slim Orca data parsing | 2024-02-08 19:32:20 +01:00 |
| Mark Mueller | 6703d0546d | Slim Orca data parsing | 2024-02-08 17:56:18 +01:00 |
| Mark Mueller | 7f792dfede | Slim Orca data parsing | 2024-02-08 17:54:18 +01:00 |
| Mark Mueller | 8bd4182609 | Slim Orca data parsing | 2024-02-08 17:52:36 +01:00 |
| Mark Mueller | 36d7a75966 | SlimOrca aligner | 2024-02-08 08:28:32 -08:00 |
| hoshi-hiyouga | d0daaa01f9 | Merge pull request #2423 from mayflower/main: Support for german sft and dpo | 2024-02-07 15:58:20 +08:00 |
| hiyouga | 88a1bc9773 | lint | 2024-02-07 01:10:04 +08:00 |
| hiyouga | 85622ae757 | add models | 2024-02-06 14:57:23 +08:00 |
| Johann-Peter Hartmann | 1ecea9de63 | Merge branch 'hiyouga:main' into main | 2024-02-04 13:55:00 +00:00 |
| hiyouga | 3dc86c4af9 | fix #2421 | 2024-02-04 21:02:55 +08:00 |
| Johann-Peter Hartmann | 63da6294dd | Merge branch 'hiyouga:main' into main | 2024-02-04 12:51:25 +00:00 |
| hiyouga | db0ab4d601 | fix reserved label len | 2024-02-04 17:54:26 +08:00 |
| hiyouga | 51df865734 | fix #2397 | 2024-02-03 23:45:31 +08:00 |
| hiyouga | 4ecadc3512 | fix #2376 | 2024-02-03 23:14:31 +08:00 |
| hiyouga | 901faa16cc | support minicpm #2404 | 2024-02-03 22:36:46 +08:00 |
| Johann-Peter Hartmann | b0ffde6e98 | add simple german chatml template chatml_de | 2024-02-03 09:01:15 +01:00 |
| Fallen Angel | 3399c0d645 | fix eos_token_id=0 bug: when eos_token_id=0, will never add eos_token | 2024-02-02 17:34:48 +08:00 |
| hiyouga | b2fb0eca56 | fix #2282 and update tool prompt | 2024-01-22 22:27:30 +08:00 |
| hiyouga | 6fc2d5cc03 | add orion models | 2024-01-22 21:26:53 +08:00 |