111 Commits

Author SHA1 Message Date
hiyouga
38755bced7 add template, modify datasets
Former-commit-id: 386f590209e466b51c17a7ac8cee55fc3ce928d7
2023-11-09 15:53:23 +08:00
hiyouga
1f2c56bff9 delete file
Former-commit-id: 479d0af2dc4ab8282b9d55aba1b03ab3a54f400b
2023-11-07 16:20:12 +08:00
hiyouga
3d40bdb600 upgrade peft, fix #1088 #1411
Former-commit-id: b2a60905f384ada92618bf21301fe96dac1c10bf
2023-11-07 16:13:36 +08:00
hiyouga
a919b6a478 update templates
Former-commit-id: a7eeb8e17c2f23f16732f5a5d767b39bcc1ac517
2023-11-06 12:25:47 +08:00
hiyouga
034b658348 fix deepseek template
Former-commit-id: d08f5e8a147f1929567d42b6bed8bc998c2a866d
2023-11-05 13:08:46 +08:00
hiyouga
04107b7af6 support deepseek coder #1378
Former-commit-id: 2a8a25819524e84a5e6e907923c47693f8b7a48d
2023-11-05 12:51:03 +08:00
hiyouga
6493f6d2e9 fix #1316
Former-commit-id: f4e4a04529a60b4c4ccc66cbf67f6e951fbc68d3
2023-10-31 11:32:08 +08:00
hiyouga
d48478ef88 update constants
Former-commit-id: f28a034a9b74630a56314446bd0f103c086bda60
2023-10-29 13:30:20 +08:00
hiyouga
bf0faf129d fix vicuna template
Former-commit-id: 52fc24d1664bc701f43e2bff8b3faded795b929c
2023-10-27 22:15:25 +08:00
hiyouga
5705c82cd8 fix chatglm3 template
Former-commit-id: 4117f388279ca43eb46def195c21e7051aefd0c7
2023-10-27 21:12:06 +08:00
hiyouga
8a76b1e499 support chatglm3
Former-commit-id: 1c0ab9a908dedf0ad69ad5741a23465da02006d9
2023-10-27 19:16:28 +08:00
hiyouga
d18c708f14 fix openchat template
Former-commit-id: 8fdff07e1f056afa5fe39fe794c6af030fc5f225
2023-10-21 01:25:42 +08:00
hiyouga
95697652f1 fix #1232
Former-commit-id: b665e9e133bf2f6f10346c374eb0de8a96dd5c7e
2023-10-20 23:28:52 +08:00
hiyouga
0503d45782 fix eval resuming in webui
Former-commit-id: 273745f9b9d117d4053afc1746108af95b0a51a4
2023-10-15 15:45:38 +08:00
hiyouga
99592478c9 tiny fix
Former-commit-id: 3ad8c92ecabaf2c169e53a8485687b4d04a772e7
2023-10-15 05:02:48 +08:00
hiyouga
4f9ca28e11 fix callback
Former-commit-id: 1e9401744cadecdef043b6f744b2616a74c64bca
2023-10-15 04:59:44 +08:00
hiyouga
3ae6229140 implement webui resuming training
Former-commit-id: accde3cd39ec7b09d96cf1865f8f51850693f5ce
2023-10-15 04:52:19 +08:00
hiyouga
c9d1cd108d refactor model_dtype, fix PPO trainer
Former-commit-id: 2818af0b0967d7695f27658acac0b7e2c2728e5d
2023-10-11 23:16:01 +08:00
hiyouga
141937ead6 fix aquila template, repair sft packing mechanism
Former-commit-id: be420e417920211b68f5b86a5ef5426aeaa62bb0
2023-10-10 18:49:55 +08:00
hiyouga
180fd06e61 fix flash shift short attention
Former-commit-id: 0a356bc897690262190a8112e8ace37d349daee1
2023-10-09 17:54:48 +08:00
hiyouga
b6e81a0307 fix shift short attention
Former-commit-id: ab65c3063b31b9e6a1aeb62c57224c1296ccdadd
2023-10-09 17:07:46 +08:00
hiyouga
d338ab3e19 fix #1068 #1074
Former-commit-id: d11a5454633be9f0600cbd1ab7a26c9c8fa5ed80
2023-09-28 14:39:16 +08:00
hiyouga
f61a000e73 tiny fix
Former-commit-id: 5d4118b09639ea4ee46d3d750cdd542c30555a03
2023-09-28 01:03:04 +08:00
hiyouga
8a8ba08bf7 tiny fix
Former-commit-id: d2ebd225dbb922adec99c1eb774c16f5cb973d2c
2023-09-28 01:02:11 +08:00
hiyouga
755e3e49b4 fix #1064
Former-commit-id: c90223639790152fadd100cedb5f63d375d9c195
2023-09-28 00:53:29 +08:00
hiyouga
deb17942ab fix layer norm dtype
Former-commit-id: 84b7486885c600e5e65c5ba9095d56ecc2502977
2023-09-28 00:25:55 +08:00
hiyouga
108c31e1fc support LongLoRA
Former-commit-id: 90375f600d5601866836123597fa3ef52008eeef
2023-09-27 21:55:50 +08:00
hiyouga
5ee1bdecdc add MMLU and C-Eval script
Former-commit-id: 465ee8119aa489a41bee0b01b3c105a2f3dd137f
2023-09-23 00:34:17 +08:00
hiyouga
48e7b600a8 fix error info
Former-commit-id: 7e8655c8b59c3fdc455e304cf875a6b7fcb69290
2023-09-19 18:30:23 +08:00
hiyouga
4e86462bad fix #762 #814
Former-commit-id: d4be857e23c74ed65e06903e19da6f18f15d9e30
2023-09-12 16:10:10 +08:00
hiyouga
8ac7ec0b48 tiny fix
Former-commit-id: 3b306478d4ccbf037ae1acc122f6dca11c718731
2023-09-11 18:27:08 +08:00
hiyouga
33bab0e7c1 update flashattn, fix ppo save model
Former-commit-id: 0fbece85a70222e5262a2295203de07ffe648fda
2023-09-11 17:25:36 +08:00
hiyouga
6a71361a54 remove PeftTrainer
Former-commit-id: b218c271edfb07006ddc34b1aca404088de6c528
2023-09-10 22:23:23 +08:00
hiyouga
8ab5566dc0 support FlashAttention2
Former-commit-id: d8aa1404bee9842f3e4cd037ad8d66c85470ac37
2023-09-10 20:43:56 +08:00
hiyouga
f865d0bd51 fix lora target
Former-commit-id: a51b7c98acc599de5ed2eaeeebe7b184105722c5
2023-09-09 17:04:45 +08:00
hiyouga
c818a7ff60 support lora target auto find
Former-commit-id: bca1a247bcef51dced59655c8a14c197569367ca
2023-09-09 15:38:37 +08:00
hiyouga
9ed4bb63d4 change to right-padding, update reward score #803
Former-commit-id: 8ea32e4046d75ddfa9517669e9de9f48fea720c6
2023-09-08 20:04:31 +08:00
hiyouga
62941919e8 fix chatglm template
Former-commit-id: 8aaaa132d49b4c758256a6159270cea4351f946c
2023-09-08 14:45:58 +08:00
hiyouga
f74b980650 fix baichuan templates
Former-commit-id: 85b1f6632a752029dabdaed87c58986deb3a6b1d
2023-09-07 18:54:14 +08:00
hiyouga
51f662860d update baichuan2 template
Former-commit-id: 0531886e1f534217dc3c9c0775d29fcf77ff7f5f
2023-09-06 21:43:06 +08:00
hiyouga
f9aee17f9d add Baichuan2 models
Former-commit-id: 62ce65c6282d2bbcb765354acc2819cc3e983a46
2023-09-06 18:36:04 +08:00
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
2023-09-01 19:00:45 +08:00
codemayq
ea74e5a81b update llama2 template
Former-commit-id: 0bcc489c42fc41b2c7ee51cced2dc995256a02d8
2023-08-30 16:23:56 +08:00
codemayq
4b29d9d2b0 add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: c0e4d1e81b41c9a36291d8bee46d7d807c898c21
2023-08-23 18:54:23 +08:00
hiyouga
802494e20a update template
Former-commit-id: 4318347d3f1982c773dad1074636ec7b550770fd
2023-08-22 19:46:09 +08:00
hiyouga
e6f4eab4ab fix #608
Former-commit-id: 02d69b6fdefa6b303b84fb8195a159006fe3f50a
2023-08-21 17:49:36 +08:00
hiyouga
d3bef03dc6 fix baichuan template for training #597 #616
Former-commit-id: 0a3f6984259526775b0efdb8a1b0b24f564a7239
2023-08-21 17:41:51 +08:00
hiyouga
caf4a61e21 fix ChatGLM2 ppo #527 #528
Former-commit-id: 9f4c2adc9a9ca8e458d3868805e077182e0d336a
2023-08-18 00:34:59 +08:00
hiyouga
623a34b16f fix generation bug #532
Former-commit-id: be21fc83f9aed0af1e5a2f83f5d5eeb36f1d283c
2023-08-17 22:21:34 +08:00
hiyouga
3021a01b71 fix baichuan and intern template
Former-commit-id: 892fd39373b816cf079e0decc9cb57dfb5565242
2023-08-17 01:27:20 +08:00