hiyouga
|
88f2e99c73
|
add CMMLU, update eval script
Former-commit-id: 4dd9b4d9829249da21c0827fb9a170335e518d93
|
2023-09-23 21:10:17 +08:00 |
|
hiyouga
|
467c30d591
|
move file
Former-commit-id: badd2735b56eca107d40d0068823df78d3629c14
|
2023-09-23 11:52:12 +08:00 |
|
hiyouga
|
5ee1bdecdc
|
add MMLU and C-Eval script
Former-commit-id: 465ee8119aa489a41bee0b01b3c105a2f3dd137f
|
2023-09-23 00:34:17 +08:00 |
|
hiyouga
|
e930682152
|
fix #1000
Former-commit-id: 5cc7a447843c578af602a5e054348fad1c9306ce
|
2023-09-22 15:00:48 +08:00 |
|
hiyouga
|
db21953bf0
|
update readme
Former-commit-id: 044d4425b447c7b67ea5473d3165d9b040040fba
|
2023-09-22 14:34:13 +08:00 |
|
hiyouga
|
d04585df59
|
tiny fix
Former-commit-id: ace3f85a7273fbbc531adfe6ad73bf76a5fff52d
|
2023-09-21 15:25:29 +08:00 |
|
hiyouga
|
65854736c3
|
update readme
Former-commit-id: acda45e4632c0ab87b4f38b13cf2f1c441d45e53
|
2023-09-16 17:33:01 +08:00 |
|
hiyouga
|
1cd0ea1f13
|
add MathInstruct dataset
Former-commit-id: 026af87e7fce091a0cda1afd6df3d6ab6189de9a
|
2023-09-13 22:30:14 +08:00 |
|
hiyouga
|
4e86462bad
|
fix #762 #814
Former-commit-id: d4be857e23c74ed65e06903e19da6f18f15d9e30
|
2023-09-12 16:10:10 +08:00 |
|
hiyouga
|
4410387859
|
Release v0.1.8
Former-commit-id: ccb3553576164113c31be714a0295ea82321d67d
|
2023-09-11 17:31:34 +08:00 |
|
hiyouga
|
cf08bcf3d9
|
truncate readme
Former-commit-id: baac22f4f4c390c8c5d7b7491ff84d096521bc71
|
2023-09-10 21:04:20 +08:00 |
|
hiyouga
|
17bc66fce0
|
update readme
Former-commit-id: 63611de7ae09cd9578fcb9c6408035ec6bfb2cb2
|
2023-09-10 21:01:20 +08:00 |
|
hiyouga
|
7a715aac55
|
update readme
Former-commit-id: 34005252df4b015fd06a229b0be882ed64672cc1
|
2023-09-10 20:52:21 +08:00 |
|
hiyouga
|
8ab5566dc0
|
support FlashAttention2
Former-commit-id: d8aa1404bee9842f3e4cd037ad8d66c85470ac37
|
2023-09-10 20:43:56 +08:00 |
|
hiyouga
|
c818a7ff60
|
support lora target auto find
Former-commit-id: bca1a247bcef51dced59655c8a14c197569367ca
|
2023-09-09 15:38:37 +08:00 |
|
hiyouga
|
c6265e6969
|
fix chatglm2 tokenizer
Former-commit-id: d8d82ca281811c20c89cc03dd00f69735515d6cf
|
2023-09-09 13:50:29 +08:00 |
|
hiyouga
|
f74b980650
|
fix baichuan templates
Former-commit-id: 85b1f6632a752029dabdaed87c58986deb3a6b1d
|
2023-09-07 18:54:14 +08:00 |
|
hiyouga
|
51f662860d
|
update baichuan2 template
Former-commit-id: 0531886e1f534217dc3c9c0775d29fcf77ff7f5f
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
0ba72273d2
|
add Baichuan2 models
Former-commit-id: 60603a94c667fda5066af742ae1394dadce7a784
|
2023-09-06 18:40:11 +08:00 |
|
hiyouga
|
a4fd976048
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
d9b9d9d1fe
|
add ad gen dataset
Former-commit-id: 604f85487b46b3eb01b68cb2cc6535b7cb5527a7
|
2023-08-27 20:35:32 +08:00 |
|
hiyouga
|
802494e20a
|
update template
Former-commit-id: 4318347d3f1982c773dad1074636ec7b550770fd
|
2023-08-22 19:46:09 +08:00 |
|
hiyouga
|
03edfd07e7
|
fix PPO trainer #551 , update readme
Former-commit-id: 90205244186df558cd6b0000728d638348db3a10
|
2023-08-18 11:43:10 +08:00 |
|
hiyouga
|
e93e9641f5
|
update readme
Former-commit-id: e4eec9ddfd3a9688733e018a96274dff0d5d9962
|
2023-08-18 01:51:55 +08:00 |
|
hiyouga
|
fceca0bb6a
|
update training resuming
Former-commit-id: 58f13e22da18babed0d2d4348474e07745da8fa5
|
2023-08-18 01:41:17 +08:00 |
|
hiyouga
|
327e14d3ea
|
update readme
Former-commit-id: ff0aa793b6750830b3865c439ef64ed129ec9406
|
2023-08-17 11:00:22 +08:00 |
|
hiyouga
|
6c9b035c0e
|
web UI integrating RLHF
Former-commit-id: ec94274ca155300aee27621c018dd1bbaf78194b
|
2023-08-14 10:48:47 +08:00 |
|
hiyouga
|
2bcf0025d6
|
update readme
Former-commit-id: 8a79ded55d6e696368c96a6d9958e7c8cdaf977b
|
2023-08-12 21:29:06 +08:00 |
|
hiyouga
|
8686e62dfa
|
update readme
Former-commit-id: 2618e0b5a7ad88f68971f21d0e7eb4560866400f
|
2023-08-12 21:23:05 +08:00 |
|
hiyouga
|
ba65dcb15e
|
update readme
Former-commit-id: 1836c020c514e7a94aaa48abdf19ea8accbc1a2a
|
2023-08-12 21:00:11 +08:00 |
|
hiyouga
|
79f4ba0d26
|
Release v0.1.6
Former-commit-id: a48cb0d474ef0648a97387daf5f623498b5e3ee6
|
2023-08-11 23:25:57 +08:00 |
|
hiyouga
|
abdfa26d06
|
support DPO training (2305.18290)
Former-commit-id: 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
733b395822
|
update readme
Former-commit-id: 20cf27976f24db2667955a8007e0ce2baa35fc82
|
2023-08-07 15:02:02 +08:00 |
|
codemayq
|
ee0f66aaef
|
add detailed model configs
Former-commit-id: 293bd95712330eb354220abf79384b8c594608ee
|
2023-08-07 09:30:23 +08:00 |
|
hiyouga
|
9c84c4ed5d
|
support Qwen-7B, fix InternLM-7B inference
Former-commit-id: 87f8f830e20aa839e089559c1d038954742000ef
|
2023-08-03 15:53:32 +08:00 |
|
hiyouga
|
534e3320b5
|
release v0.1.5
Former-commit-id: c689857bbb82ecaa317bfc22d831c7025fe39cc7
|
2023-08-02 16:10:31 +08:00 |
|
hiyouga
|
40d277ae5e
|
update readme
Former-commit-id: ccde51c5ea6fc14dbbe627abb63c4631ad08cc9d
|
2023-08-01 18:48:27 +08:00 |
|
hiyouga
|
c5ad96375e
|
fix RM save model
Former-commit-id: ac88ce5233248dbf1c7943c5f1197e40ba52fde9
|
2023-08-01 11:56:17 +08:00 |
|
hiyouga
|
aa4335eac7
|
release v0.1.4
Former-commit-id: 973a6386657885c7d11ecc8746ebd8804b6b355d
|
2023-08-01 10:08:47 +08:00 |
|
hiyouga
|
a437424381
|
update readme
Former-commit-id: 62dca5bb820b8e75f3e24294d578322b97303b5f
|
2023-07-31 23:42:32 +08:00 |
|
hiyouga
|
e80b75b560
|
support streaming data, fix #284 #274 #268
Former-commit-id: 0411a4b3e122e7907441bc7a64b004948741a620
|
2023-07-31 23:33:00 +08:00 |
|
hiyouga
|
f65f0745cc
|
update readme
Former-commit-id: 5ee87138e46c4aab6218c37f255419a85b5a4692
|
2023-07-28 17:36:00 +08:00 |
|
hiyouga
|
ba911f988d
|
update dataset
Former-commit-id: f5c2ccdde45bfa5648443a901b2ac397d532eceb
|
2023-07-26 17:05:12 +08:00 |
|
hiyouga
|
c52dd3e86f
|
fix #242
Former-commit-id: 00efa8a07fe5a69bac545675696b2a19b7b811ed
|
2023-07-25 17:04:02 +08:00 |
|
hiyouga
|
d46c136c0e
|
update dataset
Former-commit-id: 182b42504399d2755897b9737db1d36655a0fa50
|
2023-07-23 20:01:43 +08:00 |
|
hiyouga
|
261ca840d0
|
update readme, fix web ui postprocess
Former-commit-id: 035c966d5c1a2c7b9e9cba8ad06182a6672eabd4
|
2023-07-22 14:29:22 +08:00 |
|
mrhan1993
|
cdd887908c
|
根据GLM Efficient Tuning添加中文README,web添加了server_port
Former-commit-id: 9f0b57b3701fa73f719cd5a319b1584454481bbb
|
2023-07-21 16:57:58 +08:00 |
|
hiyouga
|
6552b74005
|
Update README.md
Former-commit-id: c3fcb674865cf50c80ffeb48aeb2b01a7c9aa252
|
2023-07-20 17:23:16 +08:00 |
|
hiyouga
|
fa47c99fa9
|
add datasets
Former-commit-id: 7159bc54ed0f1bba974662a87ba5039d9aacadee
|
2023-07-19 20:59:15 +08:00 |
|
hiyouga
|
f7f2accf05
|
support LLaMA-2
Former-commit-id: 7a3ade8c699ff1cd2d17590e2f8df79e1738cee2
|
2023-07-19 16:42:14 +08:00 |
|