484 Commits

Author SHA1 Message Date
hoshi-hiyouga
d72f123851 Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training

Former-commit-id: 48211e3799a16de946360930d3d92f5a40e9d12d
2023-11-20 20:32:55 +08:00
hiyouga
6d8d8509da update benchmark
Former-commit-id: a2019c8b61753e0a845dc2b3d3b45ae43c0d492f
2023-11-18 11:30:01 +08:00
hiyouga
60fe410f05 update readme
Former-commit-id: 90212280d6cc168e5421fd3201a69d2c6ae01e29
2023-11-18 11:15:56 +08:00
hiyouga
3dc7c0a88c add benchmark
Former-commit-id: 329134f58c45746702ec92a5bd76b89334cbc4b6
2023-11-18 11:09:52 +08:00
Yuchen Han
b92259ae02 Update README_zh.md
Former-commit-id: 7cab47b822ddf7f94d2ccfbe2ba4836e35f104f4
2023-11-17 00:18:07 -08:00
hiyouga
763a9305d0 update readme
Former-commit-id: 72e6699547d4fd0a20f61225f7a4b2c054ec035b
2023-11-16 15:58:37 +08:00
hiyouga
f441932bd1 support full-parameter PPO
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
2023-11-16 02:08:04 +08:00
hiyouga
0c1fab84f1 add demo mode for web UI
Former-commit-id: 8350bcf85d5e59b63da46b540c6ad860e8419d9e
2023-11-15 23:51:26 +08:00
hiyouga
3e0b76650a update readme and constants
Former-commit-id: 1e19cf242a1f843b590feefbe24b2cc0a17712b5
2023-11-15 18:04:37 +08:00
hiyouga
42bb8b6400 fix dc link
Former-commit-id: 88ab33254e02dce7437786fd2d153411118c4594
2023-11-13 23:22:56 +08:00
hiyouga
125587b187 refactor evaluation, upgrade trl to 074
Former-commit-id: 442aefb925c4ff02b98aa30c49c2e01d04f6496a
2023-11-13 22:20:35 +08:00
hiyouga
0fbaa42752 refactor constants
Former-commit-id: 3697a3dc9a0be8141951dfe65812844f66059517
2023-11-10 14:16:10 +08:00
hiyouga
164559d01d update readme
Former-commit-id: b3572659f562756375f879b0a97e42d4e2129f54
2023-11-09 16:00:24 +08:00
hiyouga
2627f95ac3 update readme (list in alphabetical order)
Former-commit-id: e1e04cb1f1a0d37cac47c228bd3d5bbcda912aff
2023-11-06 17:18:12 +08:00
hiyouga
a919b6a478 update templates
Former-commit-id: a7eeb8e17c2f23f16732f5a5d767b39bcc1ac517
2023-11-06 12:25:47 +08:00
hiyouga
a9db89a025 update data readme (zh)
Former-commit-id: cc8ffa10d877f5893f3940204e5bec6f3266559f
2023-11-02 23:42:49 +08:00
hiyouga
a1b0655457 support sharegpt format, add datasets
Former-commit-id: a8371724130db2fbd7273a480e2acb251e382aec
2023-11-02 23:10:04 +08:00
hiyouga
b3931f73cc update projects
Former-commit-id: 640a5201089f2ed5c6721ee29ee8b7a9be95847b
2023-10-29 22:53:47 +08:00
hiyouga
dc8739066b add projects
Former-commit-id: 59f342e76f9d1ad04cfa7979b98d1eb9a3cebc99
2023-10-29 22:07:13 +08:00
hiyouga
bf0faf129d fix vicuna template
Former-commit-id: 52fc24d1664bc701f43e2bff8b3faded795b929c
2023-10-27 22:15:25 +08:00
hiyouga
8babfe3b4c update readme
Former-commit-id: 4600c29e932b3c46561c2e8727e4cbdf270a72bb
2023-10-27 19:19:03 +08:00
hiyouga
8a76b1e499 support chatglm3
Former-commit-id: 1c0ab9a908dedf0ad69ad5741a23465da02006d9
2023-10-27 19:16:28 +08:00
hiyouga
d6c77d9196 reimplement neftune
Former-commit-id: 7b4acf7265b04cc4a674b3dcafdb90e76f149e39
2023-10-22 16:15:08 +08:00
anvie
3635823fbe add NEFTune optimization
Former-commit-id: 57fb40aa04fec11ca165a97ea463579faeaeebe7
2023-10-21 13:24:10 +07:00
hiyouga
95697652f1 fix #1232
Former-commit-id: b665e9e133bf2f6f10346c374eb0de8a96dd5c7e
2023-10-20 23:28:52 +08:00
hiyouga
f6a8069159 fix #1217
Former-commit-id: 6496a99b7d9a5aa57bfc8b935b274f1481580864
2023-10-19 15:52:24 +08:00
hoshi-hiyouga
eae703c121 Update README_zh.md
Former-commit-id: 5f83a6e72c095dfd3c840661fee5b6c7030be593
2023-10-16 00:28:27 +08:00
hiyouga
fca1e451b5 update readme
Former-commit-id: f5d0da4d2ac995c85ab891b103503bfa8a80f6db
2023-10-15 20:28:14 +08:00
hiyouga
64bf750a74 update readme
Former-commit-id: cb426766944487c72c10ca6e59ffb9888ca8b1e2
2023-10-13 13:53:43 +08:00
hiyouga
e2d412da02 update discord link
Former-commit-id: c4102f306a4c6c0e0397adce6001bbbf3f09314c
2023-10-12 21:44:28 +08:00
hiyouga
819b5faa9a rename repository
Former-commit-id: 197c754d731d495330f33bbf962f8bbc7a10c0cc
2023-10-12 21:42:29 +08:00
hiyouga
3ba788fc2c update readme
Former-commit-id: 8e2ed6b8ceb710c50cd73f8047ff4be305674567
2023-10-09 20:02:50 +08:00
hiyouga
d338ab3e19 fix #1068 #1074
Former-commit-id: d11a5454633be9f0600cbd1ab7a26c9c8fa5ed80
2023-09-28 14:39:16 +08:00
hiyouga
650a2a2e01 update readme
Former-commit-id: 4eae06146436a21fa63239c3687882b1e9e8ba13
2023-09-27 21:57:47 +08:00
hiyouga
108c31e1fc support LongLoRA
Former-commit-id: 90375f600d5601866836123597fa3ef52008eeef
2023-09-27 21:55:50 +08:00
hiyouga
88f2e99c73 add CMMLU, update eval script
Former-commit-id: 4dd9b4d9829249da21c0827fb9a170335e518d93
2023-09-23 21:10:17 +08:00
hiyouga
467c30d591 move file
Former-commit-id: badd2735b56eca107d40d0068823df78d3629c14
2023-09-23 11:52:12 +08:00
hiyouga
5ee1bdecdc add MMLU and C-Eval script
Former-commit-id: 465ee8119aa489a41bee0b01b3c105a2f3dd137f
2023-09-23 00:34:17 +08:00
hiyouga
e930682152 fix #1000
Former-commit-id: 5cc7a447843c578af602a5e054348fad1c9306ce
2023-09-22 15:00:48 +08:00
hiyouga
db21953bf0 update readme
Former-commit-id: 044d4425b447c7b67ea5473d3165d9b040040fba
2023-09-22 14:34:13 +08:00
hiyouga
d04585df59 tiny fix
Former-commit-id: ace3f85a7273fbbc531adfe6ad73bf76a5fff52d
2023-09-21 15:25:29 +08:00
hiyouga
65854736c3 update readme
Former-commit-id: acda45e4632c0ab87b4f38b13cf2f1c441d45e53
2023-09-16 17:33:01 +08:00
hiyouga
1cd0ea1f13 add MathInstruct dataset
Former-commit-id: 026af87e7fce091a0cda1afd6df3d6ab6189de9a
2023-09-13 22:30:14 +08:00
hiyouga
4e86462bad fix #762 #814
Former-commit-id: d4be857e23c74ed65e06903e19da6f18f15d9e30
2023-09-12 16:10:10 +08:00
hiyouga
4410387859 Release v0.1.8
Former-commit-id: ccb3553576164113c31be714a0295ea82321d67d
2023-09-11 17:31:34 +08:00
hiyouga
cf08bcf3d9 truncate readme
Former-commit-id: baac22f4f4c390c8c5d7b7491ff84d096521bc71
2023-09-10 21:04:20 +08:00
hiyouga
17bc66fce0 update readme
Former-commit-id: 63611de7ae09cd9578fcb9c6408035ec6bfb2cb2
2023-09-10 21:01:20 +08:00
hiyouga
7a715aac55 update readme
Former-commit-id: 34005252df4b015fd06a229b0be882ed64672cc1
2023-09-10 20:52:21 +08:00
hiyouga
8ab5566dc0 support FlashAttention2
Former-commit-id: d8aa1404bee9842f3e4cd037ad8d66c85470ac37
2023-09-10 20:43:56 +08:00
hiyouga
c818a7ff60 support lora target auto find
Former-commit-id: bca1a247bcef51dced59655c8a14c197569367ca
2023-09-09 15:38:37 +08:00