| Author | Commit | Message | Former-commit-id | Date |
|---|---|---|---|---|
| hoshi-hiyouga | d72f123851 | Merge pull request #1553 from hannlp/hans: Change the default argument settings for PPO training | 48211e3799a16de946360930d3d92f5a40e9d12d | 2023-11-20 20:32:55 +08:00 |
| hiyouga | 6d8d8509da | update benchmark | a2019c8b61753e0a845dc2b3d3b45ae43c0d492f | 2023-11-18 11:30:01 +08:00 |
| hiyouga | 60fe410f05 | update readme | 90212280d6cc168e5421fd3201a69d2c6ae01e29 | 2023-11-18 11:15:56 +08:00 |
| hiyouga | 3dc7c0a88c | add benchmark | 329134f58c45746702ec92a5bd76b89334cbc4b6 | 2023-11-18 11:09:52 +08:00 |
| Yuchen Han | b92259ae02 | Update README_zh.md | 7cab47b822ddf7f94d2ccfbe2ba4836e35f104f4 | 2023-11-17 00:18:07 -08:00 |
| hiyouga | 763a9305d0 | update readme | 72e6699547d4fd0a20f61225f7a4b2c054ec035b | 2023-11-16 15:58:37 +08:00 |
| hiyouga | f441932bd1 | support full-parameter PPO | ce783036001397a20b0b4c5da2fea6d0c03389d2 | 2023-11-16 02:08:04 +08:00 |
| hiyouga | 0c1fab84f1 | add demo mode for web UI | 8350bcf85d5e59b63da46b540c6ad860e8419d9e | 2023-11-15 23:51:26 +08:00 |
| hiyouga | 3e0b76650a | update readme and constants | 1e19cf242a1f843b590feefbe24b2cc0a17712b5 | 2023-11-15 18:04:37 +08:00 |
| hiyouga | 42bb8b6400 | fix dc link | 88ab33254e02dce7437786fd2d153411118c4594 | 2023-11-13 23:22:56 +08:00 |
| hiyouga | 125587b187 | refactor evaluation, upgrade trl to 074 | 442aefb925c4ff02b98aa30c49c2e01d04f6496a | 2023-11-13 22:20:35 +08:00 |
| hiyouga | 0fbaa42752 | refactor constants | 3697a3dc9a0be8141951dfe65812844f66059517 | 2023-11-10 14:16:10 +08:00 |
| hiyouga | 164559d01d | update readme | b3572659f562756375f879b0a97e42d4e2129f54 | 2023-11-09 16:00:24 +08:00 |
| hiyouga | 2627f95ac3 | update readme (list in alphabetical order) | e1e04cb1f1a0d37cac47c228bd3d5bbcda912aff | 2023-11-06 17:18:12 +08:00 |
| hiyouga | a919b6a478 | update templates | a7eeb8e17c2f23f16732f5a5d767b39bcc1ac517 | 2023-11-06 12:25:47 +08:00 |
| hiyouga | a9db89a025 | update data readme (zh) | cc8ffa10d877f5893f3940204e5bec6f3266559f | 2023-11-02 23:42:49 +08:00 |
| hiyouga | a1b0655457 | support sharegpt format, add datasets | a8371724130db2fbd7273a480e2acb251e382aec | 2023-11-02 23:10:04 +08:00 |
| hiyouga | b3931f73cc | update projects | 640a5201089f2ed5c6721ee29ee8b7a9be95847b | 2023-10-29 22:53:47 +08:00 |
| hiyouga | dc8739066b | add projects | 59f342e76f9d1ad04cfa7979b98d1eb9a3cebc99 | 2023-10-29 22:07:13 +08:00 |
| hiyouga | bf0faf129d | fix vicuna template | 52fc24d1664bc701f43e2bff8b3faded795b929c | 2023-10-27 22:15:25 +08:00 |
| hiyouga | 8babfe3b4c | update readme | 4600c29e932b3c46561c2e8727e4cbdf270a72bb | 2023-10-27 19:19:03 +08:00 |
| hiyouga | 8a76b1e499 | support chatglm3 | 1c0ab9a908dedf0ad69ad5741a23465da02006d9 | 2023-10-27 19:16:28 +08:00 |
| hiyouga | d6c77d9196 | reimplement neftune | 7b4acf7265b04cc4a674b3dcafdb90e76f149e39 | 2023-10-22 16:15:08 +08:00 |
| anvie | 3635823fbe | add NEFTune optimization | 57fb40aa04fec11ca165a97ea463579faeaeebe7 | 2023-10-21 13:24:10 +07:00 |
| hiyouga | 95697652f1 | fix #1232 | b665e9e133bf2f6f10346c374eb0de8a96dd5c7e | 2023-10-20 23:28:52 +08:00 |
| hiyouga | f6a8069159 | fix #1217 | 6496a99b7d9a5aa57bfc8b935b274f1481580864 | 2023-10-19 15:52:24 +08:00 |
| hoshi-hiyouga | eae703c121 | Update README_zh.md | 5f83a6e72c095dfd3c840661fee5b6c7030be593 | 2023-10-16 00:28:27 +08:00 |
| hiyouga | fca1e451b5 | update readme | f5d0da4d2ac995c85ab891b103503bfa8a80f6db | 2023-10-15 20:28:14 +08:00 |
| hiyouga | 64bf750a74 | update readme | cb426766944487c72c10ca6e59ffb9888ca8b1e2 | 2023-10-13 13:53:43 +08:00 |
| hiyouga | e2d412da02 | update discord link | c4102f306a4c6c0e0397adce6001bbbf3f09314c | 2023-10-12 21:44:28 +08:00 |
| hiyouga | 819b5faa9a | rename repository | 197c754d731d495330f33bbf962f8bbc7a10c0cc | 2023-10-12 21:42:29 +08:00 |
| hiyouga | 3ba788fc2c | update readme | 8e2ed6b8ceb710c50cd73f8047ff4be305674567 | 2023-10-09 20:02:50 +08:00 |
| hiyouga | d338ab3e19 | fix #1068 #1074 | d11a5454633be9f0600cbd1ab7a26c9c8fa5ed80 | 2023-09-28 14:39:16 +08:00 |
| hiyouga | 650a2a2e01 | update readme | 4eae06146436a21fa63239c3687882b1e9e8ba13 | 2023-09-27 21:57:47 +08:00 |
| hiyouga | 108c31e1fc | support LongLoRA | 90375f600d5601866836123597fa3ef52008eeef | 2023-09-27 21:55:50 +08:00 |
| hiyouga | 88f2e99c73 | add CMMLU, update eval script | 4dd9b4d9829249da21c0827fb9a170335e518d93 | 2023-09-23 21:10:17 +08:00 |
| hiyouga | 467c30d591 | move file | badd2735b56eca107d40d0068823df78d3629c14 | 2023-09-23 11:52:12 +08:00 |
| hiyouga | 5ee1bdecdc | add MMLU and C-Eval script | 465ee8119aa489a41bee0b01b3c105a2f3dd137f | 2023-09-23 00:34:17 +08:00 |
| hiyouga | e930682152 | fix #1000 | 5cc7a447843c578af602a5e054348fad1c9306ce | 2023-09-22 15:00:48 +08:00 |
| hiyouga | db21953bf0 | update readme | 044d4425b447c7b67ea5473d3165d9b040040fba | 2023-09-22 14:34:13 +08:00 |
| hiyouga | d04585df59 | tiny fix | ace3f85a7273fbbc531adfe6ad73bf76a5fff52d | 2023-09-21 15:25:29 +08:00 |
| hiyouga | 65854736c3 | update readme | acda45e4632c0ab87b4f38b13cf2f1c441d45e53 | 2023-09-16 17:33:01 +08:00 |
| hiyouga | 1cd0ea1f13 | add MathInstruct dataset | 026af87e7fce091a0cda1afd6df3d6ab6189de9a | 2023-09-13 22:30:14 +08:00 |
| hiyouga | 4e86462bad | fix #762 #814 | d4be857e23c74ed65e06903e19da6f18f15d9e30 | 2023-09-12 16:10:10 +08:00 |
| hiyouga | 4410387859 | Release v0.1.8 | ccb3553576164113c31be714a0295ea82321d67d | 2023-09-11 17:31:34 +08:00 |
| hiyouga | cf08bcf3d9 | truncate readme | baac22f4f4c390c8c5d7b7491ff84d096521bc71 | 2023-09-10 21:04:20 +08:00 |
| hiyouga | 17bc66fce0 | update readme | 63611de7ae09cd9578fcb9c6408035ec6bfb2cb2 | 2023-09-10 21:01:20 +08:00 |
| hiyouga | 7a715aac55 | update readme | 34005252df4b015fd06a229b0be882ed64672cc1 | 2023-09-10 20:52:21 +08:00 |
| hiyouga | 8ab5566dc0 | support FlashAttention2 | d8aa1404bee9842f3e4cd037ad8d66c85470ac37 | 2023-09-10 20:43:56 +08:00 |
| hiyouga | c818a7ff60 | support lora target auto find | bca1a247bcef51dced59655c8a14c197569367ca | 2023-09-09 15:38:37 +08:00 |