hiyouga
|
42e0b30476
|
update flashattn, fix ppo save model
Former-commit-id: 0b08bc3dac246d4aa3f89afb7172529dcad9c39f
|
2023-09-11 17:25:36 +08:00 |
|
hiyouga
|
a09a7b650d
|
remove PeftTrainer
Former-commit-id: cc0cff3e991f194732d278e627648e528118a719
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
332d7bbd56
|
truncate readme
Former-commit-id: fed5d0cc87e4a5a023f2edae622f2820bded1509
|
2023-09-10 21:04:20 +08:00 |
|
hiyouga
|
d3b6fece71
|
update readme
Former-commit-id: c42fe77fec2918fe8811d48ec88e9a7c1e6f07ab
|
2023-09-10 21:01:20 +08:00 |
|
hiyouga
|
9d963b82de
|
update readme
Former-commit-id: b4109cfe548e091cd20fa84815dce5ff3974a090
|
2023-09-10 20:52:21 +08:00 |
|
hiyouga
|
a402161631
|
support FlashAttention2
Former-commit-id: 23e56c5554b948d4f08ad87849b261eafd2c7890
|
2023-09-10 20:43:56 +08:00 |
|
hiyouga
|
b481ad58e6
|
fix #850
Former-commit-id: e5975c4c6b8bd47ec506b0d4a4703bee05495436
|
2023-09-10 14:22:03 +08:00 |
|
hiyouga
|
f91c5f2638
|
fix lora target
Former-commit-id: d822e41e7ac7e310ee49e347fc45754284ce30b8
|
2023-09-09 17:04:45 +08:00 |
|
hiyouga
|
7143c551ab
|
support lora target auto find
Former-commit-id: bce9984733d88bf013847eed523d1c75fdf0995e
|
2023-09-09 15:38:37 +08:00 |
|
hiyouga
|
50e93392dd
|
fix chatglm2 tokenizer
Former-commit-id: 1ab60b4a93fa1be5dfe6ffbd4deb64c0f9d9b431
|
2023-09-09 13:50:29 +08:00 |
|
hiyouga
|
9f83e93839
|
add baichuan2 convert script
Former-commit-id: 4d676e0ea9e59c1be13ecb47734917ba78938ac8
|
2023-09-08 22:59:41 +08:00 |
|
hiyouga
|
692b132dbf
|
fix bug in DPO data collator
Former-commit-id: 4fc262cdf1347691e253bdfbd96568db5a49c086
|
2023-09-08 20:45:07 +08:00 |
|
hiyouga
|
e70b3e8947
|
fix #761
Former-commit-id: be76f6cbe5143f781b6b39603b80392253b3080a
|
2023-09-08 20:22:18 +08:00 |
|
hiyouga
|
612d97db6f
|
change to right-padding, update reward score #803
Former-commit-id: baa90415bc8f5ebd423d001378b51c3a3a6c2ec7
|
2023-09-08 20:04:31 +08:00 |
|
hiyouga
|
bb1b67c076
|
fix chatglm template
Former-commit-id: 69a824628b4d6a56a680a7e713b217877c6c15c5
|
2023-09-08 14:45:58 +08:00 |
|
hiyouga
|
5a75c31caa
|
update requirements
Former-commit-id: d796a4a5709c390629bafbeb7c91fccf6a9076d0
|
2023-09-07 19:26:25 +08:00 |
|
hiyouga
|
8b9210286b
|
fix #818
Former-commit-id: e81fd458c279ed2f3cee780e517482b425c8886d
|
2023-09-07 19:19:53 +08:00 |
|
hiyouga
|
b5acec34f7
|
add deepspeed check in PPO training
Former-commit-id: e203ec7f71f504ccbaa89c27d20b8a0d9fa53f7e
|
2023-09-07 19:12:40 +08:00 |
|
hiyouga
|
86d835878c
|
fix #809
Former-commit-id: 2783ca75365d7c373cefba039788a48f0b8f35fc
|
2023-09-07 19:04:32 +08:00 |
|
hiyouga
|
eae7b331d3
|
fix baichuan templates
Former-commit-id: f48a49e835b32f3991cfad8874c7b9c78953809f
|
2023-09-07 18:54:14 +08:00 |
|
hiyouga
|
ed89e29bcc
|
update baichuan2 template
Former-commit-id: 16d9f8ba176443c5b397233da621600d6e1e1eec
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
c2b1886aff
|
add Baichuan2 models
Former-commit-id: 90b3f02c44c0b8cc1b59f37af3a1ec28874a8a61
|
2023-09-06 18:40:11 +08:00 |
|
hiyouga
|
218f36bca5
|
add Baichuan2 models
Former-commit-id: 36960025e9274b574f57e7a7bf453cd96956e922
|
2023-09-06 18:36:04 +08:00 |
|
hoshi-hiyouga
|
b91fc1f5b3
|
Merge pull request #786 from kinghuin/patch-1
fix utils.py bug
Former-commit-id: 26aad616340748e1594a60119ca9434908bf7465
|
2023-09-05 10:49:34 +08:00 |
|
Q
|
2a22bf9c15
|
fix utils.py bug
Former-commit-id: dc490117d50c3cbc070b804bac89400f4290272f
|
2023-09-05 10:38:01 +08:00 |
|
hiyouga
|
62e2037125
|
fix #763
Former-commit-id: e424b928a35097b783af879a2290f59b2158801d
|
2023-09-01 23:13:05 +08:00 |
|
hiyouga
|
e5b72c6a77
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: 0feec9a830b917b36686b61938a66e842eccf930
|
2023-09-01 19:00:45 +08:00 |
|
codingma
|
93be211f80
|
Merge pull request #741 from hiyouga/feature-addDatasetCheck
Feature add dataset check
Former-commit-id: 4b6dabe73d2c7edc94cd495390577c8bcf88428b
|
2023-08-31 20:57:36 +08:00 |
|
codemayq
|
9ae3fb4ced
|
update llama2 template
Former-commit-id: 01de1d51d9fa5a22a338b6ed18ffad4d0ad5e3e8
|
2023-08-30 16:23:56 +08:00 |
|
codemayq
|
f641075789
|
add dataset stage check
Former-commit-id: 5c719a7ce988339d034a653456da9742dc2cec7c
|
2023-08-30 16:23:08 +08:00 |
|
codingma
|
f7658db1b6
|
Merge pull request #651 from hiyouga/feature-dataset_stage
add dataset stage
Former-commit-id: 3b0ef57405cbc22ff8ce4eef2cfcb73872519db5
|
2023-08-28 16:03:45 +08:00 |
|
codemayq
|
b869bc1a20
|
add ad gen dataset
Former-commit-id: fcd0788aa4dda0cecc1420d369d371032a207810
|
2023-08-27 20:35:32 +08:00 |
|
codemayq
|
a72d756d77
|
add text format dataset preview in webui
Former-commit-id: cd30871aadb40cd3d598a6d0b415946744d2d550
|
2023-08-24 19:45:36 +08:00 |
|
codemayq
|
d3fd8f89b8
|
add stage in DatasetAttr
Former-commit-id: 9c55200d8de0623640f529dbf39b8b0f169636d3
|
2023-08-23 20:54:53 +08:00 |
|
hiyouga
|
180a05a446
|
fix import error
Former-commit-id: b3207a974a45038591b8cbbcf20d1ca1142d6679
|
2023-08-23 20:45:03 +08:00 |
|
hiyouga
|
eb9ac9ee1f
|
fix #649
Former-commit-id: e6120a937ddb4f3c0b9bcb2466742f5cf4f77f8c
|
2023-08-23 20:21:15 +08:00 |
|
codemayq
|
a6662b73f5
|
add readme for dataset
Former-commit-id: bdcb0ea40e726e4c5752f938b379ed9a18e7e1d0
|
2023-08-23 19:55:45 +08:00 |
|
codemayq
|
cbc7db3478
|
add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: 26e4136449a4df6028d834fd16a0f4a7c532759d
|
2023-08-23 18:54:23 +08:00 |
|
hiyouga
|
4606340f0f
|
fix webui
Former-commit-id: 95304b6822d9fe04bcddc1ee246a56389bd5f96a
|
2023-08-23 11:03:35 +08:00 |
|
hoshi-hiyouga
|
d4b4ccd597
|
Merge pull request #644 from hiyouga/fix-quantization_bit
fix quantization bit is ""
Former-commit-id: e1a8eca182e532b48e472919b4474656a726b40c
|
2023-08-23 10:45:45 +08:00 |
|
codemayq
|
9c3f4e3a37
|
fix quantization bit is ""
Former-commit-id: 0dcab66f8843e2887f9f7ca66334122fef35c5b7
|
2023-08-23 10:08:17 +08:00 |
|
codemayq
|
440e00d8f9
|
fix quantization is ""
Former-commit-id: 2469cc16d1dd3f5ee822edc18b2d7021ff7cba03
|
2023-08-23 10:04:03 +08:00 |
|
hiyouga
|
6310613699
|
update template
Former-commit-id: a95f3a4d62de1073a78125401cf4289ec0523156
|
2023-08-22 19:46:09 +08:00 |
|
hoshi-hiyouga
|
f55907dbea
|
Merge pull request #629 from panpan0000/main
add rm dataset explanation
Former-commit-id: c2b4571d0ffb6298d6e07212982d9c13efd65adf
|
2023-08-22 13:41:44 +08:00 |
|
Peter Pan
|
5cac87d317
|
add rm dataset explanation
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Former-commit-id: 1efb95025be6501f1b30b20e7c711d3590b5d1ee
|
2023-08-22 01:33:59 -04:00 |
|
hoshi-hiyouga
|
9c0622de13
|
Merge pull request #619 from hiyouga/feature-templateTest
add template encode test
Former-commit-id: 8a1587ae49fff3968e0182f4fcc9a65dfdb260fc
|
2023-08-21 20:56:34 +08:00 |
|
codemayq
|
37b93c8b71
|
add template encode test
Former-commit-id: c15e0d6847cbc055d8376b3c43ac4fbd17b5877a
|
2023-08-21 20:51:24 +08:00 |
|
hiyouga
|
d6be98cda6
|
fix #617
Former-commit-id: a7bdaf1c92c7d798caf8438dc42a8972632ec584
|
2023-08-21 18:16:11 +08:00 |
|
hiyouga
|
4d128acc17
|
fix #608
Former-commit-id: c02a6809124fcfd06628c49c95d419ec2d8cc8ef
|
2023-08-21 17:49:36 +08:00 |
|
hiyouga
|
516df9ecce
|
fix baichuan template for training #597 #616
Former-commit-id: 6530c1d972301eac9ef058b3235618bb09833f15
|
2023-08-21 17:41:51 +08:00 |
|