hiyouga
|
7e225be16e
|
add yuan model
Former-commit-id: 6a0377e2e51633bd5fb10fa8628e554565c5ee3e
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
a53b2a643f
|
add xverse-65B-2 model
Former-commit-id: 3e563a0d9666934dfdab54d61654ec00079a93f1
|
2023-12-18 19:24:09 +08:00 |
|
hiyouga
|
d925ecae1b
|
add models
Former-commit-id: 3a4728557304996bcbe58d7d6380beead7c63c70
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
f927601702
|
add xverse-65b-chat model
Former-commit-id: fff6288db6b61ca27010ea47c918298f76922106
|
2023-12-16 20:21:29 +08:00 |
|
hiyouga
|
9f77e8b025
|
support autogptq in llama board #246
Former-commit-id: fea01226703d1534b5cf511bcb6a49e73bc86ce1
|
2023-12-16 16:31:30 +08:00 |
|
hiyouga
|
296711d502
|
support quantization in export model
Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
a78759e7ee
|
add model urls
Former-commit-id: 3139a9fafab246f5461697efd5ed7a6599d85481
|
2023-12-13 00:09:17 +08:00 |
|
hiyouga
|
6975124a57
|
support mixtral
Former-commit-id: 75b5b8e36ab1933b2625f11b645f56cbc805fd85
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
24ce319b6f
|
add models
Former-commit-id: 758ae7937a41a95016e70180fb343011763c1b67
|
2023-12-06 13:33:18 +08:00 |
|
hiyouga
|
8ca196d51f
|
add xuanyuan models
Former-commit-id: 1dfa9de3723550cddf24bbc0739cad6207731212
|
2023-12-02 00:35:29 +08:00 |
|
hiyouga
|
72bbd5bdef
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
389687a56d
|
remove useless code
Former-commit-id: 323df46dd6a8eaf1fd608380406dcbce80c097b2
|
2023-12-01 17:28:23 +08:00 |
|
tastelikefeet
|
97280c73b9
|
fix bug
Former-commit-id: 6d483e76141420e0cb577541e6e1794c20f025f6
|
2023-12-01 17:27:00 +08:00 |
|
yuze.zyz
|
d323ccc3ec
|
add readme
Former-commit-id: 3d5ec6f12b4ae7d04520e6865516a9a6dd4f7efe
|
2023-12-01 16:11:30 +08:00 |
|
tastelikefeet
|
8547085615
|
add model
Former-commit-id: 48e8d8438bc6cd2c75dc39419c45aaebb34a2e0a
|
2023-12-01 15:06:17 +08:00 |
|
yuze.zyz
|
6933c1fed2
|
fix
Former-commit-id: e8774b4c9cbc8f894621ec72957f720d5c83d22b
|
2023-11-29 21:43:58 +08:00 |
|
yuze.zyz
|
9d125bf533
|
support ms
Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
670ee3934f
|
fix #1659
Former-commit-id: e4123129aae59f4123d53c1f5320e3d5e09ae26d
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
953a562ec1
|
support Yi-34B-Chat models
Former-commit-id: 1751a79c27e7fc13e76a731a061dc0c10d828cda
|
2023-11-23 19:31:49 +08:00 |
|
hiyouga
|
df83def566
|
update ppo and demo in webui
Former-commit-id: de7571704c82121db13e3fc907379d2453100191
|
2023-11-16 14:55:26 +08:00 |
|
hiyouga
|
2162c37e41
|
update readme and constants
Former-commit-id: 7d83e3dd9101a4fdd0b589d0c1f7b609c0feecd1
|
2023-11-15 18:04:37 +08:00 |
|
hiyouga
|
09a4474e7f
|
disentangle model from tuner and rename modules
Former-commit-id: 02cbf91e7e424f8379c1fed01b82a5f7a83b6947
|
2023-11-15 16:29:09 +08:00 |
|
hiyouga
|
ec334f5891
|
release v0.2.2, fix #1478 #1466
Former-commit-id: c9534c411716e1dceb54c5eb35fe845c93ee2973
|
2023-11-13 23:09:05 +08:00 |
|
hiyouga
|
178b85ff9a
|
refactor constants
Former-commit-id: a4d4c3fd35276f20e3b354e9d13ea971029c8775
|
2023-11-10 14:16:10 +08:00 |
|
hiyouga
|
2eb65d21ac
|
upgrade peft, fix #1088 #1411
Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
0f727b393e
|
update constants
Former-commit-id: ebacbb1072045924a7e335cc9dda488d6f0be8b3
|
2023-10-29 13:30:20 +08:00 |
|
hiyouga
|
728dfb1be7
|
fix #1068 #1074
Former-commit-id: 26c6bfd21de06cc56be9a58e2ef69045ea70cc14
|
2023-09-28 14:39:16 +08:00 |
|
hiyouga
|
1c150995ae
|
fix layer norm dtype
Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
35d1921081
|
add MMLU and C-Eval script
Former-commit-id: 3403f876127b4b99c5e3edb2834cc3b9a3a0063f
|
2023-09-23 00:34:17 +08:00 |
|
hiyouga
|
a09a7b650d
|
remove PeftTrainer
Former-commit-id: cc0cff3e991f194732d278e627648e528118a719
|
2023-09-10 22:23:23 +08:00 |
|
hiyouga
|
ed89e29bcc
|
update baichuan2 template
Former-commit-id: 16d9f8ba176443c5b397233da621600d6e1e1eec
|
2023-09-06 21:43:06 +08:00 |
|
hiyouga
|
218f36bca5
|
add Baichuan2 models
Former-commit-id: 36960025e9274b574f57e7a7bf453cd96956e922
|
2023-09-06 18:36:04 +08:00 |
|
hiyouga
|
e5b72c6a77
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: 0feec9a830b917b36686b61938a66e842eccf930
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
cbc7db3478
|
add dataset stage and filter dataset when stage chosen in webui
Former-commit-id: 26e4136449a4df6028d834fd16a0f4a7c532759d
|
2023-08-23 18:54:23 +08:00 |
|
hiyouga
|
7f0b908de2
|
update webui
Former-commit-id: da30d0fb4abdb825f3383ddd106bb06a84695b7a
|
2023-08-14 22:45:26 +08:00 |
|
codemayq
|
9585699918
|
add template match and stage in webui
Former-commit-id: d6283e7f041f08f76d18350cb5f6a6c58ca80e92
|
2023-08-14 20:42:59 +08:00 |
|
hiyouga
|
fdfb644f0a
|
support rope scaling, fix #475 #476 #478
Former-commit-id: 337d5f68b72230e545e7a94ca789187c7a2b7187
|
2023-08-12 20:46:27 +08:00 |
|
codemayq
|
8bf5a98815
|
add sft script preview in webui
Former-commit-id: 2b72649b404750226aa418b61ef5a6c9ac03938f
|
2023-08-12 13:53:55 +08:00 |
|
hiyouga
|
d5f1b99ac4
|
Release v0.1.6
Former-commit-id: 43c8b3c3c8bfb2e32d17fb3e8b194938e37d54bd
|
2023-08-11 23:25:57 +08:00 |
|
hiyouga
|
ca719a8697
|
support DPO training (2305.18290)
Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f
|
2023-08-11 03:02:53 +08:00 |
|
hiyouga
|
76f3ae7bf3
|
support interleave probs
Former-commit-id: 168d99816f9bdc746c587f7f09753ba7e0a4b19d
|
2023-08-04 21:27:35 +08:00 |
|
hiyouga
|
9b3304b054
|
update UI, fix #212
Former-commit-id: ac92c2bd7c47353759474fad9412f21b38c65501
|
2023-07-20 22:09:06 +08:00 |
|
hiyouga
|
8b688251be
|
support LLaMA-2
Former-commit-id: 04dfda054855ee9256586aacbd382f8fb0bfed04
|
2023-07-19 16:42:14 +08:00 |
|
hiyouga
|
baf2e4e825
|
a monkey patch for lora_target
Former-commit-id: 622f44a05b49b10571bd189ae3843683117ad77f
|
2023-07-18 00:31:40 +08:00 |
|
hiyouga
|
eac7f97337
|
release v0.1.0
Former-commit-id: 63c8d3a17cb18f0d8a8e37bfa147daf5bdd28ea9
|
2023-07-18 00:18:25 +08:00 |
|
hiyouga
|
6261fb362a
|
modity code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|