| Author | Commit | Message | Former-commit-id | Date |
|---|---|---|---|---|
| hiyouga | f8376b228a | add xuanyuan models | 6e7af11b989e4cf97ffacbab4736e3434ff6c925 | 2023-12-02 00:35:29 +08:00 |
| hiyouga | c60e79c12e | patch modelscope | bd42c229b01a0bf3ceadb8cee5ad49a060cc2d13 | 2023-12-01 22:53:15 +08:00 |
| yuze.zyz | fcd61657ee | remove useless code | 5a2392f105704810e9ce96c13fcc8a555726f9b8 | 2023-12-01 17:28:23 +08:00 |
| tastelikefeet | eb835b693d | fix bug | d9e52957e272e8133f1b37cf20d193084425e09e | 2023-12-01 17:27:00 +08:00 |
| yuze.zyz | b2200409f5 | add readme | 5aa6751e52b5c2e06727c50e60218226b146b7bf | 2023-12-01 16:11:30 +08:00 |
| tastelikefeet | 63e12226a0 | add model | 8ce4d11e38518b0b4657c7e64394d471cbb0bd6d | 2023-12-01 15:06:17 +08:00 |
| yuze.zyz | 45925e4a9c | fix | fb2204c183ae8c061ed6ec7f4f1bfbb0b4900c9b | 2023-11-29 21:43:58 +08:00 |
| yuze.zyz | e08e0e5814 | support ms | d38a2e7341100902b6c761895b1fe6191c905d06 | 2023-11-29 20:36:55 +08:00 |
| hiyouga | ae1048db6d | fix #1659 | 475a3fa0f4c09d4cfd55ec66271a6d3c9eb5f4d2 | 2023-11-28 20:52:28 +08:00 |
| hiyouga | 5f2943dc84 | support Yi-34B-Chat models | ff1c289229ee382d3e76578bbb6a5e299b969ded | 2023-11-23 19:31:49 +08:00 |
| hiyouga | e4f97615f0 | update ppo and demo in webui | 7537dd434f4c0f0bde06bd8c2ac69bf622772316 | 2023-11-16 14:55:26 +08:00 |
| hiyouga | 3e0b76650a | update readme and constants | 1e19cf242a1f843b590feefbe24b2cc0a17712b5 | 2023-11-15 18:04:37 +08:00 |
| hiyouga | 06a4820836 | disentangle model from tuner and rename modules | 4736344eb1595ee023a50d49e8118f4eee46305f | 2023-11-15 16:29:09 +08:00 |
| hiyouga | 4a767e5593 | release v0.2.2, fix #1478 #1466 | 35cc1e28f675889c44f75a0a3194005c7f23631b | 2023-11-13 23:09:05 +08:00 |
| hiyouga | 0fbaa42752 | refactor constants | 3697a3dc9a0be8141951dfe65812844f66059517 | 2023-11-10 14:16:10 +08:00 |
| hiyouga | 3d40bdb600 | upgrade peft, fix #1088 #1411 | b2a60905f384ada92618bf21301fe96dac1c10bf | 2023-11-07 16:13:36 +08:00 |
| hiyouga | d48478ef88 | update constants | f28a034a9b74630a56314446bd0f103c086bda60 | 2023-10-29 13:30:20 +08:00 |
| hiyouga | d338ab3e19 | fix #1068 #1074 | d11a5454633be9f0600cbd1ab7a26c9c8fa5ed80 | 2023-09-28 14:39:16 +08:00 |
| hiyouga | deb17942ab | fix layer norm dtype | 84b7486885c600e5e65c5ba9095d56ecc2502977 | 2023-09-28 00:25:55 +08:00 |
| hiyouga | 5ee1bdecdc | add MMLU and C-Eval script | 465ee8119aa489a41bee0b01b3c105a2f3dd137f | 2023-09-23 00:34:17 +08:00 |
| hiyouga | 6a71361a54 | remove PeftTrainer | b218c271edfb07006ddc34b1aca404088de6c528 | 2023-09-10 22:23:23 +08:00 |
| hiyouga | 51f662860d | update baichuan2 template | 0531886e1f534217dc3c9c0775d29fcf77ff7f5f | 2023-09-06 21:43:06 +08:00 |
| hiyouga | f9aee17f9d | add Baichuan2 models | 62ce65c6282d2bbcb765354acc2819cc3e983a46 | 2023-09-06 18:36:04 +08:00 |
| hiyouga | a4fd976048 | refactor dataset_attr, add eos in pt, fix #757 | a9d1fb72f791ae57a4d12f4e3a7e2abccf6a7077 | 2023-09-01 19:00:45 +08:00 |
| codemayq | 4b29d9d2b0 | add dataset stage and filter dataset when stage chosen in webui | c0e4d1e81b41c9a36291d8bee46d7d807c898c21 | 2023-08-23 18:54:23 +08:00 |
| hiyouga | 02a61b08b1 | update webui | 9d0f6214b68a653c0a67632437b227ab8f589bed | 2023-08-14 22:45:26 +08:00 |
| codemayq | ee7da14f81 | add template match and stage in webui | 79c68e552722079faf2ab0858870b481844d66ae | 2023-08-14 20:42:59 +08:00 |
| hiyouga | 3f0a2d6adc | support rope scaling, fix #475 #476 #478 | fa940c17b8d3e379af08804003f1a522c1cd6ac4 | 2023-08-12 20:46:27 +08:00 |
| codemayq | 3ba1b81105 | add sft script preview in webui | 6bc8e9866d482c945dd98f4e9ab205a7d7270755 | 2023-08-12 13:53:55 +08:00 |
| hiyouga | 79f4ba0d26 | Release v0.1.6 | a48cb0d474ef0648a97387daf5f623498b5e3ee6 | 2023-08-11 23:25:57 +08:00 |
| hiyouga | abdfa26d06 | support DPO training (2305.18290) | 3ec4351cfdaf2aefcc7d13345e19d79874ed61d3 | 2023-08-11 03:02:53 +08:00 |
| hiyouga | b32ed1d7be | support interleave probs | 69744c17e8180e0ad549b57d575454724b820d01 | 2023-08-04 21:27:35 +08:00 |
| hiyouga | 94f2fd634f | update UI, fix #212 | 4d1641c1bff1b2b53c0e9b80b3e3ac7979223ccd | 2023-07-20 22:09:06 +08:00 |
| hiyouga | f7f2accf05 | support LLaMA-2 | 7a3ade8c699ff1cd2d17590e2f8df79e1738cee2 | 2023-07-19 16:42:14 +08:00 |
| hiyouga | 4e1997a343 | a monkey patch for lora_target | 262252d67bbe4ebcbb315b5d7a34f9a091f8af0c | 2023-07-18 00:31:40 +08:00 |
| hiyouga | 091805d38e | release v0.1.0 | f8193e8009451cf569a28a10eb4bd88831844441 | 2023-07-18 00:18:25 +08:00 |
| hiyouga | a696148d6b | modity code structure | f75137661358f9070bc70c341dfa2cc5fd69cf94 | 2023-07-15 16:54:28 +08:00 |