hiyouga
|
9aeb88c426
|
add export_device in webui #3333
Former-commit-id: 30ebd3652809d73941e0a5e4a8be11d989faf98d
|
2024-04-25 19:02:32 +08:00 |
|
hiyouga
|
83404c4fa9
|
support new special token #3420
Former-commit-id: f5c6a47f5193ab3a6c137580992bdcce0b31fdd5
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
d2bb1b3a6b
|
reenable sdpa and fast tok by default
Former-commit-id: 9e00902dbedc71d55743d1bf237843506a557891
|
2024-04-24 02:18:44 +08:00 |
|
hiyouga
|
1d2e372a8e
|
update readme
Former-commit-id: d4eaee262a64e716ce475dc4eb18d8d9697d8dd8
|
2024-04-22 17:09:17 +08:00 |
|
hiyouga
|
233e167f68
|
fix optimizers
Former-commit-id: f811eee2fa12a89a55a9c5d3a05a1521b4347727
|
2024-04-21 20:40:54 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
Marco
|
abd9fed445
|
fix small typo
Former-commit-id: 5638a03cd0cf8119ff366b3b3e303b5a2351b065
|
2024-04-18 20:33:29 +02:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
d301f0a64b
|
Update parser.py
Former-commit-id: 92c2133896c20054db86dd53508c982e39bd5ca0
|
2024-04-16 18:09:31 +08:00 |
|
hoshi-hiyouga
|
42084e08ae
|
Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
Former-commit-id: 10a5e1e65b34b03e5ca2a41bf6ded09a3fb25f0c
|
2024-04-16 17:32:16 +08:00 |
|
hoshi-hiyouga
|
7ecea08b9b
|
Update parser.py
Former-commit-id: 898239883afc79f03abd0dc276eef901662a9591
|
2024-04-16 17:27:25 +08:00 |
|
hoshi-hiyouga
|
191971865d
|
Update parser.py
Former-commit-id: 2f3da8169d18b026760cc0ac7dd6141bdd08c932
|
2024-04-16 17:27:02 +08:00 |
|
hoshi-hiyouga
|
ff4f587dd9
|
Update finetuning_args.py
Former-commit-id: 3a23d900aea74078f0bc8cf73fac860a4ce3df67
|
2024-04-16 17:26:30 +08:00 |
|
hiyouga
|
b638c65519
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
Jonery
|
d4d471450f
|
Feature BAdam
Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
276f2cb24e
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
9338f878a3
|
fix #3273
Former-commit-id: 3b20c89b342a068356ffc29c3724b645775c65db
|
2024-04-15 15:32:58 +08:00 |
|
hiyouga
|
31bbbb6d13
|
fix #3238
Former-commit-id: 4d7e81ab4722d13bec6ca1af141f94bdc74d0883
|
2024-04-12 14:28:11 +08:00 |
|
hiyouga
|
f6530222f7
|
fix #3116
Former-commit-id: b7256aa33d761280751518c20f29f9b8ea3fb025
|
2024-04-03 14:47:59 +08:00 |
|
hiyouga
|
b12176d818
|
simplify readme
Former-commit-id: 0da6ec2d516326fe9c7583ba71cd1778eb838178
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
117b67ea30
|
add moe aux loss control #3085
Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
e7f13098c6
|
support infer 4bit model on GPUs #3023
Former-commit-id: 950a9dab9055839990656b2b40956792b253573d
|
2024-04-01 17:34:04 +08:00 |
|
hiyouga
|
d764cd8736
|
support ORPO
Former-commit-id: f44a4c27e2461cdaa1b16865f597a31033c0e6d9
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
9408366a36
|
fix #2982
Former-commit-id: e5e6a0c50c7a1c0052ed6b459450b9735ff2c9a1
|
2024-03-28 20:22:31 +08:00 |
|
hiyouga
|
a916688723
|
fix bug
Former-commit-id: f513e1415cc3fe87f600318fba855d1286b6d007
|
2024-03-26 17:30:12 +08:00 |
|
hiyouga
|
3336422760
|
fix #2961
Former-commit-id: 616917bb3be7f71073b56ad8c7bc4e164b08b9b5
|
2024-03-26 17:26:14 +08:00 |
|
hiyouga
|
bf8d2f8eda
|
tiny fix
Former-commit-id: bf2455e420cf35c6596528f319c1b18408b5519a
|
2024-03-25 23:28:52 +08:00 |
|
hiyouga
|
ebd6bc2604
|
add arg check
Former-commit-id: 86e0d5a5a50ae34307f5176c7c4a6ab9d0c224b9
|
2024-03-25 22:42:58 +08:00 |
|
hiyouga
|
46f99ff277
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
3c91e86268
|
[FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: c35b3c3b1e27171f8a703f88ede1dc8a84c80a56
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
9a784fb4f3
|
fix kv cache
Former-commit-id: a9588e36e95bed896eea8d79ba7108447ff08f4b
|
2024-03-13 01:21:50 +08:00 |
|
hiyouga
|
43fd80a1aa
|
support QDoRA
Former-commit-id: d8ad1c5ef08e733e52084de271aad762b1613129
|
2024-03-12 22:12:42 +08:00 |
|
hiyouga
|
6c1b4aec75
|
fix #2802
Former-commit-id: 1370db270d7ba1a20468abdb29193ce7534d1b4f
|
2024-03-12 17:08:34 +08:00 |
|
hiyouga
|
c9ed3fc3a4
|
fix #2782 #2798
Former-commit-id: eb3ab610610a0964bc8a1c9fa015805353f04c31
|
2024-03-12 15:53:29 +08:00 |
|
hiyouga
|
7c492864e9
|
update parser
Former-commit-id: d98258aa08d93494ad50d7786064e7fda15f6ca9
|
2024-03-10 13:35:20 +08:00 |
|
hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
4881f4e631
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
5d7d8bd55c
|
update hardware requirements
Former-commit-id: 604b3d10fc1448f702943114b66b97bded21e080
|
2024-03-09 03:58:18 +08:00 |
|
hiyouga
|
48d4364586
|
fix chat engine, update webui
Former-commit-id: 8b32dddd7d883bae07735796a517927c79d1c33b
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
3879d79b89
|
update galore args
Former-commit-id: c7479a7976f773feb36aab4fdb0500be53d83b6a
|
2024-03-08 01:17:32 +08:00 |
|
hiyouga
|
e416cecf62
|
fix galore
Former-commit-id: 62a3ceeef8f60caef43ccc7f971a0c9184e21296
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
1e6fb6c8aa
|
support galore
Former-commit-id: b67a4a46a88d83bb2a3459b3317b66cda15e0171
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
056d2d956a
|
support vllm
Former-commit-id: 889f6e910e654d8ec3922c2185042d737ffbf1c3
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
9a69cadab3
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hiyouga
|
73d9dfc7ab
|
fix version checking
Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
46ee267cfc
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
59a9a5994e
|
fix #2649
Former-commit-id: 1c850de660c671d92f0bc63f230d338b60b7c0bd
|
2024-03-01 13:02:41 +08:00 |
|
hiyouga
|
544e7a491b
|
release v0.5.3
Former-commit-id: f6bc89581b3cd129448da2defc23848de6f494ed
|
2024-02-29 00:34:19 +08:00 |
|
hiyouga
|
b392e6cfb9
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: 6614cc1f08aa944db083e27e451bbdd733f7dd97
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
a274900188
|
fix #2532
Former-commit-id: 23a8e64f1c47cd473c627effbe271233c136369c
|
2024-02-21 21:55:14 +08:00 |
|