hiyouga
|
edd28dbe2c
|
fix bug
Former-commit-id: 8172530d54fbd42a9dd3219f06378563d62424e0
|
2024-03-13 23:55:31 +08:00 |
|
hiyouga
|
9ff7c99eb1
|
fix bug
Former-commit-id: 714d936dfbe022c4f2cfa6ff643e3482a3f96012
|
2024-03-13 23:43:42 +08:00 |
|
hiyouga
|
8b8671817f
|
improve lora+ impl.
Former-commit-id: 72367307dfadf936fb989ebe8bc9f0ff229fb933
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
24c9277488
|
[FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: a0965cd62c85545aa2364e244295df2963308354
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
922bd8864b
|
fix #2817
Former-commit-id: 0b4a5bf509a6fbf18337a29a6a498f33d0cbca76
|
2024-03-13 12:42:03 +08:00 |
|
hiyouga
|
8673abbe5e
|
fix #2802
Former-commit-id: b9f87cdc11b3fe712574b91455dc722b69c60c66
|
2024-03-13 12:33:45 +08:00 |
|
hiyouga
|
a74426df0f
|
fix kv cache
Former-commit-id: 96ce76cd2753bc91c781ad13aa8f7a972abe815a
|
2024-03-13 01:21:50 +08:00 |
|
hiyouga
|
bbf272f96e
|
support QDoRA
Former-commit-id: 19ef4826490b79e0c2aee20ad67430aa0e4724a7
|
2024-03-12 22:12:42 +08:00 |
|
hiyouga
|
096c31bfb6
|
patch for gemma cpt
Former-commit-id: 70a3052dd8a2d1322fa01ab19e369e465842d416
|
2024-03-12 21:21:54 +08:00 |
|
hiyouga
|
c28818c39f
|
fix plot issues
Former-commit-id: 60cc17f3a8b56c0b2ad76be7c10ca0b4e1738121
|
2024-03-12 18:41:35 +08:00 |
|
hiyouga
|
14ed926a2d
|
support olmo
Former-commit-id: b3247d6a1604f4cbeb0d7c163d0082ce91afb870
|
2024-03-12 18:30:38 +08:00 |
|
hiyouga
|
0b7e870b07
|
fix #2802
Former-commit-id: 8d8956bad542c0e1c0f7edbf4ffc22bb0f8788ae
|
2024-03-12 17:08:34 +08:00 |
|
hiyouga
|
7124b71676
|
fix #2782 #2798
Former-commit-id: 07f9b754a7418b489e839bd674aa47094583a92d
|
2024-03-12 15:53:29 +08:00 |
|
hiyouga
|
c88062347e
|
fix #2775
Former-commit-id: e874c00906c765b81c0e5ff9c7b3679557da8e0e
|
2024-03-11 00:42:54 +08:00 |
|
hiyouga
|
f776e738f8
|
tiny fix
Former-commit-id: 352693e2dcc8fc039b5d574e1a5709563929b0ce
|
2024-03-11 00:17:18 +08:00 |
|
hiyouga
|
566bfad930
|
update parser
Former-commit-id: be99799413e1ba37807a02838bf2d87fd966bf55
|
2024-03-10 13:35:20 +08:00 |
|
hiyouga
|
4a4e4b4354
|
support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
276def1897
|
fix #2732
Former-commit-id: 18ffce36b5ee0809f2e2905c2fd44843a3725ea0
|
2024-03-09 22:37:16 +08:00 |
|
hiyouga
|
868444e124
|
allow non-packing pretraining
Former-commit-id: bdb496644ce2c18806fc4fdae1fedcb3e5b5f808
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
1173441661
|
fix #2766
Former-commit-id: 412c52e325660e8b871ffd59f5564f84f46a143f
|
2024-03-09 21:35:24 +08:00 |
|
hiyouga
|
8f6eb1383d
|
use default arg for freeze tuning
Former-commit-id: af0e370fb16f3e0cf2f4c8036301d5253d8249b9
|
2024-03-09 06:08:48 +08:00 |
|
hiyouga
|
5c00783697
|
update hardware requirements
Former-commit-id: 393c2de27ce0a2dee793092843ec0afa54f49a6d
|
2024-03-09 03:58:18 +08:00 |
|
hiyouga
|
c561b268ef
|
fix #2756 , patch #2746
Former-commit-id: e8dd38b7fdf8e172745d2538eb103895f2839c38
|
2024-03-09 02:01:26 +08:00 |
|
hoshi-hiyouga
|
36d65289d0
|
Merge pull request #2746 from stephen-nju/main
fix deepspeed ppo RuntimeError
Former-commit-id: 516d0ddc666c179616a2a610b1353728db57391e
|
2024-03-09 01:37:00 +08:00 |
|
hiyouga
|
398c261c7c
|
fix aqlm version
Former-commit-id: 10be2f0eccc3963a985afcd24e5b8b8fc638b1c3
|
2024-03-09 00:09:09 +08:00 |
|
stephen_zhu
|
c69b9fbe58
|
update
Former-commit-id: aa71571b773c5dc527b17219ec87828e4455b330
|
2024-03-08 12:47:44 +08:00 |
|
stephen
|
495b858606
|
fix ppo runtime error
Former-commit-id: cdb7f82869b07d9d5d31b7b2aaf6b033bd00e32e
|
2024-03-08 11:48:26 +08:00 |
|
hiyouga
|
7443ac3116
|
fix chat engine, update webui
Former-commit-id: 5d956e2a5167201aecdfce2794c25d8a2d84e234
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
2235020cc9
|
update galore args
Former-commit-id: 0ac6b40a4772b61a3476bb74b976d24c408a2c35
|
2024-03-08 01:17:32 +08:00 |
|
hiyouga
|
5b50458acf
|
fix galore
Former-commit-id: 33a4c24a8a3c153bc62edf74b9246699a0ae3233
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
f373290012
|
add Yi-9B model
Former-commit-id: 57452a4aa1d37a047d659f002c1aaa6246f64178
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
2c010c72b8
|
support galore
Former-commit-id: 28f78621883917425fabe49f5473778111012127
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
34533b2f35
|
support vllm
Former-commit-id: d07ad5cc1cdbc13879afd84f653afdfee03a6933
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
37e40563f1
|
fix #2735
Former-commit-id: f74f804a715dfb16bf24a056bc95db6b102f9ed7
|
2024-03-07 16:15:53 +08:00 |
|
hoshi-hiyouga
|
90e66c8d94
|
Merge pull request #2730 from cx2333-gt/main
fix flash_attn in train_web
Former-commit-id: 2185855bdb7d4cb55f3af796e35fb1b0e8dce5e3
|
2024-03-07 14:37:18 +08:00 |
|
cx2333
|
013c12a135
|
revert choice name
Former-commit-id: 94b7a1b91588716e9e91f7aae7126ed924e55953
|
2024-03-07 14:28:55 +08:00 |
|
hiyouga
|
843d3f7a97
|
fix chatglm3 template
Former-commit-id: 921ee822679bd10fcc14d084d0845e6132e570dd
|
2024-03-07 14:26:16 +08:00 |
|
cx2333
|
22624e566e
|
fix flash_attn in train_web
Former-commit-id: a8889498fa4e9b6c7a82422ed5b1da3662b48d42
|
2024-03-07 10:13:55 +08:00 |
|
hiyouga
|
31c618f1f7
|
tiny fix
Former-commit-id: 0048a2021e94d068f7c6054df0b9569ae4912eb1
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
8b6c178249
|
export use balanced gpu
Former-commit-id: 3e84f430b14a94e68f5815d8e412f0d74d28a04c
|
2024-03-06 16:33:14 +08:00 |
|
hiyouga
|
8b21a60d9c
|
fix add tokens
Former-commit-id: 9658c63cd94d28bba730a19f73397580b9865d6b
|
2024-03-06 15:04:02 +08:00 |
|
hiyouga
|
e887aface7
|
fix version checking
Former-commit-id: 3016e6565708637c1d760f2cd5a67cbd8a5a6c26
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
af526c3a46
|
fix arg dtype
Former-commit-id: e0c47358f9d09ab64acbb5ebafc61b52b5b1f2af
|
2024-03-05 20:53:30 +08:00 |
|
hiyouga
|
9561809ce9
|
improve aqlm optim
Former-commit-id: 259af60d28985b919911587716c24a3ac7f7de64
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
c776cdfc3e
|
optimize aqlm training
Former-commit-id: d3d3dac7070eb9055bcdc91eaf53f5b3741c0bda
|
2024-03-05 18:35:41 +08:00 |
|
hiyouga
|
0f2250b831
|
fix dora inference
Former-commit-id: ddf352f861e04e813cb8adeb4513964b4945081a
|
2024-03-05 11:51:41 +08:00 |
|
hiyouga
|
768358b960
|
fix export model
Former-commit-id: e5edcf440f2c96b90b1186ada887873f19d3c152
|
2024-03-05 11:05:41 +08:00 |
|
hiyouga
|
eeb34e5696
|
auto set chat template
Former-commit-id: 9e56eaf2d3e42d3985b0e68929a3a32e1db157ef
|
2024-03-05 02:41:20 +08:00 |
|
hiyouga
|
b04316d9a8
|
update readme
Former-commit-id: 24a79bd50f972008ca1a862edaff7cdd6c211cef
|
2024-03-04 19:29:26 +08:00 |
|
hiyouga
|
a62d17d009
|
fix export on cpu device
Former-commit-id: cda2ff87272797a062c7addb1bf840ac46208dfd
|
2024-03-04 17:35:09 +08:00 |
|