hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
c635bbe465
|
fix #2732
Former-commit-id: bc39ad1d102b91d5417daa38b8a581e1e1ab2af9
|
2024-03-09 22:37:16 +08:00 |
|
hiyouga
|
4881f4e631
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
c631799f5d
|
fix #2766
Former-commit-id: a8cd556230c1d0bc4e090acc2276c035910ce6f6
|
2024-03-09 21:35:24 +08:00 |
|
hiyouga
|
48846676d8
|
use default arg for freeze tuning
Former-commit-id: a38fd7c8b39cb59fb61c26fdf80aaa6f2d0623b9
|
2024-03-09 06:08:48 +08:00 |
|
hiyouga
|
f37d481c5d
|
add GaLore results
Former-commit-id: ac05b9bba62924693bdede85917d21b844849b8c
|
2024-03-09 04:11:55 +08:00 |
|
hiyouga
|
5d7d8bd55c
|
update hardware requirements
Former-commit-id: 604b3d10fc1448f702943114b66b97bded21e080
|
2024-03-09 03:58:18 +08:00 |
|
hiyouga
|
8ed1463236
|
update examples
Former-commit-id: 38592faa258f7331afb95bc5db4b9bf37f08105d
|
2024-03-09 02:30:37 +08:00 |
|
hiyouga
|
43b2ede0f8
|
fix #2756 , patch #2746
Former-commit-id: 627d1c91e675f1d9ebf47bad123cbbf29821da4d
|
2024-03-09 02:01:26 +08:00 |
|
hoshi-hiyouga
|
2f095e2017
|
Merge pull request #2746 from stephen-nju/main
fix deepspeed ppo RuntimeError
Former-commit-id: 656c653f0c628f9494b4d7ae12e60c8eeec1ea7a
|
2024-03-09 01:37:00 +08:00 |
|
hiyouga
|
9b55bb964c
|
Update setup.py
Former-commit-id: 543740fa00dda2c5d16822f7c9f4ef32d916426f
|
2024-03-09 00:14:48 +08:00 |
|
hiyouga
|
9b97b23ce7
|
fix aqlm version
Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e
|
2024-03-09 00:09:09 +08:00 |
|
hiyouga
|
53ab28533e
|
fix example params
Former-commit-id: 0280748528488d7bee6b9074025255453966124c
|
2024-03-08 20:41:43 +08:00 |
|
stephen_zhu
|
940c00e7ae
|
update
Former-commit-id: 295f9ef2eff2e8b5d7a21d3da8dd3e6eb2a42006
|
2024-03-08 12:47:44 +08:00 |
|
stephen
|
18cfd5f349
|
fix ppo runtime error
Former-commit-id: 14e2f221e3e720075e59065a3dc42aa4d993a8b6
|
2024-03-08 11:48:26 +08:00 |
|
hiyouga
|
d46c2bbcba
|
update readme
Former-commit-id: 353db1e28aa8888228a05813bb09c51e7d28728c
|
2024-03-08 03:06:21 +08:00 |
|
hiyouga
|
48d4364586
|
fix chat engine, update webui
Former-commit-id: 8b32dddd7d883bae07735796a517927c79d1c33b
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
8042c66a76
|
Update setup.py
Former-commit-id: 76c3ec05258a5f5d1f78430ef6258a5eda527d65
|
2024-03-08 01:23:00 +08:00 |
|
hiyouga
|
3879d79b89
|
update galore args
Former-commit-id: c7479a7976f773feb36aab4fdb0500be53d83b6a
|
2024-03-08 01:17:32 +08:00 |
|
hiyouga
|
e416cecf62
|
fix galore
Former-commit-id: 62a3ceeef8f60caef43ccc7f971a0c9184e21296
|
2024-03-08 00:44:51 +08:00 |
|
hiyouga
|
81fcb80466
|
add Yi-9B model
Former-commit-id: bfcb0245b832242eefb84de6f70bd75544f3ceb7
|
2024-03-07 23:11:57 +08:00 |
|
hiyouga
|
bf812fbe40
|
add galore examples
Former-commit-id: aabf1b99f39aae535401b2f65f0d629def6e39f5
|
2024-03-07 22:53:45 +08:00 |
|
hiyouga
|
1e6fb6c8aa
|
support galore
Former-commit-id: b67a4a46a88d83bb2a3459b3317b66cda15e0171
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
5d0c95bd02
|
update readme
Former-commit-id: 649e3e8cb741b28552b351a3e2627345e292689d
|
2024-03-07 20:34:49 +08:00 |
|
hiyouga
|
7cd2417002
|
tiny fix
Former-commit-id: 731530212152476f76963bba121ce2fe1264432a
|
2024-03-07 20:29:34 +08:00 |
|
hoshi-hiyouga
|
16851d66e5
|
Merge pull request #2739 from hiyouga/dev-vllm
support vllm
Former-commit-id: 8cc876958a6c05e644e2f519282efb6f222a2277
|
2024-03-07 20:28:18 +08:00 |
|
hiyouga
|
056d2d956a
|
support vllm
Former-commit-id: 889f6e910e654d8ec3922c2185042d737ffbf1c3
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
9a69cadab3
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hoshi-hiyouga
|
3de642bffd
|
Merge pull request #2730 from cx2333-gt/main
fix flash_attn in train_web
Former-commit-id: eff0b774fc8e1a5a07a2554d611cb85bef439dec
|
2024-03-07 14:37:18 +08:00 |
|
cx2333
|
286b9d9849
|
revert choice name
Former-commit-id: 7832e68072219c7d1f562aee868812a4d655f4e0
|
2024-03-07 14:28:55 +08:00 |
|
hiyouga
|
cef1ede826
|
fix chatglm3 template
Former-commit-id: 9be0aa70fdd2e9ec208aa1850ace5c287efc8c3a
|
2024-03-07 14:26:16 +08:00 |
|
cx2333
|
5007566588
|
fix flash_attn in train_web
Former-commit-id: 5f340e362b0e91fec76c19c77c5705bba1db481a
|
2024-03-07 10:13:55 +08:00 |
|
hiyouga
|
e93fb3cc6c
|
tiny fix
Former-commit-id: c3145afa4164dd28888f17599a154f7dddbe9326
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
7578209735
|
export use balanced gpu
Former-commit-id: 710487dc694489bf3dfe54f8d32df80ce46439e4
|
2024-03-06 16:33:14 +08:00 |
|
hiyouga
|
67f02f75d0
|
fix add tokens
Former-commit-id: ff5353681a87d033903bf8cf6133c6bdb3fa9e5a
|
2024-03-06 15:04:02 +08:00 |
|
hiyouga
|
73d9dfc7ab
|
fix version checking
Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
6b407092d9
|
update examples
Former-commit-id: 194e25606515bfa42c3be27d68f68d604191514b
|
2024-03-06 13:14:57 +08:00 |
|
hiyouga
|
3168abc0a1
|
fix arg dtype
Former-commit-id: 999ae05655815ac04ababddae55d9343f5d39f84
|
2024-03-05 20:53:30 +08:00 |
|
hiyouga
|
46ee267cfc
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
a10bead9b5
|
optimize aqlm training
Former-commit-id: 8b42660e4039b3d6475f502f397686ba6b140627
|
2024-03-05 18:35:41 +08:00 |
|
hiyouga
|
3553e301dd
|
fix dora inference
Former-commit-id: 21b3597b0a05169afe51e1609b532787a65ca8ea
|
2024-03-05 11:51:41 +08:00 |
|
hiyouga
|
02b838b9b0
|
fix export model
Former-commit-id: 7ba2f7bf8da3c559e05d8dde20e93cd1d3d4e8ef
|
2024-03-05 11:05:41 +08:00 |
|
hiyouga
|
b1de6d1025
|
update readme
Former-commit-id: bd6fd8ad3a5ef8c49247dc1b1cd7584ef211489e
|
2024-03-05 03:20:23 +08:00 |
|
hiyouga
|
bc67872218
|
add examples
Former-commit-id: 2744dc9d2f9df4150a496b38e24ea96040a85bef
|
2024-03-05 03:16:35 +08:00 |
|
hiyouga
|
0229fffde5
|
auto set chat template
Former-commit-id: d8bf2f0efe6919990c7032aaa06010980cfde019
|
2024-03-05 02:41:20 +08:00 |
|
hiyouga
|
3555b87363
|
update readme
Former-commit-id: c95bc2774800ed2e6d54a6099a466bdacc0cfb78
|
2024-03-04 19:29:26 +08:00 |
|
hiyouga
|
2dca53962e
|
fix export on cpu device
Former-commit-id: e4722a9a627ea4e9a1341cc00a3108dd06a6b550
|
2024-03-04 17:35:09 +08:00 |
|
hiyouga
|
f4f71f2797
|
fix sub-process error in thread
Former-commit-id: 3448ad43d05301b12a19a02c1cc23d7b0ee525c3
|
2024-03-03 15:04:35 +08:00 |
|
hiyouga
|
77ab9457ed
|
update readme
Former-commit-id: 8f1bbd8f5954f64554b7dbe98073d19841e0cb74
|
2024-03-03 01:41:07 +08:00 |
|
hiyouga
|
4fa53b6282
|
update readme, add starcoder2, cosmopedia
Former-commit-id: 1ae7c183640146bb9b06c98942985a1721d2b9c9
|
2024-03-03 01:01:46 +08:00 |
|