725 Commits

Author SHA1 Message Date
hiyouga
f776e738f8 tiny fix
Former-commit-id: 352693e2dc
2024-03-11 00:17:18 +08:00
hiyouga
566bfad930 update parser
Former-commit-id: be99799413
2024-03-10 13:35:20 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde
2024-03-10 00:24:11 +08:00
hiyouga
276def1897 fix #2732
Former-commit-id: 18ffce36b5
2024-03-09 22:37:16 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644c
2024-03-09 22:21:46 +08:00
hiyouga
1173441661 fix #2766
Former-commit-id: 412c52e325
2024-03-09 21:35:24 +08:00
hiyouga
8f6eb1383d use default arg for freeze tuning
Former-commit-id: af0e370fb1
2024-03-09 06:08:48 +08:00
hiyouga
5c00783697 update hardware requirements
Former-commit-id: 393c2de27c
2024-03-09 03:58:18 +08:00
hiyouga
c561b268ef fix #2756 , patch #2746
Former-commit-id: e8dd38b7fd
2024-03-09 02:01:26 +08:00
hoshi-hiyouga
36d65289d0 Merge pull request #2746 from stephen-nju/main
fix deepspeed ppo RuntimeError

Former-commit-id: 516d0ddc66
2024-03-09 01:37:00 +08:00
hiyouga
398c261c7c fix aqlm version
Former-commit-id: 10be2f0ecc
2024-03-09 00:09:09 +08:00
stephen_zhu
c69b9fbe58 update
Former-commit-id: aa71571b77
2024-03-08 12:47:44 +08:00
stephen
495b858606 fix ppo runtime error
Former-commit-id: cdb7f82869
2024-03-08 11:48:26 +08:00
hiyouga
7443ac3116 fix chat engine, update webui
Former-commit-id: 5d956e2a51
2024-03-08 03:01:53 +08:00
hiyouga
2235020cc9 update galore args
Former-commit-id: 0ac6b40a47
2024-03-08 01:17:32 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a
2024-03-08 00:44:51 +08:00
hiyouga
f373290012 add Yi-9B model
Former-commit-id: 57452a4aa1
2024-03-07 23:11:57 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
37e40563f1 fix #2735
Former-commit-id: f74f804a71
2024-03-07 16:15:53 +08:00
hoshi-hiyouga
90e66c8d94 Merge pull request #2730 from cx2333-gt/main
fix flash_attn in train_web

Former-commit-id: 2185855bdb
2024-03-07 14:37:18 +08:00
cx2333
013c12a135 revert choice name
Former-commit-id: 94b7a1b915
2024-03-07 14:28:55 +08:00
hiyouga
843d3f7a97 fix chatglm3 template
Former-commit-id: 921ee82267
2024-03-07 14:26:16 +08:00
cx2333
22624e566e fix flash_attn in train_web
Former-commit-id: a8889498fa
2024-03-07 10:13:55 +08:00
hiyouga
31c618f1f7 tiny fix
Former-commit-id: 0048a2021e
2024-03-06 17:25:08 +08:00
hiyouga
8b6c178249 export use balanced gpu
Former-commit-id: 3e84f430b1
2024-03-06 16:33:14 +08:00
hiyouga
8b21a60d9c fix add tokens
Former-commit-id: 9658c63cd9
2024-03-06 15:04:02 +08:00
hiyouga
e887aface7 fix version checking
Former-commit-id: 3016e65657
2024-03-06 14:51:51 +08:00
hiyouga
af526c3a46 fix arg dtype
Former-commit-id: e0c47358f9
2024-03-05 20:53:30 +08:00
hiyouga
9561809ce9 improve aqlm optim
Former-commit-id: 259af60d28
2024-03-05 20:49:50 +08:00
hiyouga
c776cdfc3e optimize aqlm training
Former-commit-id: d3d3dac707
2024-03-05 18:35:41 +08:00
hiyouga
0f2250b831 fix dora inference
Former-commit-id: ddf352f861
2024-03-05 11:51:41 +08:00
hiyouga
768358b960 fix export model
Former-commit-id: e5edcf440f
2024-03-05 11:05:41 +08:00
hiyouga
eeb34e5696 auto set chat template
Former-commit-id: 9e56eaf2d3
2024-03-05 02:41:20 +08:00
hiyouga
b04316d9a8 update readme
Former-commit-id: 24a79bd50f
2024-03-04 19:29:26 +08:00
hiyouga
a62d17d009 fix export on cpu device
Former-commit-id: cda2ff8727
2024-03-04 17:35:09 +08:00
hiyouga
0e58cd6422 fix sub-process error in thread
Former-commit-id: 9c10854b46
2024-03-03 15:04:35 +08:00
hiyouga
9ae1514a75 update readme, add starcoder2, cosmopedia
Former-commit-id: 894d183214
2024-03-03 01:01:46 +08:00
hiyouga
d1e6e02461 fix #2649
Former-commit-id: 4e5fae2fac
2024-03-01 13:02:41 +08:00
hiyouga
82f26bc959 tiny fix
Former-commit-id: 396fd47947
2024-02-29 21:03:48 +08:00
hiyouga
72f7ec38db fix webui
Former-commit-id: 1bfa70ce8e
2024-02-29 20:09:09 +08:00
hiyouga
3787d13816 fix #2642
Former-commit-id: c0be617195
2024-02-29 18:32:54 +08:00
hiyouga
1853b5c172 tiny fix
Former-commit-id: 4a871e80e2
2024-02-29 17:28:50 +08:00
hiyouga
8487b66532 tiny fix and release
Former-commit-id: ece3b3737e
2024-02-29 00:46:47 +08:00
hoshi-hiyouga
d2b8645b60 Merge pull request #2575 from lungothrin/feature/chatter-with-role
support on fly test of tools

Former-commit-id: 7c87532476
2024-02-29 00:39:47 +08:00
hiyouga
60ceace8a0 fix #2629
Former-commit-id: 4cc2781efe
2024-02-29 00:37:29 +08:00
hiyouga
8e7d50dae4 release v0.5.3
Former-commit-id: fa5ab21ebc
2024-02-29 00:34:19 +08:00
hiyouga
57f85add58 update chatglm3 template
Former-commit-id: 38d8b2cef8
2024-02-28 21:11:23 +08:00
hiyouga
5abbca70d3 support DoRA, AWQ, AQLM #2512
Former-commit-id: cfefacaa37
2024-02-28 19:53:28 +08:00
Liang Ge
8d7fcd64b5 support on fly test of tools
Former-commit-id: 56b3918fda
2024-02-28 01:17:49 +08:00