BUAADreamer
|
110c2ce2a5
|
modify style
Former-commit-id: 3bffc1e1b8bcc4582cebea06d35e5146163c7bec
|
2024-04-25 21:27:48 +08:00 |
|
BUAADreamer
|
c425436676
|
modify style
Former-commit-id: 54b713d0c4ffdfc6a7faeb14471b58bb1cd8acf5
|
2024-04-25 21:15:16 +08:00 |
|
BUAADreamer
|
f74e640565
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 131d0bcd554dedd794add7eb3d7b1201cac80e7c
|
2024-04-25 20:02:50 +08:00 |
|
BUAADreamer
|
3c792174db
|
merge data part to the text stream
Former-commit-id: 7ee20286d9bcc2d5378bfd6bb02cd3648396d873
|
2024-04-25 19:19:59 +08:00 |
|
hiyouga
|
9aeb88c426
|
add export_device in webui #3333
Former-commit-id: 30ebd3652809d73941e0a5e4a8be11d989faf98d
|
2024-04-25 19:02:32 +08:00 |
|
BUAADreamer
|
00e2a272ef
|
merge model part to the text stream
Former-commit-id: b6fcb832ddaed4647d6f2b926f3dfccd47f3ea84
|
2024-04-25 08:20:41 +08:00 |
|
hiyouga
|
83404c4fa9
|
support new special token #3420
Former-commit-id: f5c6a47f5193ab3a6c137580992bdcce0b31fdd5
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
d2bb1b3a6b
|
reenable sdpa and fast tok by default
Former-commit-id: 9e00902dbedc71d55743d1bf237843506a557891
|
2024-04-24 02:18:44 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
b638c65519
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
hiyouga
|
9338f878a3
|
fix #3273
Former-commit-id: 3b20c89b342a068356ffc29c3724b645775c65db
|
2024-04-15 15:32:58 +08:00 |
|
hiyouga
|
117b67ea30
|
add moe aux loss control #3085
Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
e7f13098c6
|
support infer 4bit model on GPUs #3023
Former-commit-id: 950a9dab9055839990656b2b40956792b253573d
|
2024-04-01 17:34:04 +08:00 |
|
hiyouga
|
9a784fb4f3
|
fix kv cache
Former-commit-id: a9588e36e95bed896eea8d79ba7108447ff08f4b
|
2024-03-13 01:21:50 +08:00 |
|
hiyouga
|
6c1b4aec75
|
fix #2802
Former-commit-id: 1370db270d7ba1a20468abdb29193ce7534d1b4f
|
2024-03-12 17:08:34 +08:00 |
|
hiyouga
|
c9ed3fc3a4
|
fix #2782 #2798
Former-commit-id: eb3ab610610a0964bc8a1c9fa015805353f04c31
|
2024-03-12 15:53:29 +08:00 |
|
hiyouga
|
4881f4e631
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
48d4364586
|
fix chat engine, update webui
Former-commit-id: 8b32dddd7d883bae07735796a517927c79d1c33b
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
056d2d956a
|
support vllm
Former-commit-id: 889f6e910e654d8ec3922c2185042d737ffbf1c3
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
9a69cadab3
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hiyouga
|
46ee267cfc
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
596b6828cb
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
34bc0c22b1
|
lint
Former-commit-id: 6b1f89b6494e9b6b087fe90600617a3024e014e5
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
66e0e651b9
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
be61bfda93
|
add upcast_lmhead option
Former-commit-id: 7ef69a1697c11ff13e7503360e40ef36cfb1c345
|
2024-01-19 23:54:25 +08:00 |
|
hiyouga
|
7b1a56b96f
|
support export push_to_hub #2183
Former-commit-id: fac09da7123a500d255de74810a8d057fb5c5f07
|
2024-01-16 23:59:42 +08:00 |
|
hiyouga
|
90d279f39f
|
fix args
Former-commit-id: ff18f327a3dc96d9677ef32841e8f29ab2eeb7ef
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
af3f5b6e16
|
fix export format
Former-commit-id: 7c82bd396b9e6ff395850ad544d95cbf1b7557cd
|
2023-12-28 18:40:46 +08:00 |
|
hiyouga
|
921f593632
|
update loader
Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda
|
2023-12-24 19:10:23 +08:00 |
|
hiyouga
|
940403720a
|
update patcher
Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b
|
2023-12-23 15:24:27 +08:00 |
|
hiyouga
|
6faf9c35a9
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
13fd751a78
|
fix tokenizer for Yi chat models #1617 #1875
Former-commit-id: 9485692c8d367a0b25d3e653db413aa01cb9ad7d
|
2023-12-18 17:18:11 +08:00 |
|
hiyouga
|
f902b0d420
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hoshi-hiyouga
|
b9736c13e0
|
Merge branch 'main' into feat/support_ms
Former-commit-id: 698756dffb7d4e602b3e0cab66ef0a4befe7215c
|
2023-12-12 17:55:32 +08:00 |
|
xingjun.wang
|
ed26bb3d82
|
update args for MsDataset.load
Former-commit-id: c5f69357a167cbf99a93607177526e787419ea05
|
2023-12-12 13:02:54 +08:00 |
|
hiyouga
|
72bbd5bdef
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
9d125bf533
|
support ms
Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
7a3a0144a5
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
f0766a2ab0
|
add todo
Former-commit-id: 0bd884feb11736d0ab24ca19885151cb47d9dcd3
|
2023-11-10 14:38:18 +08:00 |
|
hiyouga
|
f5ba2190fb
|
fix ppo train and dpo eval
Former-commit-id: ced863031836632cb5920e22ae6991f251372118
|
2023-11-07 22:48:51 +08:00 |
|
hiyouga
|
2eb65d21ac
|
upgrade peft, fix #1088 #1411
Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
6da51565f5
|
reimplement neftune
Former-commit-id: efe9e5a194d3a9f052701d904715238816e4c09e
|
2023-10-22 16:15:08 +08:00 |
|
hiyouga
|
c2e84d4558
|
refactor export, fix #1190
Former-commit-id: 30e60e37023a7c4a2db033ffec0542efa3d5cdfb
|
2023-10-15 16:01:48 +08:00 |
|
hiyouga
|
27dd87c890
|
fix #1184
Former-commit-id: 5b069a967823e659dbc70b0d50361b3ad248087e
|
2023-10-14 19:20:11 +08:00 |
|
hiyouga
|
3198a7e5f4
|
refactor model_dtype, fix PPO trainer
Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
1c150995ae
|
fix layer norm dtype
Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
20130b486c
|
support LongLoRA
Former-commit-id: 0832ed37e7947d699f17375648a52f80752c2b6b
|
2023-09-27 21:55:50 +08:00 |
|
hiyouga
|
dc68c313ee
|
fix #944
Former-commit-id: 032245647848aaa4167086636b6c985268c5fee3
|
2023-09-21 19:51:02 +08:00 |
|
hiyouga
|
a402161631
|
support FlashAttention2
Former-commit-id: 23e56c5554b948d4f08ad87849b261eafd2c7890
|
2023-09-10 20:43:56 +08:00 |
|