BUAADreamer | 8b2a735c14 | modify some style (Former-commit-id: b016e6a671a2f228f0bdd9b8d5995b4669609655) | 2024-04-25 21:58:18 +08:00 |
BUAADreamer | 058ed5e607 | modify style (Former-commit-id: c1f1df99e4dc3d0aadf1207b4e9a16218187fd5a) | 2024-04-25 21:29:50 +08:00 |
BUAADreamer | c425436676 | modify style (Former-commit-id: 54b713d0c4ffdfc6a7faeb14471b58bb1cd8acf5) | 2024-04-25 21:15:16 +08:00 |
BUAADreamer | dbd905438b | add some (Former-commit-id: 8d035a849c4a441d457791aab073861adf69a09f) | 2024-04-25 21:08:32 +08:00 |
BUAADreamer | f74e640565 | Merge branch 'hiyouga:main' into main (Former-commit-id: 131d0bcd554dedd794add7eb3d7b1201cac80e7c) | 2024-04-25 20:02:50 +08:00 |
BUAADreamer | 3c792174db | merge data part to the text stream (Former-commit-id: 7ee20286d9bcc2d5378bfd6bb02cd3648396d873) | 2024-04-25 19:19:59 +08:00 |
hiyouga | 9aeb88c426 | add export_device in webui #3333 (Former-commit-id: 30ebd3652809d73941e0a5e4a8be11d989faf98d) | 2024-04-25 19:02:32 +08:00 |
BUAADreamer | 00e2a272ef | merge model part to the text stream (Former-commit-id: b6fcb832ddaed4647d6f2b926f3dfccd47f3ea84) | 2024-04-25 08:20:41 +08:00 |
BUAADreamer | 5142349661 | remove error (Former-commit-id: 2bcd1c7dc3595f17ae4e2c4475196cc2d03d0e75) | 2024-04-25 01:01:59 +08:00 |
BUAADreamer | 6c1db2d012 | remove conflicts (Former-commit-id: f8b637eb76cba7ec229e2978068805ad1cca8adb) | 2024-04-25 00:34:22 +08:00 |
BUAADreamer | 12c51655ce | add llava and instructblip (Former-commit-id: 142fb6f4541a1acfefe66ff2574dabde53b00c06) | 2024-04-25 00:22:43 +08:00 |
hiyouga | 21fac4c98c | fix log level (Former-commit-id: 8d21302f6201b3f33c10f61f3559bd95be3363c2) | 2024-04-24 23:42:59 +08:00 |
hiyouga | 83404c4fa9 | support new special token #3420 (Former-commit-id: f5c6a47f5193ab3a6c137580992bdcce0b31fdd5) | 2024-04-24 23:39:31 +08:00 |
hiyouga | 94c8219575 | fix bug (Former-commit-id: 38e164fe4aaea6f0baf121a720291ca42643ba8c) | 2024-04-24 05:21:18 +08:00 |
hiyouga | ad24a2a0c9 | fix bug (Former-commit-id: 271c24d2c82d645fa9072e6de94ca38f20411537) | 2024-04-24 05:10:07 +08:00 |
hiyouga | c05027d14a | remove redundant code (Former-commit-id: 4a7a7ad2bcdc493458084f5f3d384239228b7d5a) | 2024-04-24 05:02:18 +08:00 |
hiyouga | 5420905a2e | support unsloth generate (Former-commit-id: 0ef1ad9f505dba71db9342f524cc3a7565e5e09e) | 2024-04-24 04:46:53 +08:00 |
hiyouga | 03f2e3284a | refactor patcher (Former-commit-id: 263cfe1294f5c3188f5e8d65791f35ee0d87315a) | 2024-04-24 03:02:23 +08:00 |
hiyouga | d2bb1b3a6b | reenable sdpa and fast tok by default (Former-commit-id: 9e00902dbedc71d55743d1bf237843506a557891) | 2024-04-24 02:18:44 +08:00 |
hiyouga | 35c4a2c212 | fix #3347 #3387 (Former-commit-id: c253c18185a29b59190f3e0ed236c2bb4c788085) | 2024-04-24 01:30:16 +08:00 |
BUAADreamer | ab6dc0ea30 | add multimodal LLM BLIP-2 and InstructBLIP (Former-commit-id: a730f89a972f1a9d37c718c716f199cb8d4903b2) | 2024-04-23 18:45:43 +08:00 |
hiyouga | 1d341dcd83 | fix #3365 (Former-commit-id: 415ce41e8fa887e980e5bd575c8e95bd4076b90b) | 2024-04-21 19:20:18 +08:00 |
hiyouga | f8e219dc81 | fix mod stuff (Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5) | 2024-04-21 18:11:10 +08:00 |
hoshi-hiyouga | 3365cc8cf0 | Merge pull request #3338 from astramind-ai/main; Adding Mixture of Depth (Former-commit-id: 4da2ece53353b63e672ff529d6beba41ff710c14) | 2024-04-21 18:05:52 +08:00 |
hoshi-hiyouga | 3a5e68b7d9 | fix #3348 (Former-commit-id: aa5e921c00f60074eceb2f9d4d8837cc713edba6) | 2024-04-20 10:34:09 +08:00 |
Marco | 44cda2eece | Added Mixture of Depths (Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345) | 2024-04-18 20:31:24 +02:00 |
hiyouga | 9e1bd6420d | fix #3324 (Former-commit-id: 5e710c4ac331f3400534d33b2646c4108c898d98) | 2024-04-18 15:34:45 +08:00 |
hiyouga | bee796f6b5 | fix #3316 (Former-commit-id: 7395e9e90a209228ff563ab54319955608850fc3) | 2024-04-17 22:54:34 +08:00 |
hoshi-hiyouga | 42084e08ae | Merge pull request #3287 from Ledzy/badam; [Feature] Add BAdam algorithm (Former-commit-id: 10a5e1e65b34b03e5ca2a41bf6ded09a3fb25f0c) | 2024-04-16 17:32:16 +08:00 |
hoshi-hiyouga | c7c216069c | Update utils.py (Former-commit-id: 7edf4dbed88b8034282f14fd6e0cb6f7f9e5f805) | 2024-04-16 17:29:30 +08:00 |
hoshi-hiyouga | cde9d1b917 | Update patcher.py (Former-commit-id: 494e6a1e05b38f5ff61d83327303614f53c92e64) | 2024-04-16 17:29:19 +08:00 |
hoshi-hiyouga | 96213f04b0 | Update adapter.py (Former-commit-id: 8f7b75b26f020d8ae85baab7b082475c3bfeb512) | 2024-04-16 17:28:12 +08:00 |
Jonery | 6dd6b3e396 | resolve gradient checkpointing issue. (Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae) | 2024-04-16 12:05:27 +08:00 |
hiyouga | efa808069a | support unsloth 2024.4 (Former-commit-id: 14a83f8bc4fe44783252378fce59198194a96bb8) | 2024-04-16 00:25:03 +08:00 |
hiyouga | b5c5283dd6 | add codegemma (Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3) | 2024-04-16 00:11:15 +08:00 |
hiyouga | b638c65519 | support cohere commandR #3184 (Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977) | 2024-04-15 23:26:42 +08:00 |
Jonery | d4d471450f | Feature BAdam (Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed) | 2024-04-15 23:15:27 +08:00 |
hiyouga | 276f2cb24e | update examples (Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49) | 2024-04-15 22:14:34 +08:00 |
hoshi-hiyouga | 0c80751e87 | Merge pull request #3276 from liu-zichen/fix_mixtral; fix: turn on output_router_logits of mixtral (Former-commit-id: 07bbaf5c67d00a152e5304e81b15fd9189e7bb99) | 2024-04-15 15:38:16 +08:00 |
hiyouga | 9338f878a3 | fix #3273 (Former-commit-id: 3b20c89b342a068356ffc29c3724b645775c65db) | 2024-04-15 15:32:58 +08:00 |
liuzc | fde3d91242 | fix: mixtral output_router_logits (Former-commit-id: ab3171ea97ec968b972287287ef9ee2502c6d37c) | 2024-04-15 12:11:49 +08:00 |
hiyouga | 7468f2535c | release v0.6.2 (Former-commit-id: f92ad0a62d957b595f6a76a5403216b163eb3d17) | 2024-04-11 20:08:51 +08:00 |
hoshi-hiyouga | 7856f98965 | Update adapter.py (Former-commit-id: 720fde3683529ed7e08ac27c7c4598c6bdc30d44) | 2024-04-10 00:57:51 +08:00 |
hoshi-hiyouga | e25ddef08c | Update adapter.py (Former-commit-id: a84b8d17dbf221259212e81931d80bcdd6284ad7) | 2024-04-10 00:57:30 +08:00 |
Erich Schubert | 95a4589bbf | Pass additional_target to unsloth; Fixes #3200 (Former-commit-id: f8f87f5b0549cba6a011749c42064047f82ba577) | 2024-04-09 17:53:40 +02:00 |
hiyouga | 566d71b7a9 | fix quant infer and qwen2moe (Former-commit-id: b75d16767f35c36e2cf2aaab8a3844135085bccf) | 2024-04-09 17:12:59 +08:00 |
hiyouga | 1348f7d860 | fix resize vocab at inference #3022 (Former-commit-id: c243720b89eec0af2872fa3c7980a0026d893f4d) | 2024-04-03 18:14:24 +08:00 |
hiyouga | b12176d818 | simplify readme (Former-commit-id: 0da6ec2d516326fe9c7583ba71cd1778eb838178) | 2024-04-02 20:07:43 +08:00 |
hiyouga | 117b67ea30 | add moe aux loss control #3085 (Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81) | 2024-04-02 14:26:31 +08:00 |
hiyouga | 03e20bb5c6 | fix #3022 (Former-commit-id: dac2f617bda9470ac8d85c7e9def09cc04970506) | 2024-04-02 13:58:39 +08:00 |