Commit Graph

198 Commits

Author SHA1 Message Date
BUAADreamer
a0be27fc9b modify some style
Former-commit-id: 2d4ded535f
2024-04-25 21:58:18 +08:00
BUAADreamer
40bfe767f7 modify style
Former-commit-id: 235b411370
2024-04-25 21:29:50 +08:00
BUAADreamer
e6cf251fb8 modify style
Former-commit-id: 1dcabafe72
2024-04-25 21:15:16 +08:00
BUAADreamer
cefdbb8728 add some
Former-commit-id: 94ad744941
2024-04-25 21:08:32 +08:00
BUAADreamer
ad7d8a6525 Merge branch 'hiyouga:main' into main
Former-commit-id: 68cdd9a020
2024-04-25 20:02:50 +08:00
BUAADreamer
b6d78b2a64 merge data part to the text stream
Former-commit-id: c6dd89918f
2024-04-25 19:19:59 +08:00
hiyouga
84031c5bf9 add export_device in webui #3333
Former-commit-id: 3a7c1286ce
2024-04-25 19:02:32 +08:00
BUAADreamer
4e032ff95e merge model part to the text stream
Former-commit-id: 838eb87a96
2024-04-25 08:20:41 +08:00
BUAADreamer
cb1c66a810 remove error
Former-commit-id: 8239907f57
2024-04-25 01:01:59 +08:00
BUAADreamer
ff8d729b59 remove conflicts
Former-commit-id: 7ffee90799
2024-04-25 00:34:22 +08:00
BUAADreamer
31bce63a10 add llava and instructblip
Former-commit-id: cfb485eddf
2024-04-25 00:22:43 +08:00
hiyouga
9a21785396 fix log level
Former-commit-id: 7fbe8add8f
2024-04-24 23:42:59 +08:00
hiyouga
ce490c65ae support new special token #3420
Former-commit-id: 297fb8ead3
2024-04-24 23:39:31 +08:00
hiyouga
7d89abb1fd fix bug
Former-commit-id: 73ff9c834b
2024-04-24 05:21:18 +08:00
hiyouga
612ba26c4c fix bug
Former-commit-id: 8f44dce08a
2024-04-24 05:10:07 +08:00
hiyouga
1f99c367b3 remove redundant code
Former-commit-id: 667ce08b27
2024-04-24 05:02:18 +08:00
hiyouga
c0afc4074f support unsloth generate
Former-commit-id: b1deb0a0b9
2024-04-24 04:46:53 +08:00
hiyouga
8465e54d38 refactor patcher
Former-commit-id: aa2b79eb23
2024-04-24 03:02:23 +08:00
hiyouga
80c8586534 reenable sdpa and fast tok by default
Former-commit-id: 07737a3d2d
2024-04-24 02:18:44 +08:00
hiyouga
34ecad4af8 fix #3347 #3387
Former-commit-id: 707f0b1d5d
2024-04-24 01:30:16 +08:00
BUAADreamer
175b56bced add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: 4dcb11eab7
2024-04-23 18:45:43 +08:00
hiyouga
79666c298d fix #3365
Former-commit-id: a1d31ffc8c
2024-04-21 19:20:18 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45
2024-04-21 18:11:10 +08:00
hoshi-hiyouga
7c63a9b5fd Merge pull request #3338 from astramind-ai/main
Adding Mixture of Depth
Former-commit-id: d0273787be
2024-04-21 18:05:52 +08:00
hoshi-hiyouga
e9b1aff447 fix #3348
Former-commit-id: 1fa287fd63
2024-04-20 10:34:09 +08:00
Marco
639297a5ef Added Mixture of Depths
Former-commit-id: 620add7b9f
2024-04-18 20:31:24 +02:00
hiyouga
9aa62ffb57 fix #3324
Former-commit-id: 942362d008
2024-04-18 15:34:45 +08:00
hiyouga
0170ef83a6 fix #3316
Former-commit-id: c9a477322d
2024-04-17 22:54:34 +08:00
hoshi-hiyouga
496396b3bc Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
Former-commit-id: 4d660c5ade
2024-04-16 17:32:16 +08:00
hoshi-hiyouga
b92f690190 Update utils.py
Former-commit-id: 38a56706e0
2024-04-16 17:29:30 +08:00
hoshi-hiyouga
48fb0be1b9 Update patcher.py
Former-commit-id: a950f3b81d
2024-04-16 17:29:19 +08:00
hoshi-hiyouga
ce56ff22af Update adapter.py
Former-commit-id: 750cdf2e74
2024-04-16 17:28:12 +08:00
Jonery
b3260c7456 resolve gradient checkpointing issue.
Former-commit-id: 7ecb61822b
2024-04-16 12:05:27 +08:00
hiyouga
b40f266617 support unsloth 2024.4
Former-commit-id: 7dc72fb58c
2024-04-16 00:25:03 +08:00
hiyouga
bd2b758b48 add codegemma
Former-commit-id: 6543f3d449
2024-04-16 00:11:15 +08:00
hiyouga
2dc3343b1c support cohere commandR #3184
Former-commit-id: e0dbac2845
2024-04-15 23:26:42 +08:00
Jonery
025f329445 Feature BAdam
Former-commit-id: 06c8908d3f
2024-04-15 23:15:27 +08:00
hiyouga
fb385b8c26 update examples
Former-commit-id: cce52351b5
2024-04-15 22:14:34 +08:00
hoshi-hiyouga
1bdf7e4b9d Merge pull request #3276 from liu-zichen/fix_mixtral
fix: turn on output_router_logits of mixtral
Former-commit-id: 0e0942d388
2024-04-15 15:38:16 +08:00
hiyouga
ceccad3419 fix #3273
Former-commit-id: efc345c4b0
2024-04-15 15:32:58 +08:00
liuzc
11f4afc5ad fix: mixtral output_router_logits
Former-commit-id: 9f4fe62386
2024-04-15 12:11:49 +08:00
hiyouga
431e9804ee release v0.6.2
Former-commit-id: 9d4c949461
2024-04-11 20:08:51 +08:00
hoshi-hiyouga
77d16ada1e Update adapter.py
Former-commit-id: 98bc97d8d2
2024-04-10 00:57:51 +08:00
hoshi-hiyouga
e5b4cb62e0 Update adapter.py
Former-commit-id: 2111b586b6
2024-04-10 00:57:30 +08:00
Erich Schubert
3dccd3c67e Pass additional_target to unsloth
Fixes #3200
Former-commit-id: b5eefe5c4c
2024-04-09 17:53:40 +02:00
hiyouga
0e08c209c4 fix quant infer and qwen2moe
Former-commit-id: 7f6c2486b8
2024-04-09 17:12:59 +08:00
hiyouga
2ecf2bcbf0 fix resize vocab at inference #3022
Former-commit-id: 148bda353f
2024-04-03 18:14:24 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90b
2024-04-02 20:07:43 +08:00
hiyouga
f4be51f356 add moe aux loss control #3085
Former-commit-id: b267aeb53f
2024-04-02 14:26:31 +08:00
hiyouga
c7104f8fab fix #3022
Former-commit-id: 9ddbe2866a
2024-04-02 13:58:39 +08:00