BUAADreamer
|
a0be27fc9b
|
modify some style
Former-commit-id: 2d4ded535faa44b460f88028d49e4b8c8b430db5
|
2024-04-25 21:58:18 +08:00 |
|
BUAADreamer
|
40bfe767f7
|
modify style
Former-commit-id: 235b4113709fe788b4f1a1a3089ce8356940877b
|
2024-04-25 21:29:50 +08:00 |
|
BUAADreamer
|
e6cf251fb8
|
modify style
Former-commit-id: 1dcabafe72fe21c7f9122a6bc1a1ccc4f5d08fdd
|
2024-04-25 21:15:16 +08:00 |
|
BUAADreamer
|
cefdbb8728
|
add some
Former-commit-id: 94ad744941b5d305892c38169a8478d0fb1a3019
|
2024-04-25 21:08:32 +08:00 |
|
BUAADreamer
|
ad7d8a6525
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 68cdd9a020902e276efec6f431ecaf7cf3ded6da
|
2024-04-25 20:02:50 +08:00 |
|
BUAADreamer
|
b6d78b2a64
|
merge data part to the text stream
Former-commit-id: c6dd89918feb25fe8c07857162421ad1706f791f
|
2024-04-25 19:19:59 +08:00 |
|
hiyouga
|
84031c5bf9
|
add export_device in webui #3333
Former-commit-id: 3a7c1286ce7fdfe5db60b17720ce734408913b7f
|
2024-04-25 19:02:32 +08:00 |
|
BUAADreamer
|
4e032ff95e
|
merge model part to the text stream
Former-commit-id: 838eb87a961894b072bf79a60bbda63516670d6f
|
2024-04-25 08:20:41 +08:00 |
|
BUAADreamer
|
cb1c66a810
|
remove error
Former-commit-id: 8239907f578dc4b185cfb1789944db23d112fa2c
|
2024-04-25 01:01:59 +08:00 |
|
BUAADreamer
|
ff8d729b59
|
remove conflicts
Former-commit-id: 7ffee907995095220e93c282d8b57137c0e6c018
|
2024-04-25 00:34:22 +08:00 |
|
BUAADreamer
|
31bce63a10
|
add llava and instructblip
Former-commit-id: cfb485eddff0130422416b50c50e171fccc8103e
|
2024-04-25 00:22:43 +08:00 |
|
hiyouga
|
9a21785396
|
fix log level
Former-commit-id: 7fbe8add8f449358c9815c5ba8a2052a2d874dab
|
2024-04-24 23:42:59 +08:00 |
|
hiyouga
|
ce490c65ae
|
support new special token #3420
Former-commit-id: 297fb8ead3daf154152d9826b49bb4d769fbaaa9
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
7d89abb1fd
|
fix bug
Former-commit-id: 73ff9c834b069bf8b1bde75cc4daf996746050fa
|
2024-04-24 05:21:18 +08:00 |
|
hiyouga
|
612ba26c4c
|
fix bug
Former-commit-id: 8f44dce08aa809bb7d4ea0bd5f48ca1c56436044
|
2024-04-24 05:10:07 +08:00 |
|
hiyouga
|
1f99c367b3
|
remove redundant code
Former-commit-id: 667ce08b27df9452faee87348419f5f1f0c0cb2f
|
2024-04-24 05:02:18 +08:00 |
|
hiyouga
|
c0afc4074f
|
support unsloth generate
Former-commit-id: b1deb0a0b920645884e58f8206b1842c144c1c52
|
2024-04-24 04:46:53 +08:00 |
|
hiyouga
|
8465e54d38
|
refactor patcher
Former-commit-id: aa2b79eb23c60825e6601b0b8cc6b59e3f566b2d
|
2024-04-24 03:02:23 +08:00 |
|
hiyouga
|
80c8586534
|
reenable sdpa and fast tok by default
Former-commit-id: 07737a3d2d026c973ab964f948953d6ce0e1f2a9
|
2024-04-24 02:18:44 +08:00 |
|
hiyouga
|
34ecad4af8
|
fix #3347 #3387
Former-commit-id: 707f0b1d5d42b8e2c5b783c7783f65dfa9890a68
|
2024-04-24 01:30:16 +08:00 |
|
BUAADreamer
|
175b56bced
|
add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: 4dcb11eab7bbeac866043d2a7c748b8d06fbd243
|
2024-04-23 18:45:43 +08:00 |
|
hiyouga
|
79666c298d
|
fix #3365
Former-commit-id: a1d31ffc8cb7a6a477704efe779d485d83b8b9fb
|
2024-04-21 19:20:18 +08:00 |
|
hiyouga
|
ec81d45d27
|
fix mod stuff
Former-commit-id: f58425ab45727f7859583d4b9fda776715e27ff6
|
2024-04-21 18:11:10 +08:00 |
|
hoshi-hiyouga
|
7c63a9b5fd
|
Merge pull request #3338 from astramind-ai/main
Adding Mixture of Depth
Former-commit-id: d0273787be481fb2cbed993580f8239e63d74f7f
|
2024-04-21 18:05:52 +08:00 |
|
hoshi-hiyouga
|
e9b1aff447
|
fix #3348
Former-commit-id: 1fa287fd637aad0c5e8893046515a54bbff4c009
|
2024-04-20 10:34:09 +08:00 |
|
Marco
|
639297a5ef
|
Added Mixture of Depths
Former-commit-id: 620add7b9f634de1a711f7b87b16050adf735e9b
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
9aa62ffb57
|
fix #3324
Former-commit-id: 942362d0087345e468e0ae541dcca9b684d74d1a
|
2024-04-18 15:34:45 +08:00 |
|
hiyouga
|
0170ef83a6
|
fix #3316
Former-commit-id: c9a477322df82fecdb268ed385e3e0c376c0baeb
|
2024-04-17 22:54:34 +08:00 |
|
hoshi-hiyouga
|
496396b3bc
|
Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
Former-commit-id: 4d660c5ade384df4444fa0543a39edce6220903d
|
2024-04-16 17:32:16 +08:00 |
|
hoshi-hiyouga
|
b92f690190
|
Update utils.py
Former-commit-id: 38a56706e0f52297501d351d38b51bee73e881dc
|
2024-04-16 17:29:30 +08:00 |
|
hoshi-hiyouga
|
48fb0be1b9
|
Update patcher.py
Former-commit-id: a950f3b81de701f5f23ce3efa60ff0382bb40dfe
|
2024-04-16 17:29:19 +08:00 |
|
hoshi-hiyouga
|
ce56ff22af
|
Update adapter.py
Former-commit-id: 750cdf2e74097c8775d03ddf55646cc14d4a686f
|
2024-04-16 17:28:12 +08:00 |
|
Jonery
|
b3260c7456
|
resolve gradient checkpointing issue.
Former-commit-id: 7ecb61822b37f5d71060d696495830ff98edaa06
|
2024-04-16 12:05:27 +08:00 |
|
hiyouga
|
b40f266617
|
support unsloth 2024.4
Former-commit-id: 7dc72fb58cb988418323f63821a21a184ecf0718
|
2024-04-16 00:25:03 +08:00 |
|
hiyouga
|
bd2b758b48
|
add codegemma
Former-commit-id: 6543f3d4496218f7f90c582cb6aa8c852d716cbf
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
2dc3343b1c
|
support cohere commandR #3184
Former-commit-id: e0dbac28450a0e1e0b84e1577ef785fc762c0b46
|
2024-04-15 23:26:42 +08:00 |
|
Jonery
|
025f329445
|
Feature BAdam
Former-commit-id: 06c8908d3fe48907ddb585c5fa15677fc5416f94
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
fb385b8c26
|
update examples
Former-commit-id: cce52351b54f70904f33902d9c17411134f9f6eb
|
2024-04-15 22:14:34 +08:00 |
|
hoshi-hiyouga
|
1bdf7e4b9d
|
Merge pull request #3276 from liu-zichen/fix_mixtral
fix: turn on output_router_logits of mixtral
Former-commit-id: 0e0942d388bdb0122001a7f8e081315059d5d327
|
2024-04-15 15:38:16 +08:00 |
|
hiyouga
|
ceccad3419
|
fix #3273
Former-commit-id: efc345c4b0095ec959ea23bbe54c344278780cbe
|
2024-04-15 15:32:58 +08:00 |
|
liuzc
|
11f4afc5ad
|
fix: mixtral output_router_logits
Former-commit-id: 9f4fe623866b10b30c6418dee116b36671274f9f
|
2024-04-15 12:11:49 +08:00 |
|
hiyouga
|
431e9804ee
|
release v0.6.2
Former-commit-id: 9d4c949461d232a959c14859ae7fef191faab711
|
2024-04-11 20:08:51 +08:00 |
|
hoshi-hiyouga
|
77d16ada1e
|
Update adapter.py
Former-commit-id: 98bc97d8d218182c026e9f57bbcbf40ab1e0bc87
|
2024-04-10 00:57:51 +08:00 |
|
hoshi-hiyouga
|
e5b4cb62e0
|
Update adapter.py
Former-commit-id: 2111b586b648caa150a8e41877c7fede75911da8
|
2024-04-10 00:57:30 +08:00 |
|
Erich Schubert
|
3dccd3c67e
|
Pass additional_target to unsloth
Fixes #3200
Former-commit-id: b5eefe5c4c084b63a12b023cae877fcd1914d4fc
|
2024-04-09 17:53:40 +02:00 |
|
hiyouga
|
0e08c209c4
|
fix quant infer and qwen2moe
Former-commit-id: 7f6c2486b83e1d2c96a2314bfa8e1519ca5f574e
|
2024-04-09 17:12:59 +08:00 |
|
hiyouga
|
2ecf2bcbf0
|
fix resize vocab at inference #3022
Former-commit-id: 148bda353f0b53af022c51da9a9e59a56f341510
|
2024-04-03 18:14:24 +08:00 |
|
hiyouga
|
bf5ffeeae0
|
simplify readme
Former-commit-id: 92dab8a90bdd82a72a06559943467b56dde12c71
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
f4be51f356
|
add moe aux loss control #3085
Former-commit-id: b267aeb53fc49d2eeb0f3fc5ebe55e643f5db377
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
c7104f8fab
|
fix #3022
Former-commit-id: 9ddbe2866a4a4433d7635659a5635d16c59800b1
|
2024-04-02 13:58:39 +08:00 |
|