hoshi-hiyouga
|
651fdb819c
|
Update loader.py
Former-commit-id: 72d4817a15f6916706828ea2a61d808183c23773
|
2024-04-26 03:22:40 +08:00 |
|
BUAADreamer
|
62d9d20686
|
modify some style
Former-commit-id: b016e6a671a2f228f0bdd9b8d5995b4669609655
|
2024-04-25 21:58:18 +08:00 |
|
BUAADreamer
|
633e76a1bd
|
modify style
Former-commit-id: c1f1df99e4dc3d0aadf1207b4e9a16218187fd5a
|
2024-04-25 21:29:50 +08:00 |
|
BUAADreamer
|
bb46c64dd6
|
modify style
Former-commit-id: 54b713d0c4ffdfc6a7faeb14471b58bb1cd8acf5
|
2024-04-25 21:15:16 +08:00 |
|
BUAADreamer
|
c11c9711e7
|
add some
Former-commit-id: 8d035a849c4a441d457791aab073861adf69a09f
|
2024-04-25 21:08:32 +08:00 |
|
BUAADreamer
|
69fb4351f5
|
merge data part to the text stream
Former-commit-id: 7ee20286d9bcc2d5378bfd6bb02cd3648396d873
|
2024-04-25 19:19:59 +08:00 |
|
BUAADreamer
|
d6d109a282
|
merge model part to the text stream
Former-commit-id: b6fcb832ddaed4647d6f2b926f3dfccd47f3ea84
|
2024-04-25 08:20:41 +08:00 |
|
BUAADreamer
|
15c8b9ac37
|
remove error
Former-commit-id: 2bcd1c7dc3595f17ae4e2c4475196cc2d03d0e75
|
2024-04-25 01:01:59 +08:00 |
|
BUAADreamer
|
8dda627bd8
|
remove conflicts
Former-commit-id: f8b637eb76cba7ec229e2978068805ad1cca8adb
|
2024-04-25 00:34:22 +08:00 |
|
BUAADreamer
|
641c97ba74
|
add llava and instructblip
Former-commit-id: 142fb6f4541a1acfefe66ff2574dabde53b00c06
|
2024-04-25 00:22:43 +08:00 |
|
hiyouga
|
3b83811b99
|
support new special token #3420
Former-commit-id: f5c6a47f5193ab3a6c137580992bdcce0b31fdd5
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
e66b8ade4d
|
support unsloth generate
Former-commit-id: 0ef1ad9f505dba71db9342f524cc3a7565e5e09e
|
2024-04-24 04:46:53 +08:00 |
|
hiyouga
|
460da206f6
|
refactor patcher
Former-commit-id: 263cfe1294f5c3188f5e8d65791f35ee0d87315a
|
2024-04-24 03:02:23 +08:00 |
|
hiyouga
|
83b8bc8937
|
fix #3347 #3387
Former-commit-id: c253c18185a29b59190f3e0ed236c2bb4c788085
|
2024-04-24 01:30:16 +08:00 |
|
BUAADreamer
|
20e05970ab
|
add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: a730f89a972f1a9d37c718c716f199cb8d4903b2
|
2024-04-23 18:45:43 +08:00 |
|
hiyouga
|
366c0eb1c5
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
hoshi-hiyouga
|
5c3922713a
|
Merge pull request #3338 from astramind-ai/main
Adding Mixture of Depth
Former-commit-id: 4da2ece53353b63e672ff529d6beba41ff710c14
|
2024-04-21 18:05:52 +08:00 |
|
hoshi-hiyouga
|
7279a7014c
|
fix #3348
Former-commit-id: aa5e921c00f60074eceb2f9d4d8837cc713edba6
|
2024-04-20 10:34:09 +08:00 |
|
Marco
|
68dbd5d220
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
ba4efe3ff6
|
support unsloth 2024.4
Former-commit-id: 14a83f8bc4fe44783252378fce59198194a96bb8
|
2024-04-16 00:25:03 +08:00 |
|
hiyouga
|
2aa1d1476e
|
add codegemma
Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
19874e39ee
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
hiyouga
|
be206df674
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
f8609236ab
|
fix quant infer and qwen2moe
Former-commit-id: b75d16767f35c36e2cf2aaab8a3844135085bccf
|
2024-04-09 17:12:59 +08:00 |
|
hiyouga
|
d97150c571
|
fix resize vocab at inference #3022
Former-commit-id: c243720b89eec0af2872fa3c7980a0026d893f4d
|
2024-04-03 18:14:24 +08:00 |
|
hiyouga
|
76ba7b51c1
|
add moe aux loss control #3085
Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
6d7c325f19
|
fix #2928
Former-commit-id: 9558ee87bc7260a6596385aaa375df544862bfa9
|
2024-03-24 00:34:54 +08:00 |
|
hiyouga
|
4ef67ed4dd
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
hiyouga
|
7538d8e726
|
fix #2732
Former-commit-id: bc39ad1d102b91d5417daa38b8a581e1e1ab2af9
|
2024-03-09 22:37:16 +08:00 |
|
hiyouga
|
56565bdbd4
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
e16912b0c0
|
fix #2756 , patch #2746
Former-commit-id: 627d1c91e675f1d9ebf47bad123cbbf29821da4d
|
2024-03-09 02:01:26 +08:00 |
|
hoshi-hiyouga
|
5469111c65
|
Merge pull request #2746 from stephen-nju/main
fix deepspeed ppo RuntimeError
Former-commit-id: 656c653f0c628f9494b4d7ae12e60c8eeec1ea7a
|
2024-03-09 01:37:00 +08:00 |
|
hiyouga
|
1dd3f17f79
|
fix aqlm version
Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e
|
2024-03-09 00:09:09 +08:00 |
|
stephen
|
eb1ad9f161
|
fix ppo runtime error
Former-commit-id: 14e2f221e3e720075e59065a3dc42aa4d993a8b6
|
2024-03-08 11:48:26 +08:00 |
|
hiyouga
|
a02d518edc
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hiyouga
|
4aa6db78fb
|
fix version checking
Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
c60b53a164
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
67bb861040
|
optimize aqlm training
Former-commit-id: 8b42660e4039b3d6475f502f397686ba6b140627
|
2024-03-05 18:35:41 +08:00 |
|
hiyouga
|
10845a2fe7
|
fix #2649
Former-commit-id: 1c850de660c671d92f0bc63f230d338b60b7c0bd
|
2024-03-01 13:02:41 +08:00 |
|
hiyouga
|
c5cbe2c6f9
|
fix #2642
Former-commit-id: d8435e7f1850532310e1bee069b45f38cd666e48
|
2024-02-29 18:32:54 +08:00 |
|
hiyouga
|
443d85d80f
|
release v0.5.3
Former-commit-id: f6bc89581b3cd129448da2defc23848de6f494ed
|
2024-02-29 00:34:19 +08:00 |
|
hiyouga
|
98a0c8e8bf
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: 6614cc1f08aa944db083e27e451bbdd733f7dd97
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
562b9d0167
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
younesbelkada
|
4b195603c9
|
add v1 hf tags
Former-commit-id: a29cc9f4472c95cd6a43ea350ab728e0a8069c6e
|
2024-02-13 05:58:49 +00:00 |
|
hiyouga
|
a52372df01
|
add option to disable version check
Former-commit-id: fd769cb2de696aee3c5e882237e16eace6a9d675
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
4adb4477bc
|
bump up transformers version
Former-commit-id: 82f4d4301ed9f31b160d6313a1d2d44a22865f4d
|
2024-02-04 00:01:16 +08:00 |
|
hiyouga
|
c0e4eebf17
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
a9fc7dbfa6
|
support function calling
Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
4f6c841d52
|
fix rm server
Former-commit-id: 81bc1638682a9fd01518f9f25250a6b584d2a9e6
|
2024-01-03 15:30:46 +08:00 |
|
hiyouga
|
e058832486
|
fix version
Former-commit-id: dd7500b65d0d548441eece101b60d51fa619cc0f
|
2023-12-29 04:53:36 +08:00 |
|