hiyouga
|
4dcd47100d
|
fix llava rlhf
Former-commit-id: f6863cbbcbf960d6481296c6cae3e40fd70e4e14
|
2024-04-28 03:01:49 +08:00 |
|
hoshi-hiyouga
|
a6f6b406b3
|
Update loader.py
Former-commit-id: 72d4817a15f6916706828ea2a61d808183c23773
|
2024-04-26 03:22:40 +08:00 |
|
BUAADreamer
|
8b2a735c14
|
modify some style
Former-commit-id: b016e6a671a2f228f0bdd9b8d5995b4669609655
|
2024-04-25 21:58:18 +08:00 |
|
BUAADreamer
|
058ed5e607
|
modify style
Former-commit-id: c1f1df99e4dc3d0aadf1207b4e9a16218187fd5a
|
2024-04-25 21:29:50 +08:00 |
|
BUAADreamer
|
c425436676
|
modify style
Former-commit-id: 54b713d0c4ffdfc6a7faeb14471b58bb1cd8acf5
|
2024-04-25 21:15:16 +08:00 |
|
BUAADreamer
|
dbd905438b
|
add some
Former-commit-id: 8d035a849c4a441d457791aab073861adf69a09f
|
2024-04-25 21:08:32 +08:00 |
|
BUAADreamer
|
3c792174db
|
merge data part to the text stream
Former-commit-id: 7ee20286d9bcc2d5378bfd6bb02cd3648396d873
|
2024-04-25 19:19:59 +08:00 |
|
BUAADreamer
|
00e2a272ef
|
merge model part to the text stream
Former-commit-id: b6fcb832ddaed4647d6f2b926f3dfccd47f3ea84
|
2024-04-25 08:20:41 +08:00 |
|
BUAADreamer
|
5142349661
|
remove error
Former-commit-id: 2bcd1c7dc3595f17ae4e2c4475196cc2d03d0e75
|
2024-04-25 01:01:59 +08:00 |
|
BUAADreamer
|
6c1db2d012
|
remove conflicts
Former-commit-id: f8b637eb76cba7ec229e2978068805ad1cca8adb
|
2024-04-25 00:34:22 +08:00 |
|
BUAADreamer
|
12c51655ce
|
add llava and instructblip
Former-commit-id: 142fb6f4541a1acfefe66ff2574dabde53b00c06
|
2024-04-25 00:22:43 +08:00 |
|
hiyouga
|
83404c4fa9
|
support new special token #3420
Former-commit-id: f5c6a47f5193ab3a6c137580992bdcce0b31fdd5
|
2024-04-24 23:39:31 +08:00 |
|
hiyouga
|
5420905a2e
|
support unsloth generate
Former-commit-id: 0ef1ad9f505dba71db9342f524cc3a7565e5e09e
|
2024-04-24 04:46:53 +08:00 |
|
hiyouga
|
03f2e3284a
|
refactor patcher
Former-commit-id: 263cfe1294f5c3188f5e8d65791f35ee0d87315a
|
2024-04-24 03:02:23 +08:00 |
|
hiyouga
|
35c4a2c212
|
fix #3347 #3387
Former-commit-id: c253c18185a29b59190f3e0ed236c2bb4c788085
|
2024-04-24 01:30:16 +08:00 |
|
BUAADreamer
|
ab6dc0ea30
|
add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: a730f89a972f1a9d37c718c716f199cb8d4903b2
|
2024-04-23 18:45:43 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
hoshi-hiyouga
|
3365cc8cf0
|
Merge pull request #3338 from astramind-ai/main
Adding Mixture of Depth
Former-commit-id: 4da2ece53353b63e672ff529d6beba41ff710c14
|
2024-04-21 18:05:52 +08:00 |
|
hoshi-hiyouga
|
3a5e68b7d9
|
fix #3348
Former-commit-id: aa5e921c00f60074eceb2f9d4d8837cc713edba6
|
2024-04-20 10:34:09 +08:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hiyouga
|
efa808069a
|
support unsloth 2024.4
Former-commit-id: 14a83f8bc4fe44783252378fce59198194a96bb8
|
2024-04-16 00:25:03 +08:00 |
|
hiyouga
|
b5c5283dd6
|
add codegemma
Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
b638c65519
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
hiyouga
|
276f2cb24e
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
566d71b7a9
|
fix quant infer and qwen2moe
Former-commit-id: b75d16767f35c36e2cf2aaab8a3844135085bccf
|
2024-04-09 17:12:59 +08:00 |
|
hiyouga
|
1348f7d860
|
fix resize vocab at inference #3022
Former-commit-id: c243720b89eec0af2872fa3c7980a0026d893f4d
|
2024-04-03 18:14:24 +08:00 |
|
hiyouga
|
117b67ea30
|
add moe aux loss control #3085
Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81
|
2024-04-02 14:26:31 +08:00 |
|
hiyouga
|
c548ad5e69
|
fix #2928
Former-commit-id: 9558ee87bc7260a6596385aaa375df544862bfa9
|
2024-03-24 00:34:54 +08:00 |
|
hiyouga
|
46f99ff277
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
hiyouga
|
c635bbe465
|
fix #2732
Former-commit-id: bc39ad1d102b91d5417daa38b8a581e1e1ab2af9
|
2024-03-09 22:37:16 +08:00 |
|
hiyouga
|
4881f4e631
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
43b2ede0f8
|
fix #2756 , patch #2746
Former-commit-id: 627d1c91e675f1d9ebf47bad123cbbf29821da4d
|
2024-03-09 02:01:26 +08:00 |
|
hoshi-hiyouga
|
2f095e2017
|
Merge pull request #2746 from stephen-nju/main
fix deepspeed ppo RuntimeError
Former-commit-id: 656c653f0c628f9494b4d7ae12e60c8eeec1ea7a
|
2024-03-09 01:37:00 +08:00 |
|
hiyouga
|
9b97b23ce7
|
fix aqlm version
Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e
|
2024-03-09 00:09:09 +08:00 |
|
stephen
|
18cfd5f349
|
fix ppo runtime error
Former-commit-id: 14e2f221e3e720075e59065a3dc42aa4d993a8b6
|
2024-03-08 11:48:26 +08:00 |
|
hiyouga
|
9a69cadab3
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hiyouga
|
73d9dfc7ab
|
fix version checking
Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a
|
2024-03-06 14:51:51 +08:00 |
|
hiyouga
|
46ee267cfc
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
a10bead9b5
|
optimize aqlm training
Former-commit-id: 8b42660e4039b3d6475f502f397686ba6b140627
|
2024-03-05 18:35:41 +08:00 |
|
hiyouga
|
59a9a5994e
|
fix #2649
Former-commit-id: 1c850de660c671d92f0bc63f230d338b60b7c0bd
|
2024-03-01 13:02:41 +08:00 |
|
hiyouga
|
88fddb879d
|
fix #2642
Former-commit-id: d8435e7f1850532310e1bee069b45f38cd666e48
|
2024-02-29 18:32:54 +08:00 |
|
hiyouga
|
544e7a491b
|
release v0.5.3
Former-commit-id: f6bc89581b3cd129448da2defc23848de6f494ed
|
2024-02-29 00:34:19 +08:00 |
|
hiyouga
|
b392e6cfb9
|
support DoRA, AWQ, AQLM #2512
Former-commit-id: 6614cc1f08aa944db083e27e451bbdd733f7dd97
|
2024-02-28 19:53:28 +08:00 |
|
hiyouga
|
596b6828cb
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
younesbelkada
|
590b6c2143
|
add v1 hf tags
Former-commit-id: a29cc9f4472c95cd6a43ea350ab728e0a8069c6e
|
2024-02-13 05:58:49 +00:00 |
|
hiyouga
|
5f83860aa1
|
add option to disable version check
Former-commit-id: fd769cb2de696aee3c5e882237e16eace6a9d675
|
2024-02-10 22:31:23 +08:00 |
|
hiyouga
|
f2e7122a96
|
bump up transformers version
Former-commit-id: 82f4d4301ed9f31b160d6313a1d2d44a22865f4d
|
2024-02-04 00:01:16 +08:00 |
|
hiyouga
|
66e0e651b9
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
a423274fd9
|
support function calling
Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
ccc5b324fe
|
fix rm server
Former-commit-id: 81bc1638682a9fd01518f9f25250a6b584d2a9e6
|
2024-01-03 15:30:46 +08:00 |
|