| Author | Commit | Message | Date |
|---|---|---|---|
| hiyouga | 5420905a2e | support unsloth generate (Former-commit-id: 0ef1ad9f505dba71db9342f524cc3a7565e5e09e) | 2024-04-24 04:46:53 +08:00 |
| hiyouga | 03f2e3284a | refactor patcher (Former-commit-id: 263cfe1294f5c3188f5e8d65791f35ee0d87315a) | 2024-04-24 03:02:23 +08:00 |
| hiyouga | 35c4a2c212 | fix #3347 #3387 (Former-commit-id: c253c18185a29b59190f3e0ed236c2bb4c788085) | 2024-04-24 01:30:16 +08:00 |
| hiyouga | f8e219dc81 | fix mod stuff (Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5) | 2024-04-21 18:11:10 +08:00 |
| hoshi-hiyouga | 3365cc8cf0 | Merge pull request #3338 from astramind-ai/main: Adding Mixture of Depth (Former-commit-id: 4da2ece53353b63e672ff529d6beba41ff710c14) | 2024-04-21 18:05:52 +08:00 |
| hoshi-hiyouga | 3a5e68b7d9 | fix #3348 (Former-commit-id: aa5e921c00f60074eceb2f9d4d8837cc713edba6) | 2024-04-20 10:34:09 +08:00 |
| Marco | 44cda2eece | Added Mixture of Depths (Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345) | 2024-04-18 20:31:24 +02:00 |
| hiyouga | efa808069a | support unsloth 2024.4 (Former-commit-id: 14a83f8bc4fe44783252378fce59198194a96bb8) | 2024-04-16 00:25:03 +08:00 |
| hiyouga | b5c5283dd6 | add codegemma (Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3) | 2024-04-16 00:11:15 +08:00 |
| hiyouga | b638c65519 | support cohere commandR #3184 (Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977) | 2024-04-15 23:26:42 +08:00 |
| hiyouga | 276f2cb24e | update examples (Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49) | 2024-04-15 22:14:34 +08:00 |
| hiyouga | 566d71b7a9 | fix quant infer and qwen2moe (Former-commit-id: b75d16767f35c36e2cf2aaab8a3844135085bccf) | 2024-04-09 17:12:59 +08:00 |
| hiyouga | 1348f7d860 | fix resize vocab at inference #3022 (Former-commit-id: c243720b89eec0af2872fa3c7980a0026d893f4d) | 2024-04-03 18:14:24 +08:00 |
| hiyouga | 117b67ea30 | add moe aux loss control #3085 (Former-commit-id: c9187ebc944e2de454ace3304b7d28eabb1b1a81) | 2024-04-02 14:26:31 +08:00 |
| hiyouga | c548ad5e69 | fix #2928 (Former-commit-id: 9558ee87bc7260a6596385aaa375df544862bfa9) | 2024-03-24 00:34:54 +08:00 |
| hiyouga | 46f99ff277 | improve lora+ impl. (Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39) | 2024-03-13 23:32:51 +08:00 |
| hiyouga | c635bbe465 | fix #2732 (Former-commit-id: bc39ad1d102b91d5417daa38b8a581e1e1ab2af9) | 2024-03-09 22:37:16 +08:00 |
| hiyouga | 4881f4e631 | allow non-packing pretraining (Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e) | 2024-03-09 22:21:46 +08:00 |
| hiyouga | 43b2ede0f8 | fix #2756 , patch #2746 (Former-commit-id: 627d1c91e675f1d9ebf47bad123cbbf29821da4d) | 2024-03-09 02:01:26 +08:00 |
| hoshi-hiyouga | 2f095e2017 | Merge pull request #2746 from stephen-nju/main: fix deepspeed ppo RuntimeError (Former-commit-id: 656c653f0c628f9494b4d7ae12e60c8eeec1ea7a) | 2024-03-09 01:37:00 +08:00 |
| hiyouga | 9b97b23ce7 | fix aqlm version (Former-commit-id: 05673f81f0295c76957f3247c62f95fda322a63e) | 2024-03-09 00:09:09 +08:00 |
| stephen | 18cfd5f349 | fix ppo runtime error (Former-commit-id: 14e2f221e3e720075e59065a3dc42aa4d993a8b6) | 2024-03-08 11:48:26 +08:00 |
| hiyouga | 9a69cadab3 | fix #2735 (Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235) | 2024-03-07 16:15:53 +08:00 |
| hiyouga | 73d9dfc7ab | fix version checking (Former-commit-id: 5780da8d640609cca388f55983d0251e5547209a) | 2024-03-06 14:51:51 +08:00 |
| hiyouga | 46ee267cfc | improve aqlm optim (Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e) | 2024-03-05 20:49:50 +08:00 |
| hiyouga | a10bead9b5 | optimize aqlm training (Former-commit-id: 8b42660e4039b3d6475f502f397686ba6b140627) | 2024-03-05 18:35:41 +08:00 |
| hiyouga | 59a9a5994e | fix #2649 (Former-commit-id: 1c850de660c671d92f0bc63f230d338b60b7c0bd) | 2024-03-01 13:02:41 +08:00 |
| hiyouga | 88fddb879d | fix #2642 (Former-commit-id: d8435e7f1850532310e1bee069b45f38cd666e48) | 2024-02-29 18:32:54 +08:00 |
| hiyouga | 544e7a491b | release v0.5.3 (Former-commit-id: f6bc89581b3cd129448da2defc23848de6f494ed) | 2024-02-29 00:34:19 +08:00 |
| hiyouga | b392e6cfb9 | support DoRA, AWQ, AQLM #2512 (Former-commit-id: 6614cc1f08aa944db083e27e451bbdd733f7dd97) | 2024-02-28 19:53:28 +08:00 |
| hiyouga | 596b6828cb | support llama pro #2338 , add rslora (Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505) | 2024-02-15 02:27:36 +08:00 |
| younesbelkada | 590b6c2143 | add v1 hf tags (Former-commit-id: a29cc9f4472c95cd6a43ea350ab728e0a8069c6e) | 2024-02-13 05:58:49 +00:00 |
| hiyouga | 5f83860aa1 | add option to disable version check (Former-commit-id: fd769cb2de696aee3c5e882237e16eace6a9d675) | 2024-02-10 22:31:23 +08:00 |
| hiyouga | f2e7122a96 | bump up transformers version (Former-commit-id: 82f4d4301ed9f31b160d6313a1d2d44a22865f4d) | 2024-02-04 00:01:16 +08:00 |
| hiyouga | 66e0e651b9 | format style (Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1) | 2024-01-20 20:15:56 +08:00 |
| hiyouga | a423274fd9 | support function calling (Former-commit-id: 66533b3f65babf2429c92c0f8fafe4eff5e0ff63) | 2024-01-18 09:54:23 +08:00 |
| hiyouga | ccc5b324fe | fix rm server (Former-commit-id: 81bc1638682a9fd01518f9f25250a6b584d2a9e6) | 2024-01-03 15:30:46 +08:00 |
| hiyouga | ebb32e85f8 | fix version (Former-commit-id: dd7500b65d0d548441eece101b60d51fa619cc0f) | 2023-12-29 04:53:36 +08:00 |
| hiyouga | 921f593632 | update loader (Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda) | 2023-12-24 19:10:23 +08:00 |
| hiyouga | 940403720a | update patcher (Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b) | 2023-12-23 15:24:27 +08:00 |
| hiyouga | 306a70c7ba | fix unsloth dtype (Former-commit-id: fd22e6546ce5f38a6a075cf894aafc3d206b2fcd) | 2023-12-23 01:59:49 +08:00 |
| hiyouga | 6faf9c35a9 | support unsloth (Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b) | 2023-12-23 00:14:33 +08:00 |
| ShaneTian | d05febe5de | Fix slow model initialization in bfloat16 dtype. (Former-commit-id: cf2e2f6f9b7f09b1e2faf6fbc413e3f62e3846c7) | 2023-12-22 16:27:28 +08:00 |
| hiyouga | 31cbc67986 | match version (Former-commit-id: 16db52522584a8e084d4db2a7c253c8b88f27371) | 2023-12-20 22:17:35 +08:00 |
| hiyouga | cc16ece283 | fix #1900 (Former-commit-id: 4c35214396f873588562606b084740b6581188d9) | 2023-12-19 17:21:46 +08:00 |
| hiyouga | 790a31404a | fix #1742 (Former-commit-id: efbb32afdcf0d6aa4ca26f54c95f76dbb84f77dc) | 2023-12-16 20:50:45 +08:00 |
| hiyouga | d81ad2d4bc | support dpo-ftx (Former-commit-id: 86dfa04f9821556019fa777106787f73eb70b452) | 2023-12-16 19:21:41 +08:00 |
| hiyouga | 296711d502 | support quantization in export model (Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def) | 2023-12-15 23:44:50 +08:00 |
| hiyouga | 33521fb45e | fix bug (Former-commit-id: 95ac272907a04a64785f928536de1fd099150f92) | 2023-12-15 21:54:02 +08:00 |
| hiyouga | e5204e60ed | fix bug (Former-commit-id: 8b80baf02cfece53527c27712f0899fa3532c414) | 2023-12-15 21:49:26 +08:00 |