hiyouga
|
4e3ee3b703
|
support infer 4bit model on GPUs #3023
Former-commit-id: 950a9dab9055839990656b2b40956792b253573d
|
2024-04-01 17:34:04 +08:00 |
|
hiyouga
|
33bef29828
|
fix kv cache
Former-commit-id: a9588e36e95bed896eea8d79ba7108447ff08f4b
|
2024-03-13 01:21:50 +08:00 |
|
hiyouga
|
89770c5a8e
|
fix #2802
Former-commit-id: 1370db270d7ba1a20468abdb29193ce7534d1b4f
|
2024-03-12 17:08:34 +08:00 |
|
hiyouga
|
5322aa2019
|
fix #2782 #2798
Former-commit-id: eb3ab610610a0964bc8a1c9fa015805353f04c31
|
2024-03-12 15:53:29 +08:00 |
|
hiyouga
|
56565bdbd4
|
allow non-packing pretraining
Former-commit-id: 3fee5cc5a3db9ce874ad90f2500ec092d904bd4e
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
a44beb2b20
|
fix chat engine, update webui
Former-commit-id: 8b32dddd7d883bae07735796a517927c79d1c33b
|
2024-03-08 03:01:53 +08:00 |
|
hiyouga
|
ddabd699ca
|
support vllm
Former-commit-id: 889f6e910e654d8ec3922c2185042d737ffbf1c3
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
a02d518edc
|
fix #2735
Former-commit-id: 416f6333f66b6afd70a3a936d82593efca583235
|
2024-03-07 16:15:53 +08:00 |
|
hiyouga
|
c60b53a164
|
improve aqlm optim
Former-commit-id: 81be999b407e988c2f42764d827ac859d079ed3e
|
2024-03-05 20:49:50 +08:00 |
|
hiyouga
|
562b9d0167
|
support llama pro #2338 , add rslora
Former-commit-id: 40d659b7f30dd5a004703c176ec1f22dc864e505
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
d4da4d55e4
|
lint
Former-commit-id: 6b1f89b6494e9b6b087fe90600617a3024e014e5
|
2024-02-07 01:10:04 +08:00 |
|
hiyouga
|
c0e4eebf17
|
format style
Former-commit-id: 53b683531b83cd1d19de97c6565f16c1eca6f5e1
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
b3b8fc7492
|
add upcast_lmhead option
Former-commit-id: 7ef69a1697c11ff13e7503360e40ef36cfb1c345
|
2024-01-19 23:54:25 +08:00 |
|
hiyouga
|
cde9c1a42a
|
support export push_to_hub #2183
Former-commit-id: fac09da7123a500d255de74810a8d057fb5c5f07
|
2024-01-16 23:59:42 +08:00 |
|
hiyouga
|
512a086221
|
fix args
Former-commit-id: ff18f327a3dc96d9677ef32841e8f29ab2eeb7ef
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
e25016cc6b
|
fix export format
Former-commit-id: 7c82bd396b9e6ff395850ad544d95cbf1b7557cd
|
2023-12-28 18:40:46 +08:00 |
|
hiyouga
|
422cabc3f8
|
update loader
Former-commit-id: 080d8eab858217ca58bffe719d5ffde7579c5bda
|
2023-12-24 19:10:23 +08:00 |
|
hiyouga
|
52a348cac2
|
update patcher
Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b
|
2023-12-23 15:24:27 +08:00 |
|
hiyouga
|
9cdaa43d1c
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
d66e90b816
|
fix tokenizer for Yi chat models #1617 #1875
Former-commit-id: 9485692c8d367a0b25d3e653db413aa01cb9ad7d
|
2023-12-18 17:18:11 +08:00 |
|
hiyouga
|
8432e50396
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hoshi-hiyouga
|
97d5fb3460
|
Merge branch 'main' into feat/support_ms
Former-commit-id: 698756dffb7d4e602b3e0cab66ef0a4befe7215c
|
2023-12-12 17:55:32 +08:00 |
|
xingjun.wang
|
c1703c4f75
|
update args for MsDataset.load
Former-commit-id: c5f69357a167cbf99a93607177526e787419ea05
|
2023-12-12 13:02:54 +08:00 |
|
hiyouga
|
caf4fa46e0
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
yuze.zyz
|
2ea08c6631
|
support ms
Former-commit-id: fdd4f94f563110ef9f96ab4a7fd954def32e9785
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
685d0c975a
|
support full-parameter PPO
Former-commit-id: 4af967d69475e1c9fdf1a7983cd6b83bd431abff
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
dc69e3025e
|
add todo
Former-commit-id: 0bd884feb11736d0ab24ca19885151cb47d9dcd3
|
2023-11-10 14:38:18 +08:00 |
|
hiyouga
|
a3973795a0
|
fix ppo train and dpo eval
Former-commit-id: ced863031836632cb5920e22ae6991f251372118
|
2023-11-07 22:48:51 +08:00 |
|
hiyouga
|
08df3dca98
|
upgrade peft, fix #1088 #1411
Former-commit-id: aa7d104f8e050d12cb8f585bc8a52c850995500f
|
2023-11-07 16:13:36 +08:00 |
|
hiyouga
|
31969e78f7
|
reimplement neftune
Former-commit-id: efe9e5a194d3a9f052701d904715238816e4c09e
|
2023-10-22 16:15:08 +08:00 |
|
hiyouga
|
cb68572bce
|
refactor export, fix #1190
Former-commit-id: 30e60e37023a7c4a2db033ffec0542efa3d5cdfb
|
2023-10-15 16:01:48 +08:00 |
|
hiyouga
|
7fea444d48
|
fix #1184
Former-commit-id: 5b069a967823e659dbc70b0d50361b3ad248087e
|
2023-10-14 19:20:11 +08:00 |
|
hiyouga
|
f1a8fcf917
|
refactor model_dtype, fix PPO trainer
Former-commit-id: 3e17ee5afbcb823a7c9a2f91864b3750cd79edb4
|
2023-10-11 23:16:01 +08:00 |
|
hiyouga
|
4617413bde
|
fix layer norm dtype
Former-commit-id: 67af21961b68d9b54d07b09e444c7140869f26da
|
2023-09-28 00:25:55 +08:00 |
|
hiyouga
|
d6f5a3cae9
|
support LongLoRA
Former-commit-id: 0832ed37e7947d699f17375648a52f80752c2b6b
|
2023-09-27 21:55:50 +08:00 |
|
hiyouga
|
d8ab75ee44
|
fix #944
Former-commit-id: 032245647848aaa4167086636b6c985268c5fee3
|
2023-09-21 19:51:02 +08:00 |
|
hiyouga
|
f326178f89
|
support FlashAttention2
Former-commit-id: 23e56c5554b948d4f08ad87849b261eafd2c7890
|
2023-09-10 20:43:56 +08:00 |
|
hiyouga
|
801d1fa7b9
|
change to right-padding, update reward score #803
Former-commit-id: baa90415bc8f5ebd423d001378b51c3a3a6c2ec7
|
2023-09-08 20:04:31 +08:00 |
|
hiyouga
|
c5fcf5b3a5
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: 0feec9a830b917b36686b61938a66e842eccf930
|
2023-09-01 19:00:45 +08:00 |
|
hiyouga
|
a15a602809
|
support rope scaling, fix #475 #476 #478
Former-commit-id: 337d5f68b72230e545e7a94ca789187c7a2b7187
|
2023-08-12 20:46:27 +08:00 |
|
hiyouga
|
98aa629843
|
Release v0.1.6
Former-commit-id: 43c8b3c3c8bfb2e32d17fb3e8b194938e37d54bd
|
2023-08-11 23:25:57 +08:00 |
|
hiyouga
|
7ada4f5f6f
|
support DPO training (2305.18290)
Former-commit-id: 6d98de148e4af63a7028dfaeb6cf86eb56a4488f
|
2023-08-11 03:02:53 +08:00 |
|
jiongxuc
|
326f8c71a3
|
huggingface login for projects must login while running
Former-commit-id: 0a4a2a1d3e0ff1f57215512d294d782080bd383c
|
2023-08-10 14:57:12 +08:00 |
|
hiyouga
|
a69b1b1c3a
|
modity code structure
Former-commit-id: 0682ed357210897e0b67c4a6eb31a94b3eb929f1
|
2023-07-15 16:54:28 +08:00 |
|