BUAADreamer
|
f5edbf2b49
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 6287d1b789c631205c1033adf036e28deaef4167
|
2024-04-23 18:46:12 +08:00 |
|
BUAADreamer
|
ab6dc0ea30
|
add multimodal LLM BLIP-2 and InstructBLIP
Former-commit-id: a730f89a972f1a9d37c718c716f199cb8d4903b2
|
2024-04-23 18:45:43 +08:00 |
|
hiyouga
|
79d34ce0f3
|
update examples
Former-commit-id: 8bf55682cdfbbdca0f01073eac0084c20a6a09d1
|
2024-04-23 18:29:46 +08:00 |
|
hiyouga
|
1d2e372a8e
|
update readme
Former-commit-id: d4eaee262a64e716ce475dc4eb18d8d9697d8dd8
|
2024-04-22 17:09:17 +08:00 |
|
hiyouga
|
f6a53d83c8
|
update readme
Former-commit-id: 3eab580703ee01a0d2d75e7f01df5165af551386
|
2024-04-22 00:51:35 +08:00 |
|
hiyouga
|
4ec56dd958
|
update readme
Former-commit-id: fdca136309709e43d75a831252b9375a5a99635a
|
2024-04-22 00:42:25 +08:00 |
|
hiyouga
|
ba06eb65ca
|
update readme and examples
Former-commit-id: 27dd9bf201c24f7804811398bc2758966ec78432
|
2024-04-22 00:37:32 +08:00 |
|
hiyouga
|
be716972fe
|
remove extras
Former-commit-id: d67e972f8c3d5273e589c8c85c0a1620f59785c5
|
2024-04-22 00:35:41 +08:00 |
|
hiyouga
|
719585a128
|
update readme
Former-commit-id: 3a8c17907c71f46b1b37501e2afdc99ad89fb4bc
|
2024-04-22 00:21:01 +08:00 |
|
hiyouga
|
348f29aa50
|
set dev version
Former-commit-id: b9557887d7506ff57b2b2bf490092aac4e4becf0
|
2024-04-21 23:14:30 +08:00 |
|
hiyouga
|
c8fe3f544b
|
release v0.6.3
Former-commit-id: 947572af8de201669598f54735f35b50bb719d71
v0.6.3
|
2024-04-21 23:13:23 +08:00 |
|
hiyouga
|
0f1ad7140f
|
fix #3366
Former-commit-id: dc20237455c36de44f8922539d7dfadd8bedb12f
|
2024-04-21 21:34:25 +08:00 |
|
hiyouga
|
233e167f68
|
fix optimizers
Former-commit-id: f811eee2fa12a89a55a9c5d3a05a1521b4347727
|
2024-04-21 20:40:54 +08:00 |
|
hiyouga
|
1d341dcd83
|
fix #3365
Former-commit-id: 415ce41e8fa887e980e5bd575c8e95bd4076b90b
|
2024-04-21 19:20:18 +08:00 |
|
hiyouga
|
d16561e7a4
|
fix bug in galore optimizer
Former-commit-id: c05ac23261a5a8ba893c2918a43dc7777307407b
|
2024-04-21 18:53:22 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
hoshi-hiyouga
|
3365cc8cf0
|
Merge pull request #3338 from astramind-ai/main
Adding Mixture of Depth
Former-commit-id: 4da2ece53353b63e672ff529d6beba41ff710c14
|
2024-04-21 18:05:52 +08:00 |
|
hoshi-hiyouga
|
3a5e68b7d9
|
fix #3348
Former-commit-id: aa5e921c00f60074eceb2f9d4d8837cc713edba6
|
2024-04-20 10:34:09 +08:00 |
|
hiyouga
|
0cb596fee1
|
add dpo mix dataset
Former-commit-id: 6def3f8bfa51b2d9d73af112352ce07db972e4c9
|
2024-04-20 01:31:38 +08:00 |
|
hiyouga
|
b3b5b530d1
|
fix #3352
Former-commit-id: f315f8e8ec916b82bac94a159e55839ff155c6b5
|
2024-04-19 22:40:01 +08:00 |
|
hiyouga
|
9225c15c88
|
fix llama3 template
Former-commit-id: 20e95250168fbe081c779b2e1ff23f5df3ce02f7
|
2024-04-19 15:46:51 +08:00 |
|
Marco
|
abd9fed445
|
fix small typo
Former-commit-id: 5638a03cd0cf8119ff366b3b3e303b5a2351b065
|
2024-04-18 20:33:29 +02:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hoshi-hiyouga
|
8397808d1d
|
support llama3
Former-commit-id: c1eabb751a5fd73b710714451b146732e0ed4558
|
2024-04-19 01:13:50 +08:00 |
|
hiyouga
|
9e1bd6420d
|
fix #3324
Former-commit-id: 5e710c4ac331f3400534d33b2646c4108c898d98
|
2024-04-18 15:34:45 +08:00 |
|
hiyouga
|
619264c854
|
tiny fix
Former-commit-id: 86399ca8c06273c42c2b184664ae25d3405b3bf6
|
2024-04-18 00:22:17 +08:00 |
|
hiyouga
|
1ebac62e3d
|
update readme
Former-commit-id: a49112a74339ba77bfec53f7870e821fe148db2c
|
2024-04-17 23:40:49 +08:00 |
|
hiyouga
|
ce9bdb3509
|
add mixtral 8x22B models
Former-commit-id: eccbeecff0909e1fa124b5439ffbbfbc5607e1d6
|
2024-04-17 23:35:59 +08:00 |
|
hiyouga
|
0c8d6369ac
|
add CodeQwen models
Former-commit-id: 9f6094241391f8f717818c8ba94e11d1791b4a5c
|
2024-04-17 23:27:22 +08:00 |
|
hiyouga
|
bee796f6b5
|
fix #3316
Former-commit-id: 7395e9e90a209228ff563ab54319955608850fc3
|
2024-04-17 22:54:34 +08:00 |
|
hiyouga
|
9f6349a333
|
fix #3317
Former-commit-id: 7dce1763be4374cf616d96db95ae964ff510a9d6
|
2024-04-17 22:17:19 +08:00 |
|
hiyouga
|
171a029c5e
|
lint
Former-commit-id: 917d65ce65024d17a5030bc57083a427cfae16d7
|
2024-04-16 18:21:09 +08:00 |
|
hoshi-hiyouga
|
eaefaa0fe0
|
Merge pull request #3291 from codemayq/main
support for previewing custom dataset in directory format
Former-commit-id: 40d89152282101a7c08f53e72c2ad7124a0595f3
|
2024-04-16 18:12:09 +08:00 |
|
hiyouga
|
d301f0a64b
|
Update parser.py
Former-commit-id: 92c2133896c20054db86dd53508c982e39bd5ca0
|
2024-04-16 18:09:31 +08:00 |
|
hiyouga
|
0a1578e4e3
|
update readme and gradio version
Former-commit-id: 4029b60ddcbd15b5354503c51178f0f5e7e9aedf
|
2024-04-16 18:09:16 +08:00 |
|
hiyouga
|
a4167fd925
|
support badam for all stages
Former-commit-id: 7a1380646119bfe6855f73dd90570defcea05281
|
2024-04-16 17:44:48 +08:00 |
|
hoshi-hiyouga
|
42084e08ae
|
Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
Former-commit-id: 10a5e1e65b34b03e5ca2a41bf6ded09a3fb25f0c
|
2024-04-16 17:32:16 +08:00 |
|
hoshi-hiyouga
|
9d23f5dc89
|
Update utils.py
Former-commit-id: 01147536b2bb507e87e033fa696e9eb39fe96bbe
|
2024-04-16 17:30:12 +08:00 |
|
hoshi-hiyouga
|
5978427ae0
|
Update trainer.py
Former-commit-id: c6163be1444c00dd000f288e2f834968bd932981
|
2024-04-16 17:29:52 +08:00 |
|
hoshi-hiyouga
|
c7c216069c
|
Update utils.py
Former-commit-id: 7edf4dbed88b8034282f14fd6e0cb6f7f9e5f805
|
2024-04-16 17:29:30 +08:00 |
|
hoshi-hiyouga
|
cde9d1b917
|
Update patcher.py
Former-commit-id: 494e6a1e05b38f5ff61d83327303614f53c92e64
|
2024-04-16 17:29:19 +08:00 |
|
hoshi-hiyouga
|
96213f04b0
|
Update adapter.py
Former-commit-id: 8f7b75b26f020d8ae85baab7b082475c3bfeb512
|
2024-04-16 17:28:12 +08:00 |
|
hoshi-hiyouga
|
7ecea08b9b
|
Update parser.py
Former-commit-id: 898239883afc79f03abd0dc276eef901662a9591
|
2024-04-16 17:27:25 +08:00 |
|
hoshi-hiyouga
|
191971865d
|
Update parser.py
Former-commit-id: 2f3da8169d18b026760cc0ac7dd6141bdd08c932
|
2024-04-16 17:27:02 +08:00 |
|
hoshi-hiyouga
|
ff4f587dd9
|
Update finetuning_args.py
Former-commit-id: 3a23d900aea74078f0bc8cf73fac860a4ce3df67
|
2024-04-16 17:26:30 +08:00 |
|
hoshi-hiyouga
|
de728d0371
|
Update sft.sh
Former-commit-id: 2b4b1562e91bbb02e345e71b7721da9333c0791b
|
2024-04-16 17:25:40 +08:00 |
|
hoshi-hiyouga
|
d08e09642d
|
Update requirements.txt
Former-commit-id: 1e45537ca0bb4d49b4147df01122e365b3d617e4
|
2024-04-16 17:10:17 +08:00 |
|
hoshi-hiyouga
|
351493b183
|
Update setup.py
Former-commit-id: 5df30ea166aff29d48ff83a22ac6ef1611ce3e35
|
2024-04-16 17:10:02 +08:00 |
|
Jonery
|
86ab47e121
|
remove badam from core requirements
Former-commit-id: fa5898944a3867ac5108dd0d579ca0677c87d3d6
|
2024-04-16 12:25:50 +08:00 |
|
Jonery
|
6dd6b3e396
|
resolve gradient checkpointing issue.
Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae
|
2024-04-16 12:05:27 +08:00 |
|