hoshi-hiyouga
|
5978427ae0
|
Update trainer.py
Former-commit-id: c6163be1444c00dd000f288e2f834968bd932981
|
2024-04-16 17:29:52 +08:00 |
|
hoshi-hiyouga
|
c7c216069c
|
Update utils.py
Former-commit-id: 7edf4dbed88b8034282f14fd6e0cb6f7f9e5f805
|
2024-04-16 17:29:30 +08:00 |
|
hoshi-hiyouga
|
cde9d1b917
|
Update patcher.py
Former-commit-id: 494e6a1e05b38f5ff61d83327303614f53c92e64
|
2024-04-16 17:29:19 +08:00 |
|
hoshi-hiyouga
|
96213f04b0
|
Update adapter.py
Former-commit-id: 8f7b75b26f020d8ae85baab7b082475c3bfeb512
|
2024-04-16 17:28:12 +08:00 |
|
hoshi-hiyouga
|
7ecea08b9b
|
Update parser.py
Former-commit-id: 898239883afc79f03abd0dc276eef901662a9591
|
2024-04-16 17:27:25 +08:00 |
|
hoshi-hiyouga
|
191971865d
|
Update parser.py
Former-commit-id: 2f3da8169d18b026760cc0ac7dd6141bdd08c932
|
2024-04-16 17:27:02 +08:00 |
|
hoshi-hiyouga
|
ff4f587dd9
|
Update finetuning_args.py
Former-commit-id: 3a23d900aea74078f0bc8cf73fac860a4ce3df67
|
2024-04-16 17:26:30 +08:00 |
|
hoshi-hiyouga
|
de728d0371
|
Update sft.sh
Former-commit-id: 2b4b1562e91bbb02e345e71b7721da9333c0791b
|
2024-04-16 17:25:40 +08:00 |
|
hoshi-hiyouga
|
d08e09642d
|
Update requirements.txt
Former-commit-id: 1e45537ca0bb4d49b4147df01122e365b3d617e4
|
2024-04-16 17:10:17 +08:00 |
|
hoshi-hiyouga
|
351493b183
|
Update setup.py
Former-commit-id: 5df30ea166aff29d48ff83a22ac6ef1611ce3e35
|
2024-04-16 17:10:02 +08:00 |
|
Jonery
|
86ab47e121
|
remove badam from core requirements
Former-commit-id: fa5898944a3867ac5108dd0d579ca0677c87d3d6
|
2024-04-16 12:25:50 +08:00 |
|
Jonery
|
6dd6b3e396
|
resolve gradient checkpointing issue.
Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae
|
2024-04-16 12:05:27 +08:00 |
|
Jonery
|
d4d471450f
|
Feature BAdam
Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
276f2cb24e
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
hoshi-hiyouga
|
86556b1c74
|
Merge pull request #3261 from khazic/main
Added specimens for single-card full parameter prediction
Former-commit-id: 60df2a9519fbd8215c3afacc831b0cc89006457a
|
2024-04-15 16:30:57 +08:00 |
|
hoshi-hiyouga
|
0c80751e87
|
Merge pull request #3276 from liu-zichen/fix_mixtral
fix: turn on output_router_logits of mixtral
Former-commit-id: 07bbaf5c67d00a152e5304e81b15fd9189e7bb99
|
2024-04-15 15:38:16 +08:00 |
|
hiyouga
|
9338f878a3
|
fix #3273
Former-commit-id: 3b20c89b342a068356ffc29c3724b645775c65db
|
2024-04-15 15:32:58 +08:00 |
|
liuzc
|
fde3d91242
|
fix: mixtral output_router_logits
Former-commit-id: ab3171ea97ec968b972287287ef9ee2502c6d37c
|
2024-04-15 12:11:49 +08:00 |
|
khazic
|
19adfb88a9
|
Upgrade README.md
Former-commit-id: 697f768d7185789ee054c94f4f161a65b8a505bc
|
2024-04-13 20:50:49 +08:00 |
|
khazic
|
daaafa900a
|
Added specimens for single-card full parameter prediction
Former-commit-id: d8d4fb9fa4b0e1950a453682e5e186f34f085dee
|
2024-04-13 20:45:19 +08:00 |
|
hiyouga
|
106a0104da
|
fix #3247
Former-commit-id: bb67c66f80627805b585d157ba807c0ce378d3f2
|
2024-04-12 17:41:33 +08:00 |
|
hiyouga
|
5486ea09e3
|
fix model card
Former-commit-id: 920e7149bf2b559c9829aa4b11cfb6d00bbb2f9e
|
2024-04-12 17:11:59 +08:00 |
|
hiyouga
|
31bbbb6d13
|
fix #3238
Former-commit-id: 4d7e81ab4722d13bec6ca1af141f94bdc74d0883
|
2024-04-12 14:28:11 +08:00 |
|
hiyouga
|
1a77de82fa
|
set dev version
Former-commit-id: f6cc76571d2c789675883a18e0db3d0c61f33808
|
2024-04-11 20:27:34 +08:00 |
|
hiyouga
|
7468f2535c
|
release v0.6.2
Former-commit-id: f92ad0a62d957b595f6a76a5403216b163eb3d17
v0.6.2
|
2024-04-11 20:08:51 +08:00 |
|
hiyouga
|
38e4f22605
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory
Former-commit-id: 23ff02c1fd3787daf0bc6ac237c8897d02f726e4
|
2024-04-10 23:58:18 +08:00 |
|
hiyouga
|
2bc2fe7b5e
|
fix #3225
Former-commit-id: 94110ecf27c32e263f1f2ee61842a3a301b9e089
|
2024-04-10 23:57:59 +08:00 |
|
hoshi-hiyouga
|
6d0140d8a0
|
Merge pull request #3201 from kno10/patch-1 and fix #3200
Pass additional_target to unsloth
Former-commit-id: 080a96c52f489fda0d315a77e26c4f6f5d69784a
|
2024-04-10 00:58:48 +08:00 |
|
hoshi-hiyouga
|
7856f98965
|
Update adapter.py
Former-commit-id: 720fde3683529ed7e08ac27c7c4598c6bdc30d44
|
2024-04-10 00:57:51 +08:00 |
|
hoshi-hiyouga
|
e25ddef08c
|
Update adapter.py
Former-commit-id: a84b8d17dbf221259212e81931d80bcdd6284ad7
|
2024-04-10 00:57:30 +08:00 |
|
Erich Schubert
|
95a4589bbf
|
Pass additional_target to unsloth
Fixes #3200
Former-commit-id: f8f87f5b0549cba6a011749c42064047f82ba577
|
2024-04-09 17:53:40 +02:00 |
|
hiyouga
|
566d71b7a9
|
fix quant infer and qwen2moe
Former-commit-id: b75d16767f35c36e2cf2aaab8a3844135085bccf
|
2024-04-09 17:12:59 +08:00 |
|
hiyouga
|
6030a4a720
|
tiny fix
Former-commit-id: d8f1ff51d4c920d4d0aeb9d53db29d1efb733c85
|
2024-04-08 21:28:39 +08:00 |
|
hoshi-hiyouga
|
5dc0cb94d4
|
Merge pull request #3161 from hiyouga/feature/add-mediatek-model
support Breeze-7B
Former-commit-id: af92ac8b62b919a75673011a1c56832e67882ee8
|
2024-04-08 20:56:51 +08:00 |
|
codingma
|
325dafcbb0
|
add empty line
Former-commit-id: 1c6c2e611d10e9fa662e3f4e1e7d23b80ae496cb
|
2024-04-07 18:28:08 +08:00 |
|
codingma
|
1a8a8b8651
|
rename template to breeze
Former-commit-id: 1223e6358dab52b4e1505057f1b16fd9d527c79e
|
2024-04-07 18:27:20 +08:00 |
|
hoshi-hiyouga
|
61a495cb1e
|
Merge pull request #3160 from sliderSun/main
support Qwen1.5-32B
Former-commit-id: 1e5a5882dd494c3e9cf5eae2e0a485ce49d1863c
|
2024-04-07 18:00:40 +08:00 |
|
codingma
|
75866aa020
|
rename template to breeze
Former-commit-id: 1d894e7cfb73b8a29dababb554d051bd50e4f01d
|
2024-04-07 11:39:54 +08:00 |
|
codingma
|
9e4fda326d
|
support https://github.com/hiyouga/LLaMA-Factory/issues/3152
Former-commit-id: 708f0ab4b0aa72e2c73ca36eb9ed058910e43092
|
2024-04-07 11:34:01 +08:00 |
|
sliderSun
|
1131ddfaff
|
fix spell error
Former-commit-id: e6d36a2e593ebc1193b1735075c4ddb5d9f54990
|
2024-04-07 10:59:15 +08:00 |
|
sliderSun
|
9f437b5c43
|
support Qwen1.5-32B
Former-commit-id: c419adf1697b92520342f4ffa697c84bf19ca37d
|
2024-04-07 10:56:03 +08:00 |
|
sliderSun
|
0cc03d3f05
|
support Qwen1.5-32B
Former-commit-id: 8f2c67b95a8e177eb4096382417a70cacba38e90
|
2024-04-07 10:26:13 +08:00 |
|
hiyouga
|
04fc2f78bf
|
update readme
Former-commit-id: 1cf15547e2420a3e5f7a969c21c10c7fbdfc71fe
|
2024-04-07 00:48:24 +08:00 |
|
hiyouga
|
3ac333fc6a
|
update examples
Former-commit-id: de40ad62ba3d4c74c69de97b39cc79786ac28f0f
|
2024-04-04 14:48:21 +08:00 |
|
hiyouga
|
a246ac1914
|
tiny fix
Former-commit-id: 70aceecb27e72095c05462d01f956061669b267e
|
2024-04-04 02:19:03 +08:00 |
|
hiyouga
|
48ceac845c
|
back to gradio 4.21 and fix chat
Former-commit-id: 695734a40a702ea059d855da54080cc8d161e41a
|
2024-04-04 02:07:20 +08:00 |
|
hiyouga
|
b1986a06b9
|
fix bug in latest gradio
Former-commit-id: 44a962862b4a74e50ef5786c8d5719faaa65f63f
|
2024-04-04 00:55:31 +08:00 |
|
hiyouga
|
43d134ba29
|
fix requires for windows
Former-commit-id: 5e25fae40b7ea9cfa72717efbe3677199ca9608f
|
2024-04-03 21:56:43 +08:00 |
|
hiyouga
|
1348f7d860
|
fix resize vocab at inference #3022
Former-commit-id: c243720b89eec0af2872fa3c7980a0026d893f4d
|
2024-04-03 18:14:24 +08:00 |
|
hiyouga
|
f6530222f7
|
fix #3116
Former-commit-id: b7256aa33d761280751518c20f29f9b8ea3fb025
|
2024-04-03 14:47:59 +08:00 |
|