hiyouga
|
f312b7db06
|
add deepseek coder v2 #4346
Former-commit-id: d83d3846d8e3bf5c40d4b90c24e2c5909ec61864
|
2024-06-18 22:53:54 +08:00 |
|
ancv
|
84e1f06e45
|
update packing with sdpa and eager attention mode
Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
640372cb66
|
tiny fix
Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
4851ef85b7
|
add tests
Former-commit-id: 484634ee9c982e82e919ff67d507e0210345182d
|
2024-06-15 19:51:20 +08:00 |
|
hiyouga
|
61aaab22c9
|
add minicpm #4227
Former-commit-id: e1bb18ce60be9a1b203989def30f1b9194286325
|
2024-06-15 17:58:52 +08:00 |
|
hiyouga
|
acfae2e677
|
add license
Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0
|
2024-06-15 17:54:33 +08:00 |
|
hiyouga
|
344d1192ac
|
clean code
Former-commit-id: f54cafd5c7f0383370d1a2f357834a61a97397ce
|
2024-06-13 01:58:16 +08:00 |
|
hiyouga
|
e540759f4f
|
set dev version
Former-commit-id: 16c47cc15226119e33e46ba0f2f6ccb37072257f
|
2024-06-11 00:50:53 +08:00 |
|
hiyouga
|
41eadf5459
|
release v0.8.1
Former-commit-id: 875a34f492701d1c644facbe9ede411af2931513
|
2024-06-11 00:44:26 +08:00 |
|
hiyouga
|
a2acefea6e
|
fix llamafactory-cli env
Former-commit-id: b0515e5f42831b67d1f4d049999ecb68756e66db
|
2024-06-08 07:15:45 +08:00 |
|
hiyouga
|
088292e84a
|
set dev version
Former-commit-id: 08b7fe1c452cc99264ff0312e310b579590c6a45
|
2024-06-08 06:46:09 +08:00 |
|
hiyouga
|
cabe5ca7d0
|
release v0.8.0
Former-commit-id: 004db680b9e3996ec511ee818df6c0c02bf13603
|
2024-06-08 05:20:54 +08:00 |
|
hiyouga
|
5606780ab6
|
add resume args in webui
Former-commit-id: 1d86ad768b1f36e54b4c2a9f18f6ea5a7df04c90
|
2024-06-08 00:22:16 +08:00 |
|
hiyouga
|
8cc3bbdc62
|
fix #4120
Former-commit-id: 2a44da678a5e360a9c0f9056397ac9e801329321
|
2024-06-07 04:18:05 +08:00 |
|
hiyouga
|
093abed7cc
|
add qwen2 models
Former-commit-id: 49cb694d02c876e3740a003a8b332349f4310ad3
|
2024-06-07 00:22:57 +08:00 |
|
hiyouga
|
d3a378ffea
|
fix torch gc
Former-commit-id: e173799d057598e5692a407601c30d8ce1513461
|
2024-06-06 20:30:25 +08:00 |
|
hiyouga
|
990dd6d44c
|
lora modules: all by default
Former-commit-id: 52c4ae87c7f4312704c31ef26b079b2c5b95ea5f
|
2024-06-06 03:53:28 +08:00 |
|
hiyouga
|
8d9f3022d2
|
add codestral 22B
Former-commit-id: b011c7f527a57cb1d21c4e2c9631c2fb62bb835e
|
2024-06-06 03:42:50 +08:00 |
|
hiyouga
|
e9f9b1f250
|
lint
Former-commit-id: 9030501eaef97ea249347198272adf0d709503ec
|
2024-06-06 03:33:44 +08:00 |
|
hoshi-hiyouga
|
fbc1168294
|
Merge pull request #4066 from injet-zhou/main
add throughput entry to training log
Former-commit-id: d2816f343f405f3fab09f2a8eade774b886e8f92
|
2024-06-06 03:32:04 +08:00 |
|
hiyouga
|
0b671615d0
|
update train hparams
Former-commit-id: 1ca9fce55b55bf209f4b76152b586731932a3f39
|
2024-06-06 01:49:20 +08:00 |
|
hiyouga
|
1935f4a1e0
|
add llamafactory-cli env
Former-commit-id: 1df077184845ff5f394b9324d46f8c382869e590
|
2024-06-06 01:28:14 +08:00 |
|
hiyouga
|
fc053cf81f
|
fix #4090
Former-commit-id: d9f15f30a8f4bc64778a5c96baeb6801700d7a2c
|
2024-06-06 00:50:32 +08:00 |
|
hiyouga
|
04a7065830
|
support glm-4
Former-commit-id: a10f4718fbf3f3c89dc7eb31cb8e1a46ca6adda5
|
2024-06-05 15:16:38 +08:00 |
|
faddddeout
|
f4cf31a1a0
|
add throughput entry to log
Former-commit-id: 691f999f64c7bac78761e4354f89816d2f0d46fc
|
2024-06-04 11:04:29 +00:00 |
|
hiyouga
|
ee80c3acf1
|
bump versions
transformers 4.37.2->4.41.2
datasets 2.14.3->2.16.0
accelerate 0.27.2->0.30.1
peft 0.10.0->0.11.1
trl 0.8.1->0.8.6
Former-commit-id: 5f1e041f7295bf42a41dd4d9e7f0c42fcc37fed2
|
2024-06-03 18:29:38 +08:00 |
|
hiyouga
|
219a16130a
|
better llamaboard
* easily resume from checkpoint
* support full and freeze checkpoints
* faster ui
Former-commit-id: 84cfb2452cc86b037ccddee6e833f8eb7c129fa4
|
2024-05-29 23:55:38 +08:00 |
|
hiyouga
|
0c722c879a
|
update readme
Former-commit-id: 440e9de66986ef7736361ce8ec3e23ce68655a56
|
2024-05-29 18:39:11 +08:00 |
|
hzhaoy
|
8bd3c0bae2
|
add TeleChat-12B/TeleChat-12B-v2 models
Former-commit-id: e0675385c88af03aaef8d51586c8a282829c4051
|
2024-05-29 15:00:37 +08:00 |
|
hiyouga
|
edbc4bdac4
|
support DDP in webui
Former-commit-id: d059262ff8dc857f597d2657546ec625726a664a
|
2024-05-28 19:24:22 +08:00 |
|
hiyouga
|
7e9372bb2f
|
tiny fix
Former-commit-id: 4c47b3dcef9e400a1c35fce1ad53619a0a86fe81
|
2024-05-27 20:54:26 +08:00 |
|
hoshi-hiyouga
|
2f795f0341
|
Merge pull request #3921 from gusye1234/main
Add openchat-3.6-8B support
Former-commit-id: 92e6bba3cab22b7835a68f787caf7992a398978e
|
2024-05-27 20:52:37 +08:00 |
|
Jianbai Ye
|
db745355bb
|
add openchat-3.6-8B support
Former-commit-id: b66f39d50d896d7597a1506e67ec210b31c9b700
|
2024-05-27 20:42:08 +08:00 |
|
hiyouga
|
a723876663
|
support Aya23
Former-commit-id: 071935b90006e2c79e39bb9ee0c5d48c6c910501
|
2024-05-27 20:23:24 +08:00 |
|
hiyouga
|
9e87ea0cb7
|
add phi-3 7b/14b, mistral v0.3 models
Former-commit-id: 86dab182f9710b063f518922ccb49b01aa71c576
|
2024-05-27 18:20:16 +08:00 |
|
hiyouga
|
3a334da50f
|
update readme
Former-commit-id: b8d0170fe0d094acce85dcb5f91775e4685ee055
|
2024-05-27 18:14:02 +08:00 |
|
hiyouga
|
ed2601a909
|
support SimPO #3900
Former-commit-id: 6b954ce60155cf8334150b795cfc4bb63ca74c8b
|
2024-05-26 23:46:33 +08:00 |
|
hiyouga
|
16008627db
|
fix #3847
Former-commit-id: d206b306ca4eadc8b3d4feaf490ad12f9452e562
|
2024-05-21 17:53:06 +08:00 |
|
hiyouga
|
41609f323e
|
support paligemma
Former-commit-id: 11c27f9bf204d3d6a9ca5bd4f0a19a420160453f
|
2024-05-21 00:01:22 +08:00 |
|
hiyouga
|
090fc83188
|
fix paligemma inference
Former-commit-id: 46357b7a677e8ba2e0a7c9d4ec1974abd061569c
|
2024-05-20 23:36:43 +08:00 |
|
hiyouga
|
547235deee
|
fix envs
Former-commit-id: d5e150cfb98f8216713415564ab386b8320c88cb
|
2024-05-19 18:27:18 +08:00 |
|
hoshi-hiyouga
|
65715cab9d
|
Merge pull request #3785 from enji-zhou/feature/add_kto
add kto
Former-commit-id: f60faa23e23022fd855dac6b1ecbd21e095bccb5
|
2024-05-18 03:07:18 +08:00 |
|
hiyouga
|
7728cb8fdb
|
add deepseek v2 lite model
Former-commit-id: 5e864e6b721d8b891b1cc2ca2dcac41babb9eaaf
|
2024-05-17 13:25:36 +08:00 |
|
enji.zhou
|
d16a1d9ed0
|
add kto
Former-commit-id: ec51986cf70b0bdd79b8141e45916670fb97a08e
|
2024-05-17 13:09:17 +08:00 |
|
hiyouga
|
040ae800bf
|
add falcon 11b
Former-commit-id: 897acc725edc204fad393cc9616828431b4fa768
|
2024-05-17 00:08:33 +08:00 |
|
hiyouga
|
ee759aa0d8
|
rename package
Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08
|
2024-05-16 18:39:08 +08:00 |
|