hiyouga
|
38505ae9e1
|
update accelerate ver for schedule_free optimizers
Former-commit-id: bdde35fd2e
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
3aefdad4ec
|
release v0.9.0 (real)
Former-commit-id: 90d6df6222
|
2024-09-09 01:00:25 +08:00 |
|
hiyouga
|
561ae4d1af
|
fix constants
Former-commit-id: 653fe70acb
|
2024-09-08 23:52:30 +08:00 |
|
hiyouga
|
fb9280a0a7
|
release v0.9.0
Former-commit-id: 54b5c4b819
|
2024-09-08 23:43:35 +08:00 |
|
hiyouga
|
78cf256067
|
support vllm 0.6.0
Former-commit-id: b6681d7198
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c6905937
|
2024-09-08 00:56:56 +08:00 |
|
hoshi-hiyouga
|
de277a8ab8
|
Merge pull request #5372 from LDLINGLINGLING/main
add support for minicpm3.0
Former-commit-id: 1274356263
|
2024-09-05 21:35:42 +08:00 |
|
liudan
|
1797fe50a4
|
revise code to follow the code style guidelines
Former-commit-id: 3d3fbaaff9
|
2024-09-05 20:17:55 +08:00 |
|
hiyouga
|
4fccc65579
|
support Yi-Coder models
Former-commit-id: 359ef8bb0e
|
2024-09-05 03:12:24 +08:00 |
|
hiyouga
|
9df7a26e6b
|
video datasets
Former-commit-id: 8cafc7b055
|
2024-09-05 02:04:17 +08:00 |
|
liudan
|
09cff03026
|
add support for minicpm3.0
Former-commit-id: d7ba97be48
|
2024-09-04 23:10:05 +08:00 |
|
hiyouga
|
cb776752f6
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 9967ccb3ae
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
21d3976eea
|
fix #5295
Former-commit-id: ad72f3e065
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
7b5834b2dd
|
tiny fix
Former-commit-id: f6ae4e75dd
|
2024-08-27 12:49:32 +08:00 |
|
hiyouga
|
daebca2368
|
tiny fix
Former-commit-id: c8b4c7fee5
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
5582674f06
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: d39f4a62d3
|
2024-08-19 23:51:39 +08:00 |
|
Ricardo
|
a9312387bc
|
_is_bf16_available judgment supports npu
Former-commit-id: 384ab8db84
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
41a8387195
|
fix: report correct device count for intel xpu
Former-commit-id: dc36fcc3de
|
2024-08-15 08:30:43 +00:00 |
|
hiyouga
|
a8add5c04b
|
add qwen2 math models
Former-commit-id: dc770efb14
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
20013e130b
|
fix #5048
Former-commit-id: b7ca6c8dc1
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
7125b6cf70
|
support gemma-2-2b
Former-commit-id: dc09d454f2
|
2024-08-01 13:45:48 +08:00 |
|
hiyouga
|
91e54d458f
|
add mistral nemo model
Former-commit-id: 1550fe7331
|
2024-07-24 16:25:53 +08:00 |
|
hiyouga
|
e0875f82b3
|
add llama3.1
Former-commit-id: 26533c0604
|
2024-07-24 16:20:11 +08:00 |
|
hiyouga
|
726e7046db
|
set dev version
Former-commit-id: 88c7fc1599
|
2024-07-19 02:01:46 +08:00 |
|
hiyouga
|
f5cfea56bd
|
release v0.8.3
Former-commit-id: bbd5a64423
|
2024-07-19 01:21:18 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f12
|
2024-07-17 00:33:00 +08:00 |
|
hoshi-hiyouga
|
7483e187c6
|
Update packages.py
Former-commit-id: f84b007ebb
|
2024-07-07 15:48:29 +08:00 |
|
Lian Junhong
|
7ca84e0a09
|
chore: Update vllm_engine.py to support vllm version >= 0.5.1
Former-commit-id: 322663bf90
|
2024-07-07 15:08:12 +08:00 |
|
hiyouga
|
7fcffb860d
|
add codegeex4, internlm2.5
Former-commit-id: 53b1002fb7
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a38ff842d0
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: 87d9b2d005
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
bfdaadcc40
|
update packing
Former-commit-id: cce7083024
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
e671ed520b
|
update arg name
Former-commit-id: 8a6a7b9c8a
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
cc31014002
|
improve rlhf
Former-commit-id: c47ab6c072
|
2024-07-02 22:23:08 +08:00 |
|
hzhaoy
|
28e787116b
|
add TeleChat-1B
Former-commit-id: 57b7c00430
|
2024-07-02 17:49:04 +08:00 |
|
hoshi-hiyouga
|
2452f57cd7
|
Merge branch 'main' into main
Former-commit-id: e8e6af2651
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
bbc37b2880
|
fix #4398 #4592
Former-commit-id: d74244d568
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
d3b7c489f2
|
add Gemma2 models
Former-commit-id: 6f63050e1b
|
2024-06-28 01:26:50 +08:00 |
|
hiyouga
|
835f0578c2
|
refactor pissa, improve llamaboard
Former-commit-id: 8baf3b22b0
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
d2d9fa4abb
|
support HQQ/EETQ #4113
Former-commit-id: ad144c2265
|
2024-06-27 00:29:42 +08:00 |
|
hiyouga
|
7be502c5c5
|
update readme
Former-commit-id: e507e60638
|
2024-06-24 18:22:12 +08:00 |
|
ancv
|
5319447aa5
|
move configure_packing to llamafactory.model.patcher and fix constants
Former-commit-id: 770f75dc83
|
2024-06-21 00:45:06 +07:00 |
|
hiyouga
|
80e9f8e000
|
set dev version
Former-commit-id: 42e69a3c63
|
2024-06-19 21:08:16 +08:00 |
|
hiyouga
|
9c1b04cd11
|
release v0.8.2
Former-commit-id: 71327ba85a
|
2024-06-19 20:42:09 +08:00 |
|
hiyouga
|
e3bf22f61b
|
add deepseek coder v2 #4346
Former-commit-id: a233fbc258
|
2024-06-18 22:53:54 +08:00 |
|
ancv
|
988231026a
|
update packing with sdpa and eager attention mode
Former-commit-id: 238f5c3d99
|
2024-06-16 02:25:47 +07:00 |
|
hiyouga
|
c0c6b8075a
|
tiny fix
Former-commit-id: 38b6b0f52e
|
2024-06-16 01:06:41 +08:00 |
|
hiyouga
|
8053929b20
|
add tests
Former-commit-id: 1b834f50be
|
2024-06-15 19:51:20 +08:00 |
|
hiyouga
|
f0d6e63f55
|
add minicpm #4227
Former-commit-id: 572d8bbfdd
|
2024-06-15 17:58:52 +08:00 |
|