shing100
|
0a633f8098
|
add Exaone3.0 template
Former-commit-id: 3a9569647f5dbb1dcd9ef6e5cfc39ec8f9b41e89
|
2024-09-30 09:18:25 +09:00 |
|
hoshi-hiyouga
|
6e4d5d9b2a
|
Update constants.py
Former-commit-id: b257b91cd0a71448af520baa8e864920333da848
|
2024-09-29 23:45:34 +08:00 |
|
BUAADreamer
|
b37bb592ec
|
fix constants
Former-commit-id: bec1cb8d55d01ac8b70b9bacd92a227b48cf8336
|
2024-09-29 22:40:43 +08:00 |
|
BUAADreamer
|
87ab7fc01c
|
fix constants
Former-commit-id: 485fc047169afd027ee65d05e3c5c08b371b6c4d
|
2024-09-29 22:00:01 +08:00 |
|
BUAADreamer
|
1b71afb277
|
add more llava-next series template
Former-commit-id: 65a8923f5a7d20d34fabf4f81746fe9b7bc8c84a
|
2024-09-29 21:29:29 +08:00 |
|
BUAADreamer
|
5aa1e847d9
|
add llava-next/llava-next-video/video-llava
Former-commit-id: 6642cd501d55a1657678428ef2aa0c9b99b7e83f
|
2024-09-28 00:57:03 +08:00 |
|
Zhangchi Feng
|
c576b7ca32
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 900631755b28692bb150a8cf39354af4e2e986c9
|
2024-09-27 18:14:39 +08:00 |
|
hoshi-hiyouga
|
a73988141b
|
add modelscope models
Former-commit-id: 8e5d12c2c4b687dc0d2c5bc25a916ba9f6ce67c9
|
2024-09-26 11:22:48 +08:00 |
|
marko1616
|
b70da07977
|
Chore: Support llama3.2.
Former-commit-id: 885a0b77ab83bf001d7175e2ba440f7928fa4731
|
2024-09-25 16:08:44 -04:00 |
|
hoshi-hiyouga
|
56058e2e84
|
add qwen2.5 models
Former-commit-id: 92ef62f5025475606e533947b7d9c3cae9bfcdbf
|
2024-09-19 02:07:54 +08:00 |
|
hiyouga
|
d2f8bcb890
|
set dev version
Former-commit-id: 0ded76578450f71dfe6570fbba7caaa65c004f03
|
2024-09-11 18:56:37 +08:00 |
|
Zhangchi Feng
|
4b6606832c
|
Merge branch 'hiyouga:main' into main
Former-commit-id: 4643089a7dc6a88c391663131333f35b5da5015b
|
2024-09-10 13:20:24 +08:00 |
|
BUAADreamer
|
f00f4ae9b6
|
support llava-next(video)
Former-commit-id: 31259e7e0caa9ff6449b4abcee0554e211167178
|
2024-09-10 12:31:53 +08:00 |
|
hiyouga
|
38505ae9e1
|
update accelerate ver for schedule_free optimizers
Former-commit-id: bdde35fd2e4a919c1d63ebfc9a0ea8ba0c97e14c
|
2024-09-09 22:51:08 +08:00 |
|
hiyouga
|
3aefdad4ec
|
release v0.9.0 (real)
Former-commit-id: 90d6df622252c6fad985f68b97771c979357e2fc
|
2024-09-09 01:00:25 +08:00 |
|
hiyouga
|
561ae4d1af
|
fix constants
Former-commit-id: 653fe70acbe44853fa0ad073a9b8391d75ef6c2a
|
2024-09-08 23:52:30 +08:00 |
|
hiyouga
|
fb9280a0a7
|
release v0.9.0
Former-commit-id: 54b5c4b8195d23bd9dcc1921af9910d5bdd181fd
|
2024-09-08 23:43:35 +08:00 |
|
hiyouga
|
78cf256067
|
support vllm 0.6.0
Former-commit-id: b6681d7198acf4acbebfe271dd22095e236bc430
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
7ccb86b215
|
add docstrings, refactor logger
Former-commit-id: 54c69059379d77dc9046c144cbe2d0253de3a4da
|
2024-09-08 00:56:56 +08:00 |
|
hoshi-hiyouga
|
de277a8ab8
|
Merge pull request #5372 from LDLINGLINGLING/main
增加了对minicpm3.0的适配'
Former-commit-id: 12743562639ccc6eb0caf170e7123d9844e2b4a6
|
2024-09-05 21:35:42 +08:00 |
|
liudan
|
1797fe50a4
|
根据代码规范修改了代码
Former-commit-id: 3d3fbaaff98da327e10bdebb4aedbdf1ec9565e8
|
2024-09-05 20:17:55 +08:00 |
|
hiyouga
|
4fccc65579
|
support Yi-Coder models
Former-commit-id: 359ef8bb0ebb8ccf9651ac2b737c5a705dab6bad
|
2024-09-05 03:12:24 +08:00 |
|
hiyouga
|
9df7a26e6b
|
video datasets
Former-commit-id: 8cafc7b055a854f483ad1c67f3d487ffd34b5f89
|
2024-09-05 02:04:17 +08:00 |
|
liudan
|
09cff03026
|
增加了对minicpm3.0的适配'
Former-commit-id: d7ba97be484bf781d6fe80252ea29eb505b261bb
|
2024-09-04 23:10:05 +08:00 |
|
hiyouga
|
cb776752f6
|
fix mixed mm inputs and rlhf-v
Former-commit-id: 9967ccb3aef3ca557ad6eafb78c6c99866857008
|
2024-09-01 20:52:47 +08:00 |
|
hiyouga
|
a83756b5e9
|
refactor mm training
Former-commit-id: 3382317e32f88ed377d3e7759bdeaf0f2559d22a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
21d3976eea
|
fix #5295
Former-commit-id: ad72f3e06593f124d661d61774def336511716e0
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
7b5834b2dd
|
tiny fix
Former-commit-id: f6ae4e75ddaeb4ac4a527f0141ac5b1afefde10e
|
2024-08-27 12:49:32 +08:00 |
|
hiyouga
|
daebca2368
|
tiny fix
Former-commit-id: c8b4c7fee5398654683b713ad5c03b5daf13218a
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
5582674f06
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: d39f4a62d3c5a3bbbf39d1eb4b92439acedae18e
|
2024-08-19 23:51:39 +08:00 |
|
Ricardo
|
a9312387bc
|
_is_bf16_available judgment supports npu
Former-commit-id: 384ab8db84eef7d1f6a7643c15c565a7d4906a5c
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
41a8387195
|
fix: report correct device count for intel xpu
Former-commit-id: dc36fcc3de721bdd28edd4eed36677e59a7614be
|
2024-08-15 08:30:43 +00:00 |
|
hiyouga
|
a8add5c04b
|
add qwen2 math models
Former-commit-id: dc770efb14bd6e18421511912fbb959a3cf9f78d
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
20013e130b
|
fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
7125b6cf70
|
support gemma-2-2b
Former-commit-id: dc09d454f285b8584d9017349a9cee3b44eadb72
|
2024-08-01 13:45:48 +08:00 |
|
hiyouga
|
91e54d458f
|
add mistral nemo model
Former-commit-id: 1550fe7331370ad39e8ed69c1b060ead902a77e4
|
2024-07-24 16:25:53 +08:00 |
|
hiyouga
|
e0875f82b3
|
add llama3.1
Former-commit-id: 26533c0604ef765170f93986bc06f3066c5e28ee
|
2024-07-24 16:20:11 +08:00 |
|
hiyouga
|
726e7046db
|
set dev version
Former-commit-id: 88c7fc159999511e6e19fff3d37147a6a7064335
|
2024-07-19 02:01:46 +08:00 |
|
hiyouga
|
f5cfea56bd
|
release v0.8.3
Former-commit-id: bbd5a644230d633f507c72929e8819c07ae38bba
|
2024-07-19 01:21:18 +08:00 |
|
hiyouga
|
e90fae61f4
|
support batch_eval_metrics, fix #4826
Former-commit-id: d774b94f124923829b2eae428e25199d503ebfcb
|
2024-07-17 00:33:00 +08:00 |
|
hoshi-hiyouga
|
7483e187c6
|
Update packages.py
Former-commit-id: f84b007ebbb9fa63f797b4bd1c487372877bbc65
|
2024-07-07 15:48:29 +08:00 |
|
Lian Junhong
|
7ca84e0a09
|
chore: Update vllm_engine.py to support vllm version >= 0.5.1
Former-commit-id: 322663bf90ce7b99ca5b0b43ff9dbd95eb36ff6b
|
2024-07-07 15:08:12 +08:00 |
|
hiyouga
|
7fcffb860d
|
add codegeex4, internlm2.5
Former-commit-id: 53b1002fb74123095e7466c75b941a31a7cfba4d
|
2024-07-06 16:16:47 +08:00 |
|
hiyouga
|
7b3c1f29ff
|
fix packing for eager/sdpa attn
Former-commit-id: 6fd6aa4530f81a2ed306eeb2a5167607288b62c6
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a38ff842d0
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: 87d9b2d00513c163335d3f2e2bb3cb3299cecdaa
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
bfdaadcc40
|
update packing
Former-commit-id: cce7083024bed4c7429ddc8288d1c9190fde29f5
|
2024-07-04 01:10:55 +08:00 |
|
hiyouga
|
e671ed520b
|
update arg name
Former-commit-id: 8a6a7b9c8a876da9c16e5ada7df461eb8cabee21
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
cc31014002
|
improve rlhf
Former-commit-id: c47ab6c07287fb260ea49b8b7af46bdd416f88f7
|
2024-07-02 22:23:08 +08:00 |
|
hzhaoy
|
28e787116b
|
add TeleChat-1B
Former-commit-id: 57b7c00430bcfc83afd11547ceead041e8edfd8d
|
2024-07-02 17:49:04 +08:00 |
|
hoshi-hiyouga
|
2452f57cd7
|
Merge branch 'main' into main
Former-commit-id: e8e6af26514272e29a50649b38182beb4db4ebfa
|
2024-07-01 21:01:09 +08:00 |
|