hoshi-hiyouga
|
fec641ec82
|
[misc] allow extra args (#6831)
Former-commit-id: 0fd3a5295cb4e08a4e57e860e82103364c28fba8
|
2025-02-06 12:38:08 +08:00 |
|
hoshi-hiyouga
|
7638f1070e
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
c2120432db
|
[optim] add support to APOLLO (#6617)
Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
28d145a066
|
pin vllm version to 0.6.5 (#6629)
Former-commit-id: 26097ca0adf25ebb7d9e8eec2d2cef673c6cfe88
|
2025-01-14 02:44:02 +08:00 |
|
hoshi-hiyouga
|
4e25d037c8
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: d4566839369726023f1b6e8f4b2332bda0c715cc
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
b6b53b61f7
|
fix get ray args when args not a dict
Former-commit-id: 5e5398cd5b117b2378107172d3f91cfb0321e842
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
647c51a772
|
improve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
944a2aec4d
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
4f31ad997c
|
run style check
Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
8683582300
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
f8f05a883b
|
fix #6482
Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
c1768cfb14
|
support batch infer in vllm
Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
1e6f96508a
|
add vllm config
Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
093eda2ad6
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
248d5daaff
|
use pre-commit
Former-commit-id: 7cfede95df22a9ff236788f04159b6b16b8d04bb
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
c7efc7f2ed
|
tiny fix
Former-commit-id: 1fe424323b212094856f423351dc2a15774d39c3
|
2024-10-11 23:51:54 +08:00 |
|
Johnny
|
9d27aaa38f
|
Update parser.py
Former-commit-id: 60b13c86f4feaffbb43f5a23a28376fe416ed118
|
2024-10-11 12:29:33 +02:00 |
|
hoshi-hiyouga
|
bba026a212
|
Update parser.py
Former-commit-id: e7d291605f184f6ac48429015e15755192d2f274
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
2b69ae0eb2
|
Update parser.py
Former-commit-id: 55c449b54aec04e2141bffe75d4016cbac9ef4c5
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
f9815dd20a
|
Update parser.py
Former-commit-id: f832edc8dc0e2b78c12dc8edd702fe147a0a5292
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
6476507429
|
fix #5611
Former-commit-id: 3bef07ecf0557999bb0b33b650a778addc8e5b91
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
eb5af3d90b
|
support vllm 0.6.0
Former-commit-id: e39470ec51a9c74ad901871eb816df10e851f351
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
2f6fc27c8b
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
c1369a1ec9
|
update liger kernel
Former-commit-id: d6bf6ca2161c99dd5d644e31d2b1df451017b68c
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
d677fe053d
|
fix #5292
Former-commit-id: dd81ce8ce5fdf450027c5f9634abb6ac2cd52128
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
206a8364d4
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
5af32ce705
|
follow #5115
Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a
|
2024-08-09 18:03:00 +08:00 |
|
hiyouga
|
0e88c5754f
|
update parser
Former-commit-id: 5262c8702382ff8bc36a172387bc4c8949f326ea
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
4c1513a845
|
follow #4878 fix #4684
Former-commit-id: 4715e5c5b8040b21e5f401f7e969b9fd2757d520
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
c1e1918db1
|
train only on the last turn of the conversation
Former-commit-id: ab6198e4c099edeb1a400f58729cd617e8cd8e50
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
8c93921952
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
cb474c7b11
|
allow computing rouge in training
Former-commit-id: ac67d50673989e8137965f5f718fec67c184f55b
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
b92214f78b
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 4d23a0bcda0c15a903a62eec72d14c584ce020dd
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
5acaa476d6
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
20fdf177e8
|
move efficient_packing from data_args to model_args
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
46f0189e88
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
9103fdf866
|
fix #4549
Former-commit-id: c9fdef10de737d1f433209812ef73e29cb60490a
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
bf99223a80
|
tiny fix
Former-commit-id: c1a78a3a9f8ab9d57577cee37f9c457d60863ba2
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
98f382fda3
|
lint
Former-commit-id: c9e424d2198b5872ce118a6ab4c109bf73be2bee
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
9fd7a410bb
|
tiny fix about badam
Former-commit-id: 03f49267c7406e36aee35639f86e6e0383897090
|
2024-06-25 01:54:53 +08:00 |
|
hoshi-hiyouga
|
bfb2ad7c79
|
Merge pull request #4352 from Ledzy/main
[Enhancement] Support ZeRO-3 when using BAdam
Former-commit-id: 0dc75275efa7d7540b472783a52ea6aeaa503c0b
|
2024-06-25 01:49:13 +08:00 |
|
hiyouga
|
af2cb33bb2
|
tiny fix
Former-commit-id: 2d8d47f6126d68db1701ed18fc31310c6f14dd49
|
2024-06-20 22:56:05 +08:00 |
|
Jonery
|
fa3150548e
|
Cleaner integration.
Former-commit-id: 26d4b05d424bd71f570195dd433258caf6465d92
|
2024-06-19 12:29:40 +08:00 |
|
hiyouga
|
4bc0bea0e9
|
fix #4357
Former-commit-id: a6741bba8cebd16a6a3f97a2dc81057d0e27eb39
|
2024-06-18 22:42:45 +08:00 |
|
Jonery
|
870a54ac84
|
fix typo
Former-commit-id: d4bee3716dbf8a84564d5bcc2059172604819f3e
|
2024-06-18 12:39:26 +08:00 |
|
Jonery
|
12fcfc2b72
|
Support distributed BAdam.
Former-commit-id: bdcb986e37975911c190a74d3e60bb77aa2033bd
|
2024-06-18 12:27:47 +08:00 |
|