hoshi-hiyouga
|
3c6f735cc3
|
[config] fix export max len (#7230)
Former-commit-id: 211c0b3e8f3340acd2fae1762d9152a09f19ba34
|
2025-03-10 16:46:08 +08:00 |
|
hoshi-hiyouga
|
63e4b14565
|
[misc] fix cli (#7204)
Former-commit-id: 999f57133ca163c7108d2d5ee8194eca9b2109b4
|
2025-03-07 15:01:18 +08:00 |
|
hoshi-hiyouga
|
00245a86e6
|
[deps] upgrade vllm (#7183)
Former-commit-id: 37678a3d64668c3b4a4bfefc054e3b9b40427c1a
|
2025-03-06 15:25:08 +08:00 |
|
hoshi-hiyouga
|
8cbfa350fd
|
[misc] fix grad ckpt func (#6916)
Former-commit-id: 35e069a52b3d7cfd9b0107574b09265eb2290f0b
|
2025-02-13 00:17:18 +08:00 |
|
hoshi-hiyouga
|
0a0d7671e0
|
[data] feat: auto template (#6905)
* support auto template
* add unittest
Former-commit-id: 0c6c9150db6414a5a05527ea486dce6633dff4b3
|
2025-02-12 00:22:53 +08:00 |
|
hoshi-hiyouga
|
c322512037
|
[deps] upgrade vllm (#6857)
Former-commit-id: 4bd50f65a3d62528768561019fda2723d045c7fd
|
2025-02-08 15:02:28 +08:00 |
|
hoshi-hiyouga
|
38c52f20f7
|
[misc] allow extra args (#6831)
Former-commit-id: 0fd3a5295cb4e08a4e57e860e82103364c28fba8
|
2025-02-06 12:38:08 +08:00 |
|
hoshi-hiyouga
|
33d420bbcc
|
[optim] clean apollo (#6645)
* clean apollo code
* update readme
Former-commit-id: 38b8ec4a99189483124b54df9d6bc6b0d318855a
|
2025-01-15 01:42:50 +08:00 |
|
zhuHQ
|
9b29a431db
|
[optim] add support to APOLLO (#6617)
Former-commit-id: 5a252e5a458457adbd19da3b68a3897ad2962824
|
2025-01-15 00:24:56 +08:00 |
|
hoshi-hiyouga
|
89b308bf30
|
pin vllm version to 0.6.5 (#6629)
Former-commit-id: 26097ca0adf25ebb7d9e8eec2d2cef673c6cfe88
|
2025-01-14 02:44:02 +08:00 |
|
hoshi-hiyouga
|
9ce3cbf3c7
|
Merge pull request #6564 from stephen-nju/fix_ray
Fix ray
Former-commit-id: d4566839369726023f1b6e8f4b2332bda0c715cc
|
2025-01-08 18:14:18 +08:00 |
|
zhubin
|
b0fb054637
|
fix get ray args when args not a dict
Former-commit-id: 5e5398cd5b117b2378107172d3f91cfb0321e842
|
2025-01-08 10:06:02 +00:00 |
|
hiyouga
|
760dea0787
|
improve log
Former-commit-id: a6abf375975ffea3d51e1b944c9855b5f62ffac8
|
2025-01-08 09:56:10 +00:00 |
|
hiyouga
|
708e899769
|
refactor ray integration, support save ckpt
Former-commit-id: 2f50b27e608b2092bfceab6c6e84e6631e973ee2
|
2025-01-07 09:39:10 +00:00 |
|
Eric Tang
|
88e9badcbb
|
run style check
Former-commit-id: 5ec33baf5f95df9fa2afe5523c825d3eda8a076b
|
2025-01-07 08:55:44 +00:00 |
|
Kourosh Hakhamaneshi
|
09a17b5415
|
drafting ray integration
Signed-off-by: Kourosh Hakhamaneshi <kourosh@anyscale.com>
Former-commit-id: 19c12ddae9350f6e25a270fe3372f5b9094cf960
|
2025-01-07 08:55:44 +00:00 |
|
hiyouga
|
92c6c384cf
|
fix #6482
Former-commit-id: 8577f52b4152efe6cc7a8b5f6d37b4f9ba6684e7
|
2024-12-30 06:03:07 +00:00 |
|
hiyouga
|
51b18e565d
|
support batch infer in vllm
Former-commit-id: 3ef5ed3b9a44eed2f7e3ff221dfc343d0a97c0b5
|
2024-12-04 13:50:00 +00:00 |
|
hiyouga
|
85343ddf47
|
add vllm config
Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb
|
2024-11-10 21:28:18 +08:00 |
|
hiyouga
|
a117731ecb
|
support rank0 logger
Former-commit-id: 84528eabe560091bfd866b6a0ca864085af7529b
|
2024-11-02 18:31:04 +08:00 |
|
hiyouga
|
dbbfb5f5dc
|
use pre-commit
Former-commit-id: 7cfede95df22a9ff236788f04159b6b16b8d04bb
|
2024-10-29 09:07:46 +00:00 |
|
hiyouga
|
916804d11a
|
tiny fix
Former-commit-id: 1fe424323b212094856f423351dc2a15774d39c3
|
2024-10-11 23:51:54 +08:00 |
|
Johnny
|
87cf7cfd65
|
Update parser.py
Former-commit-id: 60b13c86f4feaffbb43f5a23a28376fe416ed118
|
2024-10-11 12:29:33 +02:00 |
|
hoshi-hiyouga
|
6278dc5295
|
Update parser.py
Former-commit-id: e7d291605f184f6ac48429015e15755192d2f274
|
2024-10-07 16:27:23 +08:00 |
|
Johnny
|
34f9777941
|
Update parser.py
Former-commit-id: 55c449b54aec04e2141bffe75d4016cbac9ef4c5
|
2024-10-07 10:17:45 +02:00 |
|
Johnny
|
31b6b841ed
|
Update parser.py
Former-commit-id: f832edc8dc0e2b78c12dc8edd702fe147a0a5292
|
2024-10-06 20:34:19 +02:00 |
|
hiyouga
|
ad96eb43e7
|
fix #5611
Former-commit-id: 3bef07ecf0557999bb0b33b650a778addc8e5b91
|
2024-10-06 10:34:55 +08:00 |
|
hiyouga
|
01dae0f475
|
support vllm 0.6.0
Former-commit-id: e39470ec51a9c74ad901871eb816df10e851f351
|
2024-09-08 02:26:20 +08:00 |
|
hiyouga
|
ec2da8b06a
|
remove visual_inputs, fix qlora
Former-commit-id: be30c01c4f1482520ece770bd54c6a4837c26f0a
|
2024-08-31 00:24:51 +08:00 |
|
hiyouga
|
228f745235
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hiyouga
|
4096d52752
|
update liger kernel
Former-commit-id: d6bf6ca2161c99dd5d644e31d2b1df451017b68c
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
8695af8766
|
fix #5292
Former-commit-id: dd81ce8ce5fdf450027c5f9634abb6ac2cd52128
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
dd6c96b96d
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
hiyouga
|
c7a1c3f43a
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
hiyouga
|
727193cdc9
|
follow #5115
Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a
|
2024-08-09 18:03:00 +08:00 |
|
hiyouga
|
8dfe34e307
|
update parser
Former-commit-id: 5262c8702382ff8bc36a172387bc4c8949f326ea
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
9573799224
|
follow #4878 fix #4684
Former-commit-id: 4715e5c5b8040b21e5f401f7e969b9fd2757d520
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
1538923eed
|
train only on the last turn of the conversation
Former-commit-id: ab6198e4c099edeb1a400f58729cd617e8cd8e50
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
746e9b352e
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|
hiyouga
|
392ac88d78
|
allow computing rouge in training
Former-commit-id: ac67d50673989e8137965f5f718fec67c184f55b
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
8ce43766c6
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hiyouga
|
95f47490f9
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 4d23a0bcda0c15a903a62eec72d14c584ce020dd
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
a0df8be4e8
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hiyouga
|
fe888a9073
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
ancv
|
260f55ea47
|
move efficient_packing from data_args to model_args
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
884a4a33ee
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hiyouga
|
b588a099db
|
fix #4549
Former-commit-id: c9fdef10de737d1f433209812ef73e29cb60490a
|
2024-06-28 00:41:58 +08:00 |
|
hiyouga
|
9805350811
|
tiny fix
Former-commit-id: c1a78a3a9f8ab9d57577cee37f9c457d60863ba2
|
2024-06-27 20:14:48 +08:00 |
|
hiyouga
|
f5bf167a6e
|
lint
Former-commit-id: c9e424d2198b5872ce118a6ab4c109bf73be2bee
|
2024-06-25 02:55:50 +08:00 |
|
hiyouga
|
4d2c279083
|
tiny fix about badam
Former-commit-id: 03f49267c7406e36aee35639f86e6e0383897090
|
2024-06-25 01:54:53 +08:00 |
|