hoshi-hiyouga | c3c0efbaa0 | [misc] fix packing and eval plot (#7623) | 2025-04-07 18:20:57 +08:00
hoshi-hiyouga | 332f637592 | disable valset by default (#6690) (Former-commit-id: a1a94f364e33d1d73852f74eda4fa581e6b16533) | 2025-01-17 21:09:30 +08:00
Yaser Afshar | 6f1c8dacea | Add missing key to init_kwargs (Former-commit-id: 03fc4621dad132164596a58d3e8693787b7e1aca) | 2024-12-17 12:34:05 +00:00
hiyouga | 1e6f96508a | add vllm config (Former-commit-id: 95365f0ce4f362bde7de8b679b54b548d7055bfb) | 2024-11-10 21:28:18 +08:00
hiyouga | 48f0819327 | fix #4944 (Former-commit-id: 9e8cf3b21a0b12d1413c3c7f3d60399784909242) | 2024-07-24 16:42:51 +08:00
hiyouga | 9fd7a410bb | tiny fix about badam (Former-commit-id: 03f49267c7406e36aee35639f86e6e0383897090) | 2024-06-25 01:54:53 +08:00
Jonery | c7479751e8 | add example (Former-commit-id: 75603db09b085e3f703286b87abe041af020e615) | 2024-06-18 13:50:26 +08:00
hiyouga | 35379c7c0e | update train hparams (Former-commit-id: 1ca9fce55b55bf209f4b76152b586731932a3f39) | 2024-06-06 01:49:20 +08:00
hiyouga | 82d744716a | fix #4005 #4013 (Former-commit-id: 8608fa268cde5cddf8d0c6c2eb2cb5fa246c1831) | 2024-06-03 19:12:29 +08:00
hiyouga | 2bff90719b | improve KTO impl., replace datasets (Former-commit-id: e56a57ddcf061de6e4acc8679f7dbf0b68364986) | 2024-05-18 03:44:56 +08:00
hiyouga | 92b3697e2c | update badam example #3764 (Former-commit-id: a3730fd0a96bab869be6d695031182dabaea8137) | 2024-05-17 02:21:10 +08:00
hiyouga | a3320f26cf | update examples (Former-commit-id: 3b5f138155d96b346bda18e465cf60ec7d99e19c) | 2024-05-17 01:02:00 +08:00
hiyouga | e4972c8fc4 | update examples (Former-commit-id: 779603055ae9216ff549f5285caac8c0c0a1e9fb) | 2024-05-13 20:39:36 +08:00
hiyouga | 50c71dd29f | update example docs (Former-commit-id: 102cd42768d9eb2cf1219309a25b41e26149067e) | 2024-05-06 22:51:02 +08:00
hiyouga | 5c9da798b5 | update docs (Former-commit-id: a4a2e94241bea6f96590f6cb8ca8b5cddee1917e) | 2024-05-06 21:47:00 +08:00
Oscar | d0597897bf | Fix badam example outdated argument (Former-commit-id: 29aa188cc774cb72367f706f1cd4c07bc5a9f241) | 2024-05-05 23:35:19 +08:00
hiyouga | ce8200ad98 | update webui and add CLIs (Former-commit-id: 1368dda22ab875914c9dd86ee5146a4f6a4736ad) | 2024-05-03 02:58:23 +08:00
hiyouga | ba06eb65ca | update readme and examples (Former-commit-id: 27dd9bf201c24f7804811398bc2758966ec78432) | 2024-04-22 00:37:32 +08:00
hiyouga | be716972fe | remove extras (Former-commit-id: d67e972f8c3d5273e589c8c85c0a1620f59785c5) | 2024-04-22 00:35:41 +08:00
hoshi-hiyouga | de728d0371 | Update sft.sh (Former-commit-id: 2b4b1562e91bbb02e345e71b7721da9333c0791b) | 2024-04-16 17:25:40 +08:00
Jonery | 6dd6b3e396 | resolve gradient checkpointing issue. (Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae) | 2024-04-16 12:05:27 +08:00
Jonery | d4d471450f | Feature BAdam (Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed) | 2024-04-15 23:15:27 +08:00