hoshi-hiyouga
|
9b5baa97f0
|
[data] qwen3 fixes (#8109)
|
2025-05-20 02:00:30 +08:00 |
|
hoshi-hiyouga
|
dc080399c6
|
[model] add seed coder and qwen3 quant models (#8039)
|
2025-05-13 15:59:55 +08:00 |
|
hoshi-hiyouga
|
4eec541857
|
[data] add coig-p dataset (#7657)
|
2025-04-09 21:18:25 +08:00 |
|
Zhangchi Feng
|
8f401e37f8
|
[model] support audio (#6701)
* support qwen2_audio
* improve code
* lint
* fix
* fix
* fix
---------
Co-authored-by: hiyouga <hiyouga@buaa.edu.cn>
Former-commit-id: 5eacb5629e4d7733cd992a63747a1335f2c6a929
|
2025-02-05 04:59:09 +08:00 |
|
hiyouga
|
7f71276ad8
|
add docstrings, refactor logger
Former-commit-id: c34e489d71f8f539028543ccf8ee92cecedd6276
|
2024-09-08 00:56:56 +08:00 |
|
hiyouga
|
abd26f5f67
|
update data readme
Former-commit-id: 0af5f054b7b8da8b39eb44b1dfa76050f0c45667
|
2024-09-05 04:44:49 +08:00 |
|
hiyouga
|
4d35ace75e
|
update data readme
Former-commit-id: 81adb153b7d0b30e6cd50c9bf4ca1ccf17458611
|
2024-09-05 04:25:27 +08:00 |
|
hiyouga
|
d789b667d7
|
optimize predict vram
Former-commit-id: a577e44eee351b3ed8011a33ae01cd713354ff97
|
2024-08-30 23:08:45 +08:00 |
|
hiyouga
|
e4d11a117b
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
codingma
|
5f2bd04799
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hoshi-hiyouga
|
91cc571e6e
|
Update README_zh.md
Former-commit-id: 3007d260ed45169583a74497a53b661337dd5f71
|
2024-05-30 00:04:47 +08:00 |
|
seanzhang-zhichen
|
a3b52fd380
|
Merge branch 'main' into add_dataset_sample_num
Former-commit-id: 26300127c45f24e63b91f1b0cc73e46c3a936a91
|
2024-05-24 15:57:47 +08:00 |
|
hiyouga
|
09e78272c2
|
Update README_zh.md
Former-commit-id: 34c4ba6bf9bb89170446fb396aa06ae44d251de0
|
2024-05-21 18:30:59 +08:00 |
|
zhangzc
|
de9f1583c2
|
fix conflict
Former-commit-id: 6922b23a748c2459147bf44b96d86daa89f2c96c
|
2024-05-20 17:10:01 +08:00 |
|
hiyouga
|
57dde7c3bc
|
update data readme
Former-commit-id: 22c7335b496e4a673383d5a1e4e60bf2cb4e35b3
|
2024-05-18 21:37:38 +08:00 |
|
hiyouga
|
6b9003f781
|
update data readme
Former-commit-id: beb864a9367943d3274cb6057423d1eb9aaf85c4
|
2024-05-18 21:15:20 +08:00 |
|
hiyouga
|
2bff90719b
|
improve KTO impl., replace datasets
Former-commit-id: e56a57ddcf061de6e4acc8679f7dbf0b68364986
|
2024-05-18 03:44:56 +08:00 |
|
hoshi-hiyouga
|
eb99999ca8
|
Update README_zh.md
Former-commit-id: 1c673d89faca3160627009fcd0a4aa39138570c0
|
2024-05-02 02:14:55 +08:00 |
|
Lao
|
57fcdca336
|
Update README_zh.md
Former-commit-id: bacc8588dc7b0b43c240189ecf4336bedc299357
|
2024-04-28 23:31:37 +08:00 |
|
khazic
|
3d88589c0f
|
Upgrade the second sharegpt format
Former-commit-id: 057f992a666b029d207a3dc7dfc353f9abcf8316
|
2024-04-28 14:30:05 +08:00 |
|
khazic
|
dfd153cc81
|
added the second sharegpt format
Former-commit-id: 6d140ac98a78ecc0a713842bb917dc8eb14450cb
|
2024-04-28 14:27:45 +08:00 |
|
hiyouga
|
23b881bff1
|
support mllm hf inference
Former-commit-id: 2c7c01282acd7ddabbb17ce3246b8dae4bc4b8cf
|
2024-04-26 05:34:58 +08:00 |
|
hiyouga
|
d764cd8736
|
support ORPO
Former-commit-id: f44a4c27e2461cdaa1b16865f597a31033c0e6d9
|
2024-03-31 18:29:50 +08:00 |
|
zhangzc
|
7cdc16abdf
|
Supports custom data set sampling quantity
Former-commit-id: fa8325401df27595de4611a89dfcc14644956abd
|
2024-03-27 14:22:50 +08:00 |
|
hiyouga
|
62b6a7971a
|
update data/readme
Former-commit-id: aa566e3cea5bc75688b4399a9da07be0b35b921c
|
2024-02-10 21:04:29 +08:00 |
|
hiyouga
|
1955a8ea5a
|
improve aligner
Former-commit-id: cc7296b92e10c24967fc753393275b71d300683f
|
2024-02-10 16:39:19 +08:00 |
|
hiyouga
|
5b8712d061
|
fix autoset attn impl, update data readme
Former-commit-id: 34a6e5f82baf45cc8dbb11f9f7ab4a480ab7ec5c
|
2024-01-31 11:58:07 +08:00 |
|
hiyouga
|
fe4d93c6db
|
add array param format
Former-commit-id: bf910f8a5b21ee552fa9ab069610a3f5f611de57
|
2024-01-21 22:17:48 +08:00 |
|
hiyouga
|
d925ecae1b
|
add models
Former-commit-id: 3a4728557304996bcbe58d7d6380beead7c63c70
|
2023-12-18 19:09:31 +08:00 |
|
hiyouga
|
934d00ea1e
|
support system column #1765
Former-commit-id: f425584a511c5e42bae8b3ba090eaa898b28adad
|
2023-12-12 19:45:59 +08:00 |
|
hiyouga
|
f3ffa8310f
|
fix #1784
Former-commit-id: 4e1af5a5d39d9e2f374c1372e2d67120c63fea09
|
2023-12-09 20:53:18 +08:00 |
|
hiyouga
|
065021d82a
|
update data readme
Former-commit-id: 6a65ef44ed58714c611da60b5af96b85352e8735
|
2023-11-03 00:15:23 +08:00 |
|
hiyouga
|
4bb643e685
|
update data readme (zh)
Former-commit-id: b32fb3a984c681732b82f6544d6c05a98c34cf4c
|
2023-11-02 23:42:49 +08:00 |
|
hiyouga
|
e5b72c6a77
|
refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: 0feec9a830b917b36686b61938a66e842eccf930
|
2023-09-01 19:00:45 +08:00 |
|
codemayq
|
a6662b73f5
|
add readme for dataset
Former-commit-id: bdcb0ea40e726e4c5752f938b379ed9a18e7e1d0
|
2023-08-23 19:55:45 +08:00 |
|
hiyouga
|
6310613699
|
update template
Former-commit-id: a95f3a4d62de1073a78125401cf4289ec0523156
|
2023-08-22 19:46:09 +08:00 |
|
Peter Pan
|
5cac87d317
|
add rm dataset explanation
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Former-commit-id: 1efb95025be6501f1b30b20e7c711d3590b5d1ee
|
2023-08-22 01:33:59 -04:00 |
|
hiyouga
|
a707f5b502
|
update readme, fix web ui postprocess
Former-commit-id: ba51ab3379100108f7b52a3c2444ccdd99e8a6ef
|
2023-07-22 14:29:22 +08:00 |
|
mrhan1993
|
8e6b7034fe
|
根据GLM Efficient Tuning添加中文README,web添加了server_port
Former-commit-id: 29e3acd23eafd891667d7a860ec544a5b05d3c33
|
2023-07-21 16:57:58 +08:00 |
|