hiyouga
|
66a1abac6a
|
add examples
Former-commit-id: 169c68921b1b8ac279834b060d9e7d38a56fe1aa
|
2024-08-30 21:43:19 +08:00 |
|
hiyouga
|
665db18661
|
tiny fix
Former-commit-id: 830511a6d0216da99520aee8b3a753d347a71fa9
|
2024-08-30 03:21:50 +08:00 |
|
hiyouga
|
30d97ca879
|
fix #5307
Former-commit-id: 63c19ddfe483a16c1c9afc2f1441e8070bb0f7e4
|
2024-08-30 02:45:40 +08:00 |
|
hiyouga
|
c62a6ca59d
|
refactor mm training
Former-commit-id: 179c0558699e287cbf38a2d73bff47e86d589c5a
|
2024-08-30 02:14:31 +08:00 |
|
hoshi-hiyouga
|
77c2c7076b
|
Merge pull request #5290 from simonJJJ/qwen2_vl
support qwen2-vl
Former-commit-id: 7156f832af8505b26371559d340c0e69eb962bbc
|
2024-08-30 02:10:36 +08:00 |
|
hoshi-hiyouga
|
7466fd4387
|
fix bug
Former-commit-id: 365e6df71509569f59c40743c115f1a4b945ef0f
|
2024-08-30 02:05:26 +08:00 |
|
hiyouga
|
c1369a1ec9
|
update liger kernel
Former-commit-id: d6bf6ca2161c99dd5d644e31d2b1df451017b68c
|
2024-08-29 20:46:08 +08:00 |
|
hiyouga
|
d677fe053d
|
fix #5292
Former-commit-id: dd81ce8ce5fdf450027c5f9634abb6ac2cd52128
|
2024-08-29 20:37:47 +08:00 |
|
hiyouga
|
7c6785d3df
|
fix #5295
Former-commit-id: c76873b0eb8225f6e6bfc7223c6012387dceb8ed
|
2024-08-29 20:30:18 +08:00 |
|
hiyouga
|
77341ee3c4
|
fix #5305
Former-commit-id: a710ebaf97c258c802f24e508d83f1f3f10edc6d
|
2024-08-29 20:16:01 +08:00 |
|
simonJJJ
|
5b4b60cfb5
|
update
Former-commit-id: a968a416d5e513320c97109229ca1e6ddc003cb1
|
2024-08-28 20:22:46 +08:00 |
|
simonJJJ
|
0f3d54d8a0
|
initial-commit
Former-commit-id: b6a39847a10b417b09db4b5512dd835e9e4ce928
|
2024-08-28 16:51:35 +08:00 |
|
hiyouga
|
7272792f65
|
update wechat
Former-commit-id: ef91752cc6f53088eaf7fc2f64f7148821d82ec2
|
2024-08-27 12:55:23 +08:00 |
|
hiyouga
|
4cc8e16595
|
add extra requires
Former-commit-id: c47511773ae9886aae4e5ea1841866d2125abc34
|
2024-08-27 12:52:12 +08:00 |
|
hiyouga
|
ca5a759f94
|
tiny fix
Former-commit-id: d2cede7023bbe28525ef8b4ad27247445d8c22e5
|
2024-08-27 12:49:32 +08:00 |
|
hoshi-hiyouga
|
be51e56a2e
|
Merge pull request #5237 from marko1616/patch-1
Fix mllm api
Former-commit-id: 017703c7ab7f3dc566792619537c3202ca4f4bb7
|
2024-08-27 12:24:43 +08:00 |
|
marko1616
|
3a9171e275
|
ruff pass.
Former-commit-id: c2f817772f8e7d947dca04f546befc70001abe64
|
2024-08-27 11:30:16 +08:00 |
|
marko1616
|
bd0f3b4050
|
Update chat.py
Former-commit-id: 4e5893a5c4a47ff3cb989bbef0841effc713fc08
|
2024-08-27 11:27:56 +08:00 |
|
hiyouga
|
206a8364d4
|
support liger kernel
Former-commit-id: 0f4e54abf6c5feb2329855a4047597ad5147720a
|
2024-08-27 11:20:14 +08:00 |
|
marko1616
|
097d031066
|
Force re check.
Former-commit-id: 5f04452f7d65e535d0af08944f7b9e29e85f51d7
|
2024-08-23 14:43:18 +08:00 |
|
marko1616
|
2674b42b59
|
Update chat.py
Former-commit-id: 206a16c17d253956afb96daea6f24478e17334fc
|
2024-08-22 12:24:34 +08:00 |
|
marko1616
|
edf2e51bbc
|
Update chat.py
Former-commit-id: edf6dc1995daa6c3635c3fda1052b340693a04f5
|
2024-08-22 12:14:34 +08:00 |
|
MengqingCao
|
47877acc2a
|
update npu base image
Former-commit-id: 20819f7707cfff6b951484e91fc7ecda2bf68528
|
2024-08-21 09:12:38 +00:00 |
|
hiyouga
|
d111a324bc
|
tiny fix
Former-commit-id: 23961bdf6fdbcde64e7b943f699fdeb4ac024043
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
388f0a6e05
|
Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
Former-commit-id: 0b57175d3bd029675dae2f55995b7eeb4e9adc7a
|
2024-08-20 00:09:03 +08:00 |
|
hoshi-hiyouga
|
8c13c02c55
|
Update template.py
Former-commit-id: f5a075cb1c90f05bb0de26c6aea718f556c54623
|
2024-08-20 00:03:33 +08:00 |
|
hoshi-hiyouga
|
a101fde917
|
Merge pull request #5163 from liu-zichen/fix_ppo_optim
fix lr not change
Former-commit-id: f3c03ec6a89bf57f290820fa31eda24291355e4e
|
2024-08-19 23:56:24 +08:00 |
|
hoshi-hiyouga
|
1f4373b6e5
|
Merge pull request #5185 from chenhuiyu/feature/add-sailorllm-template
Add SailorLLM template
Former-commit-id: 28387d6b2f9e3bcc6321345c46b525c8180ebf7e
|
2024-08-19 23:51:49 +08:00 |
|
hoshi-hiyouga
|
525747b472
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: cd3c536cb3936061d905256850b0e57df4498010
|
2024-08-19 23:51:39 +08:00 |
|
hoshi-hiyouga
|
472f12c985
|
Merge pull request #5193 from Ricardo-L-C/main
_is_bf16_available judgment supports npu
Former-commit-id: 18b9ac49c45af773a2ea563f5e1852dc4b775db8
|
2024-08-19 23:40:59 +08:00 |
|
hoshi-hiyouga
|
b681f24f43
|
Update template.py
Former-commit-id: c6822a217e1c296f4aedd9a2c7610acd1dbd443e
|
2024-08-19 23:40:16 +08:00 |
|
hiyouga
|
fd02b089b6
|
update readme
Former-commit-id: 756e438866876fa54495cf557dd1e299b17a42fb
|
2024-08-19 23:32:04 +08:00 |
|
Ricardo
|
57d4c4a4f8
|
_is_bf16_available judgment supports npu
Former-commit-id: 50a1e892a1005b4cdd82dca1005f71db08ed89a2
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
3595d26846
|
fix: report correct device count for intel xpu
Former-commit-id: 0618f660b6511599365bd9be64499dbab41a79ba
|
2024-08-15 08:30:43 +00:00 |
|
Huiyu Chen
|
22a79c169d
|
Add SailorLLM template
Former-commit-id: a594abe0321a718394a97b5a48ded16e2012c1f0
|
2024-08-15 15:10:14 +08:00 |
|
liu-zichen
|
75dfe259cf
|
fix lr not change
Former-commit-id: 387dd2d51b5d8cd666459040fdd16525b34720d9
|
2024-08-13 16:33:34 +08:00 |
|
codingma
|
2e257d6af0
|
add tutorial and doc links
Former-commit-id: 4f6072562a34e0ec97471210ff54244cf0d0f3df
|
2024-08-13 16:13:10 +08:00 |
|
“Wzw”
|
e734222373
|
fix Llama-template's system prompt bug
Former-commit-id: 2e3eddcd0918b0c968ded0df7c82e3dcff870381
|
2024-08-12 19:22:12 +08:00 |
|
hiyouga
|
6a351b9912
|
update readme
Former-commit-id: 4fecc5ee56873a7ab4941e46a5168cfe2ecb4bb6
|
2024-08-10 10:17:35 +08:00 |
|
hiyouga
|
cfc04aa162
|
update readme
Former-commit-id: fa7bc9f1c7347153f9092ffbbb8e88c6b2f59632
|
2024-08-09 20:46:02 +08:00 |
|
hiyouga
|
943c795318
|
add magpie ultra dataset
Former-commit-id: 3317b24329b87e30f13a78936ac5554f211abf7a
|
2024-08-09 20:28:55 +08:00 |
|
hiyouga
|
7fb61bad04
|
add qwen2 math models
Former-commit-id: 72ff43a1772c9de5ff914d5e1c8bdc8dea9ae0c8
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
47efcdb1dd
|
update examples
Former-commit-id: d5c57c8b7f64afe8061045ec9689abbac45c1175
|
2024-08-09 20:13:46 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
hoshi-hiyouga
|
7e755e9cac
|
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
Former-commit-id: f08390d252d42a812b71a08daba7339cc40889b7
|
2024-08-09 19:51:33 +08:00 |
|
hiyouga
|
9d1e2c3c1f
|
update scripts
Former-commit-id: dabf5a1dc661a6581474c6a5ec115322d168ed5f
|
2024-08-09 19:16:23 +08:00 |
|
hiyouga
|
5af32ce705
|
follow #5115
Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a
|
2024-08-09 18:03:00 +08:00 |
|
hoshi-hiyouga
|
4e8861e653
|
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
Former-commit-id: 2c6dae45f7a7b72c961489ac407b1b444ab7752e
|
2024-08-09 17:58:27 +08:00 |
|
hoshi-hiyouga
|
d4d7ffb17c
|
Merge pull request #5072 from relic-yuexi/main
fix the deepseekcoder template to avoid repeat problem
Former-commit-id: 2ae7d5c91725eab9f994015d8d3577894c7978b6
|
2024-08-09 16:35:21 +08:00 |
|
hoshi-hiyouga
|
46f834ec75
|
Update template.py
Former-commit-id: ae2a5221c109ae3474d219c37433be767abbee91
|
2024-08-09 16:27:42 +08:00 |
|