hiyouga
|
d111a324bc
|
tiny fix
Former-commit-id: 23961bdf6fdbcde64e7b943f699fdeb4ac024043
|
2024-08-20 00:10:52 +08:00 |
|
hoshi-hiyouga
|
388f0a6e05
|
Merge pull request #5156 from YeQiuO/main
fix Llama-template's system prompt bug
Former-commit-id: 0b57175d3bd029675dae2f55995b7eeb4e9adc7a
|
2024-08-20 00:09:03 +08:00 |
|
hoshi-hiyouga
|
8c13c02c55
|
Update template.py
Former-commit-id: f5a075cb1c90f05bb0de26c6aea718f556c54623
|
2024-08-20 00:03:33 +08:00 |
|
hoshi-hiyouga
|
a101fde917
|
Merge pull request #5163 from liu-zichen/fix_ppo_optim
fix lr not change
Former-commit-id: f3c03ec6a89bf57f290820fa31eda24291355e4e
|
2024-08-19 23:56:24 +08:00 |
|
hoshi-hiyouga
|
1f4373b6e5
|
Merge pull request #5185 from chenhuiyu/feature/add-sailorllm-template
Add SailorLLM template
Former-commit-id: 28387d6b2f9e3bcc6321345c46b525c8180ebf7e
|
2024-08-19 23:51:49 +08:00 |
|
hoshi-hiyouga
|
525747b472
|
Merge pull request #5188 from Zxilly/main
fix: report correct device count for intel xpu
Former-commit-id: cd3c536cb3936061d905256850b0e57df4498010
|
2024-08-19 23:51:39 +08:00 |
|
hoshi-hiyouga
|
b681f24f43
|
Update template.py
Former-commit-id: c6822a217e1c296f4aedd9a2c7610acd1dbd443e
|
2024-08-19 23:40:16 +08:00 |
|
Ricardo
|
57d4c4a4f8
|
_is_bf16_available judgment supports npu
Former-commit-id: 50a1e892a1005b4cdd82dca1005f71db08ed89a2
|
2024-08-16 02:58:22 +00:00 |
|
Zxilly
|
3595d26846
|
fix: report correct device count for intel xpu
Former-commit-id: 0618f660b6511599365bd9be64499dbab41a79ba
|
2024-08-15 08:30:43 +00:00 |
|
Huiyu Chen
|
22a79c169d
|
Add SailorLLM template
Former-commit-id: a594abe0321a718394a97b5a48ded16e2012c1f0
|
2024-08-15 15:10:14 +08:00 |
|
liu-zichen
|
75dfe259cf
|
fix lr not change
Former-commit-id: 387dd2d51b5d8cd666459040fdd16525b34720d9
|
2024-08-13 16:33:34 +08:00 |
|
“Wzw”
|
e734222373
|
fix Llama-template's system prompt bug
Former-commit-id: 2e3eddcd0918b0c968ded0df7c82e3dcff870381
|
2024-08-12 19:22:12 +08:00 |
|
hiyouga
|
7fb61bad04
|
add qwen2 math models
Former-commit-id: 72ff43a1772c9de5ff914d5e1c8bdc8dea9ae0c8
|
2024-08-09 20:20:35 +08:00 |
|
hiyouga
|
59cbce1a46
|
add adam_mini to readme
Former-commit-id: d610c6bcf8a8ba6f4236f5d11f79571b83f4fb11
|
2024-08-09 20:02:03 +08:00 |
|
hoshi-hiyouga
|
7e755e9cac
|
Merge pull request #5095 from relic-yuexi/feat-optimizer
Feat optimizer
Former-commit-id: f08390d252d42a812b71a08daba7339cc40889b7
|
2024-08-09 19:51:33 +08:00 |
|
hiyouga
|
5af32ce705
|
follow #5115
Former-commit-id: 7d917e03e2df570139bae18227d9c7303a12de2a
|
2024-08-09 18:03:00 +08:00 |
|
hoshi-hiyouga
|
4e8861e653
|
Merge pull request #5115 from YeQiuO/main
fix: `Train on the last turn only` truncate bug
Former-commit-id: 2c6dae45f7a7b72c961489ac407b1b444ab7752e
|
2024-08-09 17:58:27 +08:00 |
|
hoshi-hiyouga
|
46f834ec75
|
Update template.py
Former-commit-id: ae2a5221c109ae3474d219c37433be767abbee91
|
2024-08-09 16:27:42 +08:00 |
|
“Wzw”
|
6ec64a7e56
|
mask_history args verify valid
Former-commit-id: 2f8388b4f4195d934400ad9267d72e10ca4105a3
|
2024-08-08 10:12:01 +08:00 |
|
“Wzw”
|
d71446e387
|
fix mask_history tiny bug
Former-commit-id: cac07aac6196be026f723b2397a343d4fb675973
|
2024-08-08 10:09:33 +08:00 |
|
moontidef
|
8f42d7df56
|
feat: add support for adammini
Former-commit-id: a2d5fafb705ff44db1711e972490f0abebc2012b
|
2024-08-07 10:08:22 +08:00 |
|
moontidef
|
33a90b9026
|
fix: rename optimzer to optimizer
Former-commit-id: 186dc1fde822e6a603ac273538741ea3853f243e
|
2024-08-07 10:05:01 +08:00 |
|
moontidef
|
710902b0d0
|
Merge branch 'hiyouga:main' into main
Former-commit-id: d1b23283e0e4286f126d38d7bdc55802f74c8922
|
2024-08-06 00:18:45 +08:00 |
|
moontidef
|
7b4f5d3b21
|
fix: fix the deepseekcoder template to avoid repeat problem
Former-commit-id: 56294831115f095135f72490a8a435434b2f0a11
|
2024-08-05 23:55:45 +08:00 |
|
hiyouga
|
13093963b1
|
fix #5048
Former-commit-id: 71a6861667ae68c1fd6a69acf68e1359b858cf1b
|
2024-08-05 23:48:19 +08:00 |
|
codingma
|
4b6252151e
|
support gemma-2-2b
Former-commit-id: 7037192cf6049fd7d675aed4a6237ed929c6b170
|
2024-08-01 13:45:48 +08:00 |
|
hoshi-hiyouga
|
f3765d1996
|
Merge pull request #5010 from Eruly/main
Add Korean web UI (llamafactory-cli webui)
Former-commit-id: 2050806aa826028df45c0c746b4314afe178dcd3
|
2024-07-30 01:55:54 +08:00 |
|
eruly
|
9fdf800750
|
Add Korean web UI (llamafactory-cli webui)
Former-commit-id: 357a035f2aeb9548368c230c5a17dcdfa4844b17
|
2024-07-29 13:47:13 +00:00 |
|
codingma
|
75e80fa820
|
fix pissa save
Former-commit-id: 25a1dad7c8df79c15efecb8c6f871a13a327f57a
|
2024-07-29 10:44:34 +08:00 |
|
hoshi-hiyouga
|
9f74d36ba4
|
Merge pull request #4892 from piamo/main
update deepseek template
Former-commit-id: 3233efc8404972098665286d9dec7312dd6ecfab
|
2024-07-26 11:49:34 +08:00 |
|
hoshi-hiyouga
|
fc2435f135
|
Merge pull request #4950 from liuwwang/main and fi
fix: Repair the issue where quantization failed after merging the adapter.
Former-commit-id: 93a68ea1f4372973f745a2c250250ecaac515e27
|
2024-07-26 11:48:56 +08:00 |
|
hiyouga
|
28b5f656db
|
update webui
Former-commit-id: 463edec1b1c1345afc791e225deb33f118f3582e
|
2024-07-24 21:11:51 +08:00 |
|
hiyouga
|
211038584a
|
tiny fix
Former-commit-id: 28cac0e325bfd7a6c0c344ad2d46511613190cd7
|
2024-07-24 18:33:39 +08:00 |
|
hiyouga
|
ff5ba97970
|
fix #4928
Former-commit-id: 6d557e8959678f9d4edbcb3d5a6dfba14b429b18
|
2024-07-24 17:00:29 +08:00 |
|
hiyouga
|
5c6d88e91c
|
add mistral nemo model
Former-commit-id: 428bb49f53b32947bc0a62ca19ab10844154c07c
|
2024-07-24 16:25:53 +08:00 |
|
hiyouga
|
0a04d9470f
|
add llama3.1
Former-commit-id: 3c433890f9b61c520572f5233aae70584da0f330
|
2024-07-24 16:20:11 +08:00 |
|
Liuww
|
f0408c0dde
|
fix: Repair the issue where quantization failed after merging the adapter.
Former-commit-id: 8109561b7f577d448f8bca7e569f7f443cf6bb52
|
2024-07-24 14:31:29 +08:00 |
|
hiyouga
|
a041f4a111
|
tiny fix
Former-commit-id: bf6a2f032c598f969708c1c3db4875d6239c41a9
|
2024-07-22 21:10:15 +08:00 |
|
hoshi-hiyouga
|
cdf9dae53e
|
fix #4917
Former-commit-id: e26919aafd8436489d065789c9c25d72c8d05a6d
|
2024-07-22 11:28:31 +08:00 |
|
hiyouga
|
1917f431f5
|
tiny fix
Former-commit-id: 9133316e558a3c8744f5eb6ab8678686bf4859ed
|
2024-07-22 00:06:03 +08:00 |
|
hiyouga
|
a770afbff2
|
fix flashattn + packing
Former-commit-id: 4adc6ce4abc718c25f39b316bfc3352d0d01ed1e
|
2024-07-21 17:07:45 +08:00 |
|
huangpan.foo
|
b1a5bf025b
|
update deepseek template
Former-commit-id: f5ca86ec95bb301df42ffaa6923fc3037a224e34
|
2024-07-19 15:02:54 +08:00 |
|
hiyouga
|
adff3e5050
|
set dev version
Former-commit-id: 0b9a2275dc533b65578278f979ce053e95a644b3
|
2024-07-19 02:01:46 +08:00 |
|
hiyouga
|
0e88c5754f
|
update parser
Former-commit-id: 5262c8702382ff8bc36a172387bc4c8949f326ea
|
2024-07-19 01:36:39 +08:00 |
|
hiyouga
|
3fff875f99
|
release v0.8.3
Former-commit-id: 7180a3b99c3c218dfb0dc607ad5e87219269a678
|
2024-07-19 01:21:18 +08:00 |
|
hiyouga
|
994b9089e9
|
add unittest
Former-commit-id: 8a1f0c5f922989e08a19c65de0b2c4afd2a5771f
|
2024-07-19 01:06:27 +08:00 |
|
hiyouga
|
4c1513a845
|
follow #4878 fix #4684
Former-commit-id: 4715e5c5b8040b21e5f401f7e969b9fd2757d520
|
2024-07-18 22:06:12 +08:00 |
|
Shiyu Zhang
|
c1e1918db1
|
仅仅训练最后一轮对话
Former-commit-id: ab6198e4c099edeb1a400f58729cd617e8cd8e50
|
2024-07-18 15:30:25 +08:00 |
|
hiyouga
|
341225a405
|
fix metrics #4786
Former-commit-id: 7d0c4bd394fc3cba197db1719f1164b9dd66ac21
|
2024-07-17 00:47:00 +08:00 |
|
hiyouga
|
8c93921952
|
support batch_eval_metrics, fix #4826
Former-commit-id: 3fe1df17188825f8a32fbe6a1294b4b532ce0c85
|
2024-07-17 00:33:00 +08:00 |
|