Commit Graph

163 Commits

Author SHA1 Message Date
hiyouga
4b8add7287 update model name 2025-01-02 12:19:21 +00:00
hiyouga
67442bd497 add gpt2 model 2025-01-02 12:07:38 +00:00
hiyouga
e67b9dcc3a add deepseek3 model 2024-12-30 13:39:20 +00:00
hiyouga
6f5bb3b8e5 fix #6482 2024-12-30 06:03:07 +00:00
hiyouga
ee0e400f41 add qvq #6439 2024-12-25 07:52:41 +00:00
hiyouga
8fd38d273e update readme 2024-12-23 14:08:59 +00:00
hoshi-hiyouga
c23a4d0658 Merge pull request #5922 from Tuyohai/main
support granite3 models
2024-12-23 16:46:02 +08:00
hiyouga
d3509050dc add paligemma2 2024-12-18 08:57:26 +00:00
hoshi-hiyouga
015f213788 Merge pull request #6313 from ge-xing/main
support telechat2 model
2024-12-18 16:16:17 +08:00
hiyouga
b24ae55ebf support llama3 tool prompt 2024-12-17 15:52:37 +00:00
zhaohu xing
04f19ed0f3 support telechat2 model 2024-12-17 12:15:33 +00:00
hiyouga
2d107d3aef generalized packing & fix #6343 2024-12-17 10:26:19 +00:00
hiyouga
1324d158f9 support batch infer in vllm 2024-12-04 13:50:00 +00:00
hiyouga
68a612115a add qwq 2024-11-28 08:50:57 +00:00
hiyouga
ec9ff8caa2 add skywork o1 2024-11-27 05:51:59 +00:00
hiyouga
17afb7d410 add marco-o1 and openo1 dataset 2024-11-27 04:20:23 +00:00
hiyouga
b0ccc2ee86 set dev version 2024-11-25 01:36:49 +08:00
hiyouga
d622f8fdec release v0.9.1 2024-11-24 23:48:41 +08:00
hiyouga
446441fdb0 fix inputs 2024-11-23 18:26:02 +00:00
marko1616
3f2c056253 Support llama3.2vl. 2024-11-23 16:07:35 +00:00
hoshi-hiyouga
bd639a137e Merge pull request #6078 from wtmlon/support-efficient-tokens-calculation
support effective tokens calculation on sft/dpo
2024-11-20 13:43:15 +08:00
Ting
40627c601e code refactor 2024-11-19 20:33:18 +08:00
hiyouga
431ac4892c add qwen-coder and opencoder 2024-11-15 21:48:38 +08:00
hiyouga
c5fae465ec update datasets version 2024-11-04 07:52:26 +00:00
steven
6eefb4d7d2 support granite3 models 2024-11-04 10:35:03 +08:00
hiyouga
2e843d989e fix phi3 template 2024-11-02 21:31:23 +08:00
hoshi-hiyouga
c58cc22d06 Merge pull request #5910 from Cuiyn/index
Support Index series models.
2024-11-02 20:16:54 +08:00
Cuiyn
ecca9db66b fix: rename to Index-1.9B-Charater-Chat and Index-1.9B-Chat-32K 2024-11-02 20:04:14 +08:00
hiyouga
c38aa29336 support rank0 logger 2024-11-02 18:31:04 +08:00
Cuiyn
a15a69ab44 Add support for Index 2024-11-02 13:45:27 +08:00
hiyouga
30567a1487 fix incorrect loss value for vlms 2024-10-30 08:56:46 +00:00
Kingsley
67f59579d7 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-29 21:01:25 +08:00
hiyouga
ae045c884f fix #5747 2024-10-29 10:47:04 +00:00
hiyouga
21db8ed2f4 use pre-commit 2024-10-29 09:07:46 +00:00
hiyouga
77666bd227 update requires 2024-10-29 16:10:07 +08:00
Kingsley
0d3106e9fa Merge branch 'hiyouga:main' into pixtral-patch 2024-10-23 15:30:03 +08:00
KUANGDD
341a79fb96 Merge branch 'pixtral-patch' of https://github.com/Kuangdd01/LLaMA-Factory-X into pixtral-patch 2024-10-23 15:28:19 +08:00
KUANGDD
9d6143e36a modify style & little change 2024-10-23 15:24:07 +08:00
hoshi-hiyouga
d155b7008c fix #5768 2024-10-22 11:06:22 +08:00
hoshi-hiyouga
769fbb6349 Update misc.py 2024-10-17 19:48:51 +08:00
KUANGDD
9f44598b92 required transformers version 2024-10-14 21:11:09 +08:00
Kingsley
95330893c5 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-13 17:42:02 +08:00
hiyouga
3af57795dd tiny fix 2024-10-11 23:51:54 +08:00
huniu20
843b5d85e9 bugs fixed 2024-10-11 19:56:13 +08:00
huniu20
7b91be33c9 add om_hub_token argument 2024-10-10 17:16:46 +08:00
huniu20
0f669f221a 1. add model and dataset info to support webui 2024-10-10 16:46:34 +08:00
huniu20
24ebe187e3 1. add modelers hub support 2024-10-09 17:21:37 +08:00
Kingsley
93a441a6b7 Merge branch 'hiyouga:main' into pixtral-patch 2024-10-08 21:04:08 +08:00
hiyouga
451d271718 tiny fix 2024-10-08 17:48:56 +08:00
Kingsley
15d555c8c5 register model fix 2024-09-30 20:04:47 +08:00