370 Commits

Author SHA1 Message Date
hiyouga
25b9cfa163 update scripts
Former-commit-id: 86f7099fa3fadd9c5a2059361ab5a5e1dbf5b1a2
2024-08-09 19:16:23 +08:00
hiyouga
20013e130b fix #5048
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
hoshi-hiyouga
2f72383969 Update README.md
Former-commit-id: 9e409eadb0d43b90f2df6b458182b591831cf3e9
2024-07-30 01:53:19 +08:00
hoshi-hiyouga
f510c2d279 Update README.md
Former-commit-id: 8d5a41f2cdc15707ec6e0373b86463e962c31b7a
2024-07-30 01:52:35 +08:00
liudan
3c3a5c09dc 增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接
Former-commit-id: b9ed9d45cc2bb82ab042c282ddb3e5e97b554541
2024-07-29 10:58:28 +08:00
hiyouga
884b0bbb4f tiny fix
Former-commit-id: 668654b5adae3f897d5291b81410226e1304eff9
2024-07-26 11:51:00 +08:00
hoshi-hiyouga
e2720c11b1 Merge pull request #4970 from HardAndHeavy/add-rocm
Add ROCm support

Former-commit-id: b8896b9b8bf025fd150e8bdeecf3b4355dc958aa
2024-07-26 11:41:23 +08:00
hoshi-hiyouga
d4e84b9a11 Update README.md
Former-commit-id: 1186ad53d43dace9dec335331dbe246f1c5a729b
2024-07-26 11:29:28 +08:00
hoshi-hiyouga
f38decfbaf Update README.md
Former-commit-id: f97beca23a1c79df38769b8dd40c9b19d4e5ef5c
2024-07-26 11:29:09 +08:00
HardAndHeavy
27f42f6319 Add ROCm support
Former-commit-id: c8e18a669adc775f17555cbf06a5ceef6c0d6235
2024-07-25 21:29:28 +03:00
khazic
ed5c75bd64 Added the reference address for TRL PPO details.
Former-commit-id: ceba96f9ed121bb75b8e802d9b758871a94046f1
2024-07-25 09:03:21 +08:00
hiyouga
bc36e36658 fix #4959
Former-commit-id: 77cff78863918656662b41d259b68669b7cc2237
2024-07-24 23:44:00 +08:00
hoshi-hiyouga
4e429f2e05 Update README.md
Former-commit-id: 5626bdc56d5cfb71a6c7c9629e69810dcba22594
2024-07-24 21:07:14 +08:00
hiyouga
e0875f82b3 add llama3.1
Former-commit-id: 26533c0604ef765170f93986bc06f3066c5e28ee
2024-07-24 16:20:11 +08:00
hiyouga
0d438e5cf4 update readme
Former-commit-id: 87346c094631b054ca975694416df324d2031c9a
2024-07-03 19:39:05 +08:00
wangzhihong
3881f4eb58 add LazyLLM to Projects using LLaMA Factory in README.md
Former-commit-id: 22da47ba27dc9c15887d21d47c456fb26fc81f5b
2024-07-03 11:12:20 +08:00
hiyouga
768093c789 update readme
Former-commit-id: d4e2af1fa422caeb1a2daff7cb9af17073cab13c
2024-07-01 00:22:52 +08:00
hiyouga
bbc37b2880 fix #4398 #4592
Former-commit-id: d74244d56858d837044e5c9cea57a1b3c2ca0214
2024-06-30 21:28:51 +08:00
hiyouga
c3792dae9f update readme
Former-commit-id: 0e0d69b77c36a6110f43b0c760e9b86e2f5ee267
2024-06-28 06:55:19 +08:00
hiyouga
d3b7c489f2 add Gemma2 models
Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb
2024-06-28 01:26:50 +08:00
hiyouga
7c488cea57 tiny fix
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
2024-06-27 20:14:48 +08:00
hoshi-hiyouga
37d3adb1f8 Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile

Former-commit-id: 64b131dcfa381045cba6b77ab9e0dbf6a3934e03
2024-06-27 20:05:26 +08:00
hiyouga
d2d9fa4abb support HQQ/EETQ #4113
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hzhaoy
c662c2e56f add flash-attn installation flag in Dockerfile
Former-commit-id: e19491b0f0446f2fb2154cf14e0b2fbba5b54808
2024-06-27 00:13:30 +08:00
hiyouga
dafc9268bc fix #4419
Former-commit-id: efb81b25ecd5cb9f4cfda8f2da8b159e4ab26a90
2024-06-25 01:51:29 +08:00
hiyouga
d519c2fde5 tiny fix
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
cbc23fc299 Update README.md
Former-commit-id: 5dc8fa647e9af2c6d666c9559553c05d1c4860b3
2024-06-25 01:03:38 +08:00
MengqingCao
3b499948a5 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
  2. move cuda docker to docker-cuda and tiny changes to adapt to the new path


Former-commit-id: d7207e8ad10c7df6dcb1f5e59ff8eb06f9d77e67
2024-06-24 10:57:36 +00:00
hiyouga
7be502c5c5 update readme
Former-commit-id: e507e60638b2e8c66f24805b3b28f6b9f98f5924
2024-06-24 18:22:12 +08:00
hiyouga
9e5988717d tiny fix
Former-commit-id: 344b9a36b2e0b60ee61fba171b35a391e3517fed
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9b30635ff0 Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples

Former-commit-id: 10316dd8ca812382ddbaad0b8fce67d9b000df34
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b add deepseek coder v2 #4346
Former-commit-id: a233fbc258d38c62d78b9d1eaf034720361795e6
2024-06-18 22:53:54 +08:00
hiyouga
9e0ec3831f update readme
Former-commit-id: fcb2e8e7b7b79915af24c4e3264b579b3649ea90
2024-06-17 18:47:24 +08:00
Eli Costa
26e942b0ad Update README.md
Add Magpie and Webinstruct to README

Former-commit-id: 103664203cf5a8562b5b000676ce95a6da2b7698
2024-06-16 11:19:25 -03:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
4dcd124dbd update readme
Former-commit-id: acd84ce5350ef985e3712a40442c6f7a54d08d40
2024-06-15 05:13:16 +08:00
hiyouga
0926d81053 update examples
Former-commit-id: b6e008c152421db668c971b0828cbee6a80b16bc
2024-06-13 03:15:06 +08:00
hiyouga
e89d1b1ec3 add neo-sft dataset
Former-commit-id: c7a5620ccc72b7574255ea764693ccb866c48263
2024-06-13 01:00:56 +08:00
hiyouga
99ce085415 fix lint
Former-commit-id: 713fde4259233af645bade7790211064a07a2a6f
2024-06-13 00:48:44 +08:00
hiyouga
b2b0b96051 fix docker compose usage
Former-commit-id: 947a34f53b74e4cd2b964941cf1580bcabde2228
2024-06-13 00:07:48 +08:00
hiyouga
77e4dc255f update readme
Former-commit-id: 2ce2e5bc478f6ffcafe8e6451b1fef4e8994694c
2024-06-12 17:39:12 +08:00
hiyouga
d984776d35 fix #4145
Fix the docker image


Former-commit-id: 949e9908ad634874cf5449ee9904745c7acda611
2024-06-11 00:19:17 +08:00
-.-
b187450340 fix README
Former-commit-id: 483cdd9b6ad42bc43a97df8ce867e3a9ef9bf5bc
2024-06-08 23:51:56 +08:00
hiyouga
3547a26f86 add ultrafeedback and fineweb #4085 #4132
Former-commit-id: 12d79f89c5082eb29842b501e1cb88433a248ba3
2024-06-08 02:42:34 +08:00
hiyouga
4f3e680b57 init unittest
Former-commit-id: 1c7f0ab51906b20190f8d4db932623cff76efc01
2024-06-08 01:35:58 +08:00
hiyouga
f76d427332 fix ppo in trl 0.8.6
Former-commit-id: 2702d7e952523b584d67c8901888b492d4a79b14
2024-06-07 04:48:29 +08:00
hiyouga
d3196318be fix #4120
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d add qwen2 models
Former-commit-id: 8e95648850fdd5075724359ffdb22beb48b75952
2024-06-07 00:22:57 +08:00
hiyouga
2f0a333e9c update readme
Former-commit-id: 53eb2de75e2df372b87801cea4ccafd6e73e59df
2024-06-06 16:59:18 +08:00
hiyouga
8cc6bb961b update readme
Former-commit-id: 87a7822b98ef204a7a36fa4caf4e09a092f6a2da
2024-06-06 16:25:42 +08:00