hiyouga
5eacd17090
add adam_mini to readme
...
Former-commit-id: e2a28f51c635d64ff9de65a37087d89356bdedcc
2024-08-09 20:02:03 +08:00
hiyouga
25b9cfa163
update scripts
...
Former-commit-id: 86f7099fa3fadd9c5a2059361ab5a5e1dbf5b1a2
2024-08-09 19:16:23 +08:00
hiyouga
20013e130b
fix #5048
...
Former-commit-id: b7ca6c8dc14f689d0df16684a6121cc0ec24f8ba
2024-08-05 23:48:19 +08:00
hoshi-hiyouga
2f72383969
Update README.md
...
Former-commit-id: 9e409eadb0d43b90f2df6b458182b591831cf3e9
2024-07-30 01:53:19 +08:00
hoshi-hiyouga
f510c2d279
Update README.md
...
Former-commit-id: 8d5a41f2cdc15707ec6e0373b86463e962c31b7a
2024-07-30 01:52:35 +08:00
liudan
3c3a5c09dc
增加了MiniCPM在页面首页的支持列表,MiniCPM官方github也放了LLama_factory的友情链接
...
Former-commit-id: b9ed9d45cc2bb82ab042c282ddb3e5e97b554541
2024-07-29 10:58:28 +08:00
hiyouga
884b0bbb4f
tiny fix
...
Former-commit-id: 668654b5adae3f897d5291b81410226e1304eff9
2024-07-26 11:51:00 +08:00
hoshi-hiyouga
e2720c11b1
Merge pull request #4970 from HardAndHeavy/add-rocm
...
Add ROCm support
Former-commit-id: b8896b9b8bf025fd150e8bdeecf3b4355dc958aa
2024-07-26 11:41:23 +08:00
hoshi-hiyouga
d4e84b9a11
Update README.md
...
Former-commit-id: 1186ad53d43dace9dec335331dbe246f1c5a729b
2024-07-26 11:29:28 +08:00
hoshi-hiyouga
f38decfbaf
Update README.md
...
Former-commit-id: f97beca23a1c79df38769b8dd40c9b19d4e5ef5c
2024-07-26 11:29:09 +08:00
HardAndHeavy
27f42f6319
Add ROCm support
...
Former-commit-id: c8e18a669adc775f17555cbf06a5ceef6c0d6235
2024-07-25 21:29:28 +03:00
khazic
ed5c75bd64
Added the reference address for TRL PPO details.
...
Former-commit-id: ceba96f9ed121bb75b8e802d9b758871a94046f1
2024-07-25 09:03:21 +08:00
hiyouga
bc36e36658
fix #4959
...
Former-commit-id: 77cff78863918656662b41d259b68669b7cc2237
2024-07-24 23:44:00 +08:00
hoshi-hiyouga
4e429f2e05
Update README.md
...
Former-commit-id: 5626bdc56d5cfb71a6c7c9629e69810dcba22594
2024-07-24 21:07:14 +08:00
hiyouga
e0875f82b3
add llama3.1
...
Former-commit-id: 26533c0604ef765170f93986bc06f3066c5e28ee
2024-07-24 16:20:11 +08:00
hiyouga
0d438e5cf4
update readme
...
Former-commit-id: 87346c094631b054ca975694416df324d2031c9a
2024-07-03 19:39:05 +08:00
wangzhihong
3881f4eb58
add LazyLLM to Projects using LLaMA Factory
in README.md
...
Former-commit-id: 22da47ba27dc9c15887d21d47c456fb26fc81f5b
2024-07-03 11:12:20 +08:00
hiyouga
768093c789
update readme
...
Former-commit-id: d4e2af1fa422caeb1a2daff7cb9af17073cab13c
2024-07-01 00:22:52 +08:00
hiyouga
bbc37b2880
fix #4398 #4592
...
Former-commit-id: d74244d56858d837044e5c9cea57a1b3c2ca0214
2024-06-30 21:28:51 +08:00
hiyouga
c3792dae9f
update readme
...
Former-commit-id: 0e0d69b77c36a6110f43b0c760e9b86e2f5ee267
2024-06-28 06:55:19 +08:00
hiyouga
d3b7c489f2
add Gemma2 models
...
Former-commit-id: 6f63050e1b61742d5f7e48bdc62c46748031d7cb
2024-06-28 01:26:50 +08:00
hiyouga
7c488cea57
tiny fix
...
Former-commit-id: e44a4f07f09bbee55c10ccee91dd858256c36054
2024-06-27 20:14:48 +08:00
hoshi-hiyouga
37d3adb1f8
Merge pull request #4461 from hzhaoy/feature/support-flash-attn
...
support flash-attn in Dockerfile
Former-commit-id: 64b131dcfa381045cba6b77ab9e0dbf6a3934e03
2024-06-27 20:05:26 +08:00
hiyouga
d2d9fa4abb
support HQQ/EETQ #4113
...
Former-commit-id: ad144c2265cdee0d23014dbb3d017ea257cb26ed
2024-06-27 00:29:42 +08:00
hzhaoy
c662c2e56f
add flash-attn installation flag in Dockerfile
...
Former-commit-id: e19491b0f0446f2fb2154cf14e0b2fbba5b54808
2024-06-27 00:13:30 +08:00
hiyouga
dafc9268bc
fix #4419
...
Former-commit-id: efb81b25ecd5cb9f4cfda8f2da8b159e4ab26a90
2024-06-25 01:51:29 +08:00
hiyouga
d519c2fde5
tiny fix
...
Former-commit-id: 41086059b12ecb7827eb390294e315068ff9c2e6
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
cbc23fc299
Update README.md
...
Former-commit-id: 5dc8fa647e9af2c6d666c9559553c05d1c4860b3
2024-06-25 01:03:38 +08:00
MengqingCao
3b499948a5
update docker files
...
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
Former-commit-id: d7207e8ad10c7df6dcb1f5e59ff8eb06f9d77e67
2024-06-24 10:57:36 +00:00
hiyouga
7be502c5c5
update readme
...
Former-commit-id: e507e60638b2e8c66f24805b3b28f6b9f98f5924
2024-06-24 18:22:12 +08:00
hiyouga
9e5988717d
tiny fix
...
Former-commit-id: 344b9a36b2e0b60ee61fba171b35a391e3517fed
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9b30635ff0
Merge pull request #4309 from EliMCosta/patch-1
...
Add Magpie and Webinstruct dataset samples
Former-commit-id: 10316dd8ca812382ddbaad0b8fce67d9b000df34
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b
add deepseek coder v2 #4346
...
Former-commit-id: a233fbc258d38c62d78b9d1eaf034720361795e6
2024-06-18 22:53:54 +08:00
hiyouga
9e0ec3831f
update readme
...
Former-commit-id: fcb2e8e7b7b79915af24c4e3264b579b3649ea90
2024-06-17 18:47:24 +08:00
Eli Costa
26e942b0ad
Update README.md
...
Add Magpie and Webinstruct to README
Former-commit-id: 103664203cf5a8562b5b000676ce95a6da2b7698
2024-06-16 11:19:25 -03:00
hiyouga
f25b8626bf
support pissa
...
Former-commit-id: 8c1046d78ac6c8f9429b73617e35e1eccb35138f
2024-06-16 01:08:12 +08:00
hiyouga
4dcd124dbd
update readme
...
Former-commit-id: acd84ce5350ef985e3712a40442c6f7a54d08d40
2024-06-15 05:13:16 +08:00
hiyouga
0926d81053
update examples
...
Former-commit-id: b6e008c152421db668c971b0828cbee6a80b16bc
2024-06-13 03:15:06 +08:00
hiyouga
e89d1b1ec3
add neo-sft dataset
...
Former-commit-id: c7a5620ccc72b7574255ea764693ccb866c48263
2024-06-13 01:00:56 +08:00
hiyouga
99ce085415
fix lint
...
Former-commit-id: 713fde4259233af645bade7790211064a07a2a6f
2024-06-13 00:48:44 +08:00
hiyouga
b2b0b96051
fix docker compose usage
...
Former-commit-id: 947a34f53b74e4cd2b964941cf1580bcabde2228
2024-06-13 00:07:48 +08:00
hiyouga
77e4dc255f
update readme
...
Former-commit-id: 2ce2e5bc478f6ffcafe8e6451b1fef4e8994694c
2024-06-12 17:39:12 +08:00
hiyouga
d984776d35
fix #4145
...
Fix the docker image
Former-commit-id: 949e9908ad634874cf5449ee9904745c7acda611
2024-06-11 00:19:17 +08:00
-.-
b187450340
fix README
...
Former-commit-id: 483cdd9b6ad42bc43a97df8ce867e3a9ef9bf5bc
2024-06-08 23:51:56 +08:00
hiyouga
3547a26f86
add ultrafeedback and fineweb #4085 #4132
...
Former-commit-id: 12d79f89c5082eb29842b501e1cb88433a248ba3
2024-06-08 02:42:34 +08:00
hiyouga
4f3e680b57
init unittest
...
Former-commit-id: 1c7f0ab51906b20190f8d4db932623cff76efc01
2024-06-08 01:35:58 +08:00
hiyouga
f76d427332
fix ppo in trl 0.8.6
...
Former-commit-id: 2702d7e952523b584d67c8901888b492d4a79b14
2024-06-07 04:48:29 +08:00
hiyouga
d3196318be
fix #4120
...
Former-commit-id: f9e818d79cf686cb34789327add7ed1f749966c6
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d
add qwen2 models
...
Former-commit-id: 8e95648850fdd5075724359ffdb22beb48b75952
2024-06-07 00:22:57 +08:00
hiyouga
2f0a333e9c
update readme
...
Former-commit-id: 53eb2de75e2df372b87801cea4ccafd6e73e59df
2024-06-06 16:59:18 +08:00