hoshi-hiyouga
f38decfbaf
Update README.md
...
Former-commit-id: f97beca23a
2024-07-26 11:29:09 +08:00
HardAndHeavy
27f42f6319
Add ROCm support
...
Former-commit-id: c8e18a669a
2024-07-25 21:29:28 +03:00
khazic
ed5c75bd64
Added the reference address for TRL PPO details.
...
Former-commit-id: ceba96f9ed
2024-07-25 09:03:21 +08:00
hiyouga
bc36e36658
fix #4959
...
Former-commit-id: 77cff78863
2024-07-24 23:44:00 +08:00
hoshi-hiyouga
4e429f2e05
Update README.md
...
Former-commit-id: 5626bdc56d
2024-07-24 21:07:14 +08:00
hiyouga
e0875f82b3
add llama3.1
...
Former-commit-id: 26533c0604
2024-07-24 16:20:11 +08:00
hiyouga
0d438e5cf4
update readme
...
Former-commit-id: 87346c0946
2024-07-03 19:39:05 +08:00
wangzhihong
3881f4eb58
add LazyLLM to Projects using LLaMA Factory in README.md
...
Former-commit-id: 22da47ba27
2024-07-03 11:12:20 +08:00
hiyouga
768093c789
update readme
...
Former-commit-id: d4e2af1fa4
2024-07-01 00:22:52 +08:00
hiyouga
bbc37b2880
fix #4398 #4592
...
Former-commit-id: d74244d568
2024-06-30 21:28:51 +08:00
hiyouga
c3792dae9f
update readme
...
Former-commit-id: 0e0d69b77c
2024-06-28 06:55:19 +08:00
hiyouga
d3b7c489f2
add Gemma2 models
...
Former-commit-id: 6f63050e1b
2024-06-28 01:26:50 +08:00
hiyouga
7c488cea57
tiny fix
...
Former-commit-id: e44a4f07f0
2024-06-27 20:14:48 +08:00
hoshi-hiyouga
37d3adb1f8
Merge pull request #4461 from hzhaoy/feature/support-flash-attn
...
support flash-attn in Dockerfile
Former-commit-id: 64b131dcfa
2024-06-27 20:05:26 +08:00
hiyouga
d2d9fa4abb
support HQQ/EETQ #4113
...
Former-commit-id: ad144c2265
2024-06-27 00:29:42 +08:00
hzhaoy
c662c2e56f
add flash-attn installation flag in Dockerfile
...
Former-commit-id: e19491b0f0
2024-06-27 00:13:30 +08:00
hiyouga
dafc9268bc
fix #4419
...
Former-commit-id: efb81b25ec
2024-06-25 01:51:29 +08:00
hiyouga
d519c2fde5
tiny fix
...
Former-commit-id: 41086059b1
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
cbc23fc299
Update README.md
...
Former-commit-id: 5dc8fa647e
2024-06-25 01:03:38 +08:00
MengqingCao
3b499948a5
update docker files
...
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
7be502c5c5
update readme
...
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
hiyouga
9e5988717d
tiny fix
...
Former-commit-id: 344b9a36b2
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9b30635ff0
Merge pull request #4309 from EliMCosta/patch-1
...
Add Magpie and Webinstruct dataset samples
Former-commit-id: 10316dd8ca
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b
add deepseek coder v2 #4346
...
Former-commit-id: a233fbc258
2024-06-18 22:53:54 +08:00
hiyouga
9e0ec3831f
update readme
...
Former-commit-id: fcb2e8e7b7
2024-06-17 18:47:24 +08:00
Eli Costa
26e942b0ad
Update README.md
...
Add Magpie and Webinstruct to README
Former-commit-id: 103664203c
2024-06-16 11:19:25 -03:00
hiyouga
f25b8626bf
support pissa
...
Former-commit-id: 8c1046d78a
2024-06-16 01:08:12 +08:00
hiyouga
4dcd124dbd
update readme
...
Former-commit-id: acd84ce535
2024-06-15 05:13:16 +08:00
hiyouga
0926d81053
update examples
...
Former-commit-id: b6e008c152
2024-06-13 03:15:06 +08:00
hiyouga
e89d1b1ec3
add neo-sft dataset
...
Former-commit-id: c7a5620ccc
2024-06-13 01:00:56 +08:00
hiyouga
99ce085415
fix lint
...
Former-commit-id: 713fde4259
2024-06-13 00:48:44 +08:00
hiyouga
b2b0b96051
fix docker compose usage
...
Former-commit-id: 947a34f53b
2024-06-13 00:07:48 +08:00
hiyouga
77e4dc255f
update readme
...
Former-commit-id: 2ce2e5bc47
2024-06-12 17:39:12 +08:00
hiyouga
d984776d35
fix #4145
...
Fix the docker image
Former-commit-id: 949e9908ad
2024-06-11 00:19:17 +08:00
-.-
b187450340
fix README
...
Former-commit-id: 483cdd9b6a
2024-06-08 23:51:56 +08:00
hiyouga
3547a26f86
add ultrafeedback and fineweb #4085 #4132
...
Former-commit-id: 12d79f89c5
2024-06-08 02:42:34 +08:00
hiyouga
4f3e680b57
init unittest
...
Former-commit-id: 1c7f0ab519
2024-06-08 01:35:58 +08:00
hiyouga
f76d427332
fix ppo in trl 0.8.6
...
Former-commit-id: 2702d7e952
2024-06-07 04:48:29 +08:00
hiyouga
d3196318be
fix #4120
...
Former-commit-id: f9e818d79c
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d
add qwen2 models
...
Former-commit-id: 8e95648850
2024-06-07 00:22:57 +08:00
hiyouga
2f0a333e9c
update readme
...
Former-commit-id: 53eb2de75e
2024-06-06 16:59:18 +08:00
hiyouga
8cc6bb961b
update readme
...
Former-commit-id: 87a7822b98
2024-06-06 16:25:42 +08:00
hiyouga
cceff9f520
lora modules: all by default
...
Former-commit-id: cae4737907
2024-06-06 03:53:28 +08:00
hiyouga
cafbb79d3a
support image input in api #3971 #4061
...
Former-commit-id: 946f601136
2024-06-06 02:29:55 +08:00
hiyouga
b097f04a79
update readme
...
Former-commit-id: eef1e542a9
2024-06-05 16:32:32 +08:00
hiyouga
94c37490d1
support glm-4
...
Former-commit-id: f48f5e646e
2024-06-05 15:16:38 +08:00
hiyouga
72ebcb9a04
update readme
...
Former-commit-id: c4f50865ad
2024-05-30 16:40:17 +08:00
hiyouga
a71a6a05c3
update readme
...
Former-commit-id: 89ca832740
2024-05-29 18:39:11 +08:00
hoshi-hiyouga
2e7dae0f97
Merge pull request #3930 from MengqingCao/npu
...
Add Ascend npu doc and dependency
Former-commit-id: 880b4a9acf
2024-05-29 18:33:38 +08:00
MengqingCao
29fe1cd688
update cann kernels url
...
Former-commit-id: e14f5b37e4
2024-05-29 09:53:31 +00:00