hzhaoy
c662c2e56f
add flash-attn installation flag in Dockerfile
Former-commit-id: e19491b0f0
2024-06-27 00:13:30 +08:00
hiyouga
dafc9268bc
fix #4419
Former-commit-id: efb81b25ec
2024-06-25 01:51:29 +08:00
hiyouga
d519c2fde5
tiny fix
Former-commit-id: 41086059b1
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
678884f97c
Update README_zh.md
Former-commit-id: ec95f942d1
2024-06-25 01:06:59 +08:00
MengqingCao
3b499948a5
update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
2. move cuda docker to docker-cuda and tiny changes to adapt to the new path
Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
a1df18c5df
update readme
Former-commit-id: 4ea84a8333
2024-06-24 18:29:04 +08:00
hiyouga
7be502c5c5
update readme
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
hiyouga
9e5988717d
tiny fix
Former-commit-id: 344b9a36b2
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9b30635ff0
Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples
Former-commit-id: 10316dd8ca
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b
add deepseek coder v2 #4346
Former-commit-id: a233fbc258
2024-06-18 22:53:54 +08:00
hiyouga
9e0ec3831f
update readme
Former-commit-id: fcb2e8e7b7
2024-06-17 18:47:24 +08:00
Eli Costa
d7459853d8
Update README_zh.md
Fix details tag in datasets menus
Former-commit-id: 3ec57ac239
2024-06-16 11:34:31 -03:00
Eli Costa
ee30db72a3
Update README_zh.md
Add Magpie and WebInstruct to README
Former-commit-id: 82d5c5c1e8
2024-06-16 11:22:06 -03:00
hiyouga
f25b8626bf
support pissa
Former-commit-id: 8c1046d78a
2024-06-16 01:08:12 +08:00
hiyouga
4dcd124dbd
update readme
Former-commit-id: acd84ce535
2024-06-15 05:13:16 +08:00
hiyouga
0926d81053
update examples
Former-commit-id: b6e008c152
2024-06-13 03:15:06 +08:00
hiyouga
e89d1b1ec3
add neo-sft dataset
Former-commit-id: c7a5620ccc
2024-06-13 01:00:56 +08:00
hiyouga
99ce085415
fix lint
Former-commit-id: 713fde4259
2024-06-13 00:48:44 +08:00
hiyouga
b2b0b96051
fix docker compose usage
Former-commit-id: 947a34f53b
2024-06-13 00:07:48 +08:00
hiyouga
77e4dc255f
update readme
Former-commit-id: 2ce2e5bc47
2024-06-12 17:39:12 +08:00
hiyouga
d984776d35
fix #4145
Fix the docker image
Former-commit-id: 949e9908ad
2024-06-11 00:19:17 +08:00
-.-
b187450340
fix README
Former-commit-id: 483cdd9b6a
2024-06-08 23:51:56 +08:00
hiyouga
3547a26f86
add ultrafeedback and fineweb #4085 #4132
Former-commit-id: 12d79f89c5
2024-06-08 02:42:34 +08:00
hiyouga
4f3e680b57
init unittest
Former-commit-id: 1c7f0ab519
2024-06-08 01:35:58 +08:00
hiyouga
f76d427332
fix ppo in trl 0.8.6
Former-commit-id: 2702d7e952
2024-06-07 04:48:29 +08:00
hiyouga
d3196318be
fix #4120
Former-commit-id: f9e818d79c
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d
add qwen2 models
Former-commit-id: 8e95648850
2024-06-07 00:22:57 +08:00
hiyouga
2f0a333e9c
update readme
Former-commit-id: 53eb2de75e
2024-06-06 16:59:18 +08:00
hiyouga
8cc6bb961b
update readme
Former-commit-id: 87a7822b98
2024-06-06 16:25:42 +08:00
hiyouga
cceff9f520
lora modules: all by default
Former-commit-id: cae4737907
2024-06-06 03:53:28 +08:00
hiyouga
cafbb79d3a
support image input in api #3971 #4061
Former-commit-id: 946f601136
2024-06-06 02:29:55 +08:00
hiyouga
b097f04a79
update readme
Former-commit-id: eef1e542a9
2024-06-05 16:32:32 +08:00
hiyouga
94c37490d1
support glm-4
Former-commit-id: f48f5e646e
2024-06-05 15:16:38 +08:00
hiyouga
72ebcb9a04
update readme
Former-commit-id: c4f50865ad
2024-05-30 16:40:17 +08:00
hiyouga
a71a6a05c3
update readme
Former-commit-id: 89ca832740
2024-05-29 18:39:11 +08:00
hoshi-hiyouga
2e7dae0f97
Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency
Former-commit-id: 880b4a9acf
2024-05-29 18:33:38 +08:00
MengqingCao
29fe1cd688
update cann kernels url
Former-commit-id: e14f5b37e4
2024-05-29 09:53:31 +00:00
hiyouga
3152c7dd1c
update readme
Former-commit-id: 087b9faa39
2024-05-28 19:35:52 +08:00
hiyouga
2a473f36fb
update readme
Former-commit-id: c8765349ba
2024-05-28 16:41:34 +08:00
hiyouga
ac9c52dfb4
update readme
Former-commit-id: 99ee0dadd9
2024-05-28 16:19:56 +08:00
hiyouga
f41319f31b
fix #3931
Former-commit-id: 5d45adf47d
2024-05-28 13:44:22 +08:00
MengqingCao
099a932cbc
add Ascend npu doc and dependency
Former-commit-id: cd67d6eeb5
2024-05-28 01:33:54 +00:00
hiyouga
db569a2d61
add llava 1k datasets
Former-commit-id: 08bd0440b5
2024-05-27 19:57:33 +08:00
hiyouga
51a1097c64
add phi-3 7b/14b, mistral v0.3 models
Former-commit-id: efa4b196ca
2024-05-27 18:20:16 +08:00
hiyouga
df33548b39
update readme
Former-commit-id: 5581cb2e4e
2024-05-27 18:14:02 +08:00
hiyouga
4807c11db8
support SimPO #3900
Former-commit-id: cb63b32986
2024-05-26 23:46:33 +08:00
donggang
3f52df0ca9
adapted to 910B image
Former-commit-id: 2f68a71fc0
2024-05-23 09:48:22 +00:00
hiyouga
eabaf0def8
update wechat
Former-commit-id: 2670f6fb3d
2024-05-21 18:22:32 +08:00
hiyouga
11f79ea20e
fix #3847
Former-commit-id: 335501e228
2024-05-21 17:53:06 +08:00
hiyouga
cce3892f91
support paligemma
Former-commit-id: 2a67457e39
2024-05-21 00:01:22 +08:00