Commit Graph

502 Commits

Author SHA1 Message Date
hzhaoy
c662c2e56f add flash-attn installation flag in Dockerfile
Former-commit-id: e19491b0f0
2024-06-27 00:13:30 +08:00
hiyouga
dafc9268bc fix #4419
Former-commit-id: efb81b25ec
2024-06-25 01:51:29 +08:00
hiyouga
d519c2fde5 tiny fix
Former-commit-id: 41086059b1
2024-06-25 01:15:19 +08:00
hoshi-hiyouga
678884f97c Update README_zh.md
Former-commit-id: ec95f942d1
2024-06-25 01:06:59 +08:00
MengqingCao
3b499948a5 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
  2. move cuda docker to docker-cuda and tiny changes to adapt to the new path


Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00
hiyouga
a1df18c5df update readme
Former-commit-id: 4ea84a8333
2024-06-24 18:29:04 +08:00
hiyouga
7be502c5c5 update readme
Former-commit-id: e507e60638
2024-06-24 18:22:12 +08:00
hiyouga
9e5988717d tiny fix
Former-commit-id: 344b9a36b2
2024-06-18 23:32:18 +08:00
hoshi-hiyouga
9b30635ff0 Merge pull request #4309 from EliMCosta/patch-1
Add Magpie and Webinstruct dataset samples

Former-commit-id: 10316dd8ca
2024-06-18 23:30:19 +08:00
hiyouga
e3bf22f61b add deepseek coder v2 #4346
Former-commit-id: a233fbc258
2024-06-18 22:53:54 +08:00
hiyouga
9e0ec3831f update readme
Former-commit-id: fcb2e8e7b7
2024-06-17 18:47:24 +08:00
Eli Costa
d7459853d8 Update README_zh.md
Fix details tag in datasets menus

Former-commit-id: 3ec57ac239
2024-06-16 11:34:31 -03:00
Eli Costa
ee30db72a3 Update README_zh.md
Add Magpie and WebInstruct to README

Former-commit-id: 82d5c5c1e8
2024-06-16 11:22:06 -03:00
hiyouga
f25b8626bf support pissa
Former-commit-id: 8c1046d78a
2024-06-16 01:08:12 +08:00
hiyouga
4dcd124dbd update readme
Former-commit-id: acd84ce535
2024-06-15 05:13:16 +08:00
hiyouga
0926d81053 update examples
Former-commit-id: b6e008c152
2024-06-13 03:15:06 +08:00
hiyouga
e89d1b1ec3 add neo-sft dataset
Former-commit-id: c7a5620ccc
2024-06-13 01:00:56 +08:00
hiyouga
99ce085415 fix lint
Former-commit-id: 713fde4259
2024-06-13 00:48:44 +08:00
hiyouga
b2b0b96051 fix docker compose usage
Former-commit-id: 947a34f53b
2024-06-13 00:07:48 +08:00
hiyouga
77e4dc255f update readme
Former-commit-id: 2ce2e5bc47
2024-06-12 17:39:12 +08:00
hiyouga
d984776d35 fix #4145
Fix the docker image


Former-commit-id: 949e9908ad
2024-06-11 00:19:17 +08:00
-.-
b187450340 fix README
Former-commit-id: 483cdd9b6a
2024-06-08 23:51:56 +08:00
hiyouga
3547a26f86 add ultrafeedback and fineweb #4085 #4132
Former-commit-id: 12d79f89c5
2024-06-08 02:42:34 +08:00
hiyouga
4f3e680b57 init unittest
Former-commit-id: 1c7f0ab519
2024-06-08 01:35:58 +08:00
hiyouga
f76d427332 fix ppo in trl 0.8.6
Former-commit-id: 2702d7e952
2024-06-07 04:48:29 +08:00
hiyouga
d3196318be fix #4120
Former-commit-id: f9e818d79c
2024-06-07 04:18:05 +08:00
hiyouga
8a0263551d add qwen2 models
Former-commit-id: 8e95648850
2024-06-07 00:22:57 +08:00
hiyouga
2f0a333e9c update readme
Former-commit-id: 53eb2de75e
2024-06-06 16:59:18 +08:00
hiyouga
8cc6bb961b update readme
Former-commit-id: 87a7822b98
2024-06-06 16:25:42 +08:00
hiyouga
cceff9f520 lora modules: all by default
Former-commit-id: cae4737907
2024-06-06 03:53:28 +08:00
hiyouga
cafbb79d3a support image input in api #3971 #4061
Former-commit-id: 946f601136
2024-06-06 02:29:55 +08:00
hiyouga
b097f04a79 update readme
Former-commit-id: eef1e542a9
2024-06-05 16:32:32 +08:00
hiyouga
94c37490d1 support glm-4
Former-commit-id: f48f5e646e
2024-06-05 15:16:38 +08:00
hiyouga
72ebcb9a04 update readme
Former-commit-id: c4f50865ad
2024-05-30 16:40:17 +08:00
hiyouga
a71a6a05c3 update readme
Former-commit-id: 89ca832740
2024-05-29 18:39:11 +08:00
hoshi-hiyouga
2e7dae0f97 Merge pull request #3930 from MengqingCao/npu
Add Ascend npu doc and dependency

Former-commit-id: 880b4a9acf
2024-05-29 18:33:38 +08:00
MengqingCao
29fe1cd688 update cann kernels url
Former-commit-id: e14f5b37e4
2024-05-29 09:53:31 +00:00
hiyouga
3152c7dd1c update readme
Former-commit-id: 087b9faa39
2024-05-28 19:35:52 +08:00
hiyouga
2a473f36fb update readme
Former-commit-id: c8765349ba
2024-05-28 16:41:34 +08:00
hiyouga
ac9c52dfb4 update readme
Former-commit-id: 99ee0dadd9
2024-05-28 16:19:56 +08:00
hiyouga
f41319f31b fix #3931
Former-commit-id: 5d45adf47d
2024-05-28 13:44:22 +08:00
MengqingCao
099a932cbc add Ascend npu doc and dependency
Former-commit-id: cd67d6eeb5
2024-05-28 01:33:54 +00:00
hiyouga
db569a2d61 add llava 1k datasets
Former-commit-id: 08bd0440b5
2024-05-27 19:57:33 +08:00
hiyouga
51a1097c64 add phi-3 7b/14b, mistral v0.3 models
Former-commit-id: efa4b196ca
2024-05-27 18:20:16 +08:00
hiyouga
df33548b39 update readme
Former-commit-id: 5581cb2e4e
2024-05-27 18:14:02 +08:00
hiyouga
4807c11db8 support SimPO #3900
Former-commit-id: cb63b32986
2024-05-26 23:46:33 +08:00
donggang
3f52df0ca9 adapted to 910B image
Former-commit-id: 2f68a71fc0
2024-05-23 09:48:22 +00:00
hiyouga
eabaf0def8 update wechat
Former-commit-id: 2670f6fb3d
2024-05-21 18:22:32 +08:00
hiyouga
11f79ea20e fix #3847
Former-commit-id: 335501e228
2024-05-21 17:53:06 +08:00
hiyouga
cce3892f91 support paligemma
Former-commit-id: 2a67457e39
2024-05-21 00:01:22 +08:00