Commit Graph

562 Commits

Author SHA1 Message Date
hiyouga
70bf2a2247 update readme
Former-commit-id: 27ba1b63ce
2024-04-26 05:44:30 +08:00
hiyouga
fff1fb1232 add olmo 1.7
Former-commit-id: 44a43ee152
2024-04-24 05:50:50 +08:00
hiyouga
80c8586534 reenable sdpa and fast tok by default
Former-commit-id: 07737a3d2d
2024-04-24 02:18:44 +08:00
hiyouga
1bea1ed868 support phi-3
Former-commit-id: 1a13f05555
2024-04-24 00:28:53 +08:00
hiyouga
d5d6fb3970 update readme
Former-commit-id: db7f3b9784
2024-04-22 17:09:17 +08:00
hiyouga
381b461eb6 update readme
Former-commit-id: 836ca05586
2024-04-22 00:51:35 +08:00
hiyouga
e842328f05 update readme
Former-commit-id: 34d66a3a85
2024-04-22 00:42:25 +08:00
hiyouga
d8deb0f99e update readme and examples
Former-commit-id: a1f1fac33b
2024-04-22 00:37:32 +08:00
hiyouga
c0d8d530dd update readme
Former-commit-id: a83e7587a0
2024-04-22 00:21:01 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45
2024-04-21 18:11:10 +08:00
Marco
639297a5ef Added Mixture of Depths
Former-commit-id: 620add7b9f
2024-04-18 20:31:24 +02:00
hoshi-hiyouga
15d17a1e86 support llama3
Former-commit-id: 2aaaede247
2024-04-19 01:13:50 +08:00
hiyouga
9aa62ffb57 fix #3324
Former-commit-id: 942362d008
2024-04-18 15:34:45 +08:00
hiyouga
e2e0bbde12 tiny fix
Former-commit-id: 3b43a3b7c5
2024-04-18 00:22:17 +08:00
hiyouga
a49dd8b4f3 update readme
Former-commit-id: e2f1c6fc6a
2024-04-17 23:40:49 +08:00
hiyouga
8a369cc084 add mixtral 8x22B models
Former-commit-id: cab0598fd0
2024-04-17 23:35:59 +08:00
hiyouga
8beb7a9239 update readme and gradio version
Former-commit-id: 5d62a51c12
2024-04-16 18:09:16 +08:00
hiyouga
0a94fab357 support badam for all stages
Former-commit-id: e3d8fc75eb
2024-04-16 17:44:48 +08:00
hiyouga
01b0913f08 update readme
Former-commit-id: cf52911fed
2024-04-16 02:36:54 +08:00
hiyouga
76b41acc10 update readme
Former-commit-id: 6084eb7cf1
2024-04-16 02:35:36 +08:00
hiyouga
bd2b758b48 add codegemma
Former-commit-id: 6543f3d449
2024-04-16 00:11:15 +08:00
hiyouga
2dc3343b1c support cohere commandR #3184
Former-commit-id: e0dbac2845
2024-04-15 23:26:42 +08:00
hiyouga
431e9804ee release v0.6.2
Former-commit-id: 9d4c949461
2024-04-11 20:08:51 +08:00
hiyouga
527cce1eb5 update readme
Former-commit-id: a88fe8c1af
2024-04-07 00:48:24 +08:00
hiyouga
a6d347726f fix requires for windows
Former-commit-id: 7f6e412604
2024-04-03 21:56:43 +08:00
hiyouga
4b23159f53 update vllm example
Former-commit-id: 49a2dfaf90
2024-04-02 22:45:20 +08:00
hiyouga
3b16f97c6c update readme
Former-commit-id: 66b0fe4e96
2024-04-02 22:17:48 +08:00
hiyouga
03c538ebb3 add zh readme
Former-commit-id: 7765f337c7
2024-04-02 20:58:45 +08:00
hiyouga
135c4e3512 update readme
Former-commit-id: 11a6c1bad6
2024-04-02 20:37:37 +08:00
hiyouga
291ac11156 update readme
Former-commit-id: 949e5fe638
2024-04-02 20:22:11 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90b
2024-04-02 20:07:43 +08:00
hiyouga
8d987b7af7 add qwen1.5 moe
Former-commit-id: 54b7d34908
2024-04-01 21:49:40 +08:00
hiyouga
34f1de0574 fix #3077
Former-commit-id: aee634cd20
2024-04-01 21:35:18 +08:00
hiyouga
ddad9be81d update readme
Former-commit-id: 099db6acc0
2024-03-31 18:46:34 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a
2024-03-31 18:29:50 +08:00
hiyouga
1421db282a update readme
Former-commit-id: c1fe6ce782
2024-03-28 22:02:32 +08:00
hiyouga
b265002c19 add project
Former-commit-id: 1e43319f9c
2024-03-28 20:24:27 +08:00
hiyouga
a9d5b4b68e update readme
Former-commit-id: 6c94305e47
2024-03-28 18:35:11 +08:00
hiyouga
89c400633a update trainers
Former-commit-id: 8c77b10912
2024-03-28 18:16:27 +08:00
hiyouga
e90c3769e5 update readme
Former-commit-id: 7b3d8188f5
2024-03-25 23:06:13 +08:00
hoshi-hiyouga
94fb50c52a Merge pull request #2967 from Tsumugii24/main
Update README_zh.md

Former-commit-id: f633ac6646
2024-03-25 23:02:22 +08:00
Tsumugii24
03c387c543 Update README.md
Former-commit-id: 1704599503
2024-03-25 22:54:38 +08:00
hiyouga
27151b8c65 release v0.6.0
Former-commit-id: 6f2b563f12
2024-03-25 22:38:56 +08:00
hiyouga
58aa576ae5 fix #2941
Former-commit-id: a1c8c98c5f
2024-03-24 00:28:44 +08:00
0xez
3f50d572ed Update README.md, fix the release date of the paper
Former-commit-id: 675ba41562
2024-03-21 22:14:48 +08:00
hiyouga
63c83f3802 add citation
Former-commit-id: 5eaa50fa01
2024-03-21 17:04:10 +08:00
hiyouga
0684e315be paper release
Former-commit-id: 0581bfdbc7
2024-03-21 13:49:17 +08:00
hiyouga
ada7e20eb4 update readme
Former-commit-id: bfe7a91289
2024-03-21 00:48:42 +08:00
hiyouga
7999836fb6 support fsdp + qlora
Former-commit-id: 8408225162
2024-03-21 00:36:06 +08:00
hiyouga
8717e98200 fix #2777 #2895
Former-commit-id: 9bec3c98a2
2024-03-20 17:59:45 +08:00