hiyouga
|
719585a128
|
update readme
Former-commit-id: 3a8c17907c71f46b1b37501e2afdc99ad89fb4bc
|
2024-04-22 00:21:01 +08:00 |
|
hiyouga
|
f8e219dc81
|
fix mod stuff
Former-commit-id: cf3988226e6398c67bb2955578e436fc505aa5c5
|
2024-04-21 18:11:10 +08:00 |
|
Marco
|
44cda2eece
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hoshi-hiyouga
|
8397808d1d
|
support llama3
Former-commit-id: c1eabb751a5fd73b710714451b146732e0ed4558
|
2024-04-19 01:13:50 +08:00 |
|
hiyouga
|
9e1bd6420d
|
fix #3324
Former-commit-id: 5e710c4ac331f3400534d33b2646c4108c898d98
|
2024-04-18 15:34:45 +08:00 |
|
hiyouga
|
619264c854
|
tiny fix
Former-commit-id: 86399ca8c06273c42c2b184664ae25d3405b3bf6
|
2024-04-18 00:22:17 +08:00 |
|
hiyouga
|
1ebac62e3d
|
update readme
Former-commit-id: a49112a74339ba77bfec53f7870e821fe148db2c
|
2024-04-17 23:40:49 +08:00 |
|
hiyouga
|
ce9bdb3509
|
add mixtral 8x22B models
Former-commit-id: eccbeecff0909e1fa124b5439ffbbfbc5607e1d6
|
2024-04-17 23:35:59 +08:00 |
|
hiyouga
|
0a1578e4e3
|
update readme and gradio version
Former-commit-id: 4029b60ddcbd15b5354503c51178f0f5e7e9aedf
|
2024-04-16 18:09:16 +08:00 |
|
hiyouga
|
a4167fd925
|
support badam for all stages
Former-commit-id: 7a1380646119bfe6855f73dd90570defcea05281
|
2024-04-16 17:44:48 +08:00 |
|
hiyouga
|
b053c6454e
|
update readme
Former-commit-id: 8f233745c3aa7a6ef57f275bec80ee731ff76de3
|
2024-04-16 02:36:54 +08:00 |
|
hiyouga
|
ebf0f4a77c
|
update readme
Former-commit-id: f9a246572c1ec0e4b36bff237c6523ce629b7000
|
2024-04-16 02:35:36 +08:00 |
|
hiyouga
|
b5c5283dd6
|
add codegemma
Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
b638c65519
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
hiyouga
|
7468f2535c
|
release v0.6.2
Former-commit-id: f92ad0a62d957b595f6a76a5403216b163eb3d17
|
2024-04-11 20:08:51 +08:00 |
|
hiyouga
|
04fc2f78bf
|
update readme
Former-commit-id: 1cf15547e2420a3e5f7a969c21c10c7fbdfc71fe
|
2024-04-07 00:48:24 +08:00 |
|
hiyouga
|
43d134ba29
|
fix requires for windows
Former-commit-id: 5e25fae40b7ea9cfa72717efbe3677199ca9608f
|
2024-04-03 21:56:43 +08:00 |
|
hiyouga
|
a74a7585e0
|
update vllm example
Former-commit-id: 2df6d2eacfa27ebc69455696b93649624c1facbe
|
2024-04-02 22:45:20 +08:00 |
|
hiyouga
|
5bf0cca2b8
|
update readme
Former-commit-id: 7ea7333b51be6b1120fc0b13675f5a0ac3c5a12b
|
2024-04-02 22:17:48 +08:00 |
|
hiyouga
|
35621c6089
|
add zh readme
Former-commit-id: 389a170a4d42c56c71c0e17bbe018c4cb1983b5a
|
2024-04-02 20:58:45 +08:00 |
|
hiyouga
|
c1510d19c7
|
update readme
Former-commit-id: 9b8e7ccdab167f53fb897e1940562682324e8ff0
|
2024-04-02 20:37:37 +08:00 |
|
hiyouga
|
2074cf99fb
|
update readme
Former-commit-id: 0c73d3c8a5762a8f119b27322ffd52a61de6fe38
|
2024-04-02 20:22:11 +08:00 |
|
hiyouga
|
b12176d818
|
simplify readme
Former-commit-id: 0da6ec2d516326fe9c7583ba71cd1778eb838178
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
85726c91ce
|
add qwen1.5 moe
Former-commit-id: 3ea94f0d12cec25ac694a2c4ae8971c356990b61
|
2024-04-01 21:49:40 +08:00 |
|
hiyouga
|
40211db275
|
fix #3077
Former-commit-id: d0340391e8075cff0d84b3ef879c2101b66ca1dc
|
2024-04-01 21:35:18 +08:00 |
|
hiyouga
|
9abd83adb1
|
update readme
Former-commit-id: 297b01f16ac78cde15a5d85a9a5b82ea20bfaf23
|
2024-03-31 18:46:34 +08:00 |
|
hiyouga
|
d764cd8736
|
support ORPO
Former-commit-id: f44a4c27e2461cdaa1b16865f597a31033c0e6d9
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
50224b09cc
|
update readme
Former-commit-id: 312d4f90784800dc8db4eaa7d908e6761115bc51
|
2024-03-28 22:02:32 +08:00 |
|
hiyouga
|
32dcc5a491
|
add project
Former-commit-id: 0418e9fecb2337b5d1b72e8358adb8aa10803c4b
|
2024-03-28 20:24:27 +08:00 |
|
hiyouga
|
f0e564beaa
|
update readme
Former-commit-id: 6b634b5c2dbad827e8cc9850b8d7697c2056532a
|
2024-03-28 18:35:11 +08:00 |
|
hiyouga
|
59e6ebf039
|
update trainers
Former-commit-id: d0dd6eefed0b86895ed00a7cafb331e5193db645
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
2a5d02fd0f
|
update readme
Former-commit-id: 32e6a7f10fdc28106e3b086eb79304943c6e8fab
|
2024-03-25 23:06:13 +08:00 |
|
hoshi-hiyouga
|
ea550ed9e0
|
Merge pull request #2967 from Tsumugii24/main
Update README_zh.md
Former-commit-id: 4c3b8da2caf74e9d6819bdb1a4e30ca3c549a2d8
|
2024-03-25 23:02:22 +08:00 |
|
Tsumugii24
|
02665cd42b
|
Update README.md
Former-commit-id: fd28fff2b9dfdb3e59b160c5fcee9cdc69e53564
|
2024-03-25 22:54:38 +08:00 |
|
hiyouga
|
daab85e3e6
|
release v0.6.0
Former-commit-id: 51910d5803eb718e4976da0b3bfcdc5eeeea48eb
|
2024-03-25 22:38:56 +08:00 |
|
hiyouga
|
a57d839e1d
|
fix #2941
Former-commit-id: 3775ab52017f0b610ddd8199cccfb8c001eda507
|
2024-03-24 00:28:44 +08:00 |
|
0xez
|
d5005e766f
|
Update README.md, fix the release date of the paper
Former-commit-id: 4bf9ef3095376f0208f783f180c13bef88581824
|
2024-03-21 22:14:48 +08:00 |
|
hiyouga
|
1cf0f11840
|
add citation
Former-commit-id: 54199205f2000c0500d29822387646133e06e8b2
|
2024-03-21 17:04:10 +08:00 |
|
hiyouga
|
052e8b2cc6
|
paper release
Former-commit-id: 7bd384655244ce6a8c1f34aa6fed54122d0e9da5
|
2024-03-21 13:49:17 +08:00 |
|
hiyouga
|
8963e89633
|
update readme
Former-commit-id: ab98d4d617b7193c474f58a29ca9475fea7564aa
|
2024-03-21 00:48:42 +08:00 |
|
hiyouga
|
935ee0a023
|
support fsdp + qlora
Former-commit-id: b894bf8e84be689db258021f0638e9ac939abcbc
|
2024-03-21 00:36:06 +08:00 |
|
hiyouga
|
c7af26a9e3
|
fix #2777 #2895
Former-commit-id: 54d5f62d29456a8d9d0c0dd3d0bbfffe48935803
|
2024-03-20 17:59:45 +08:00 |
|
khazic
|
c32d6c8250
|
Updated README with new information
Former-commit-id: 90a81c2e52bd44beb3b7feb5d2517b073f7f6ef9
|
2024-03-20 14:21:16 +08:00 |
|
刘一博
|
757158da63
|
Updated README with new information
Former-commit-id: fddbc29ca1bd9b13372087e6a349f21240abc013
|
2024-03-20 14:11:28 +08:00 |
|
hiyouga
|
46f99ff277
|
improve lora+ impl.
Former-commit-id: 332bad25455a70ad9204e7dd384bb086d789aa39
|
2024-03-13 23:32:51 +08:00 |
|
hiyouga
|
dff77004f2
|
support olmo
Former-commit-id: 2719510e8c6baa591c74458b773e4e47215e6052
|
2024-03-12 18:30:38 +08:00 |
|
hoshi-hiyouga
|
9ee416a8fc
|
Merge pull request #2743 from S3Studio/DockerizeSupport
Add dockerize support
Former-commit-id: 30751a7b9218770cc2bc6cae857a28950bffbb6c
|
2024-03-12 00:05:49 +08:00 |
|
hiyouga
|
7ff8a064f3
|
support layerwise galore
Former-commit-id: d43a4da0947897d0be3f62fad3107754d4c89f2b
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
f37d481c5d
|
add GaLore results
Former-commit-id: ac05b9bba62924693bdede85917d21b844849b8c
|
2024-03-09 04:11:55 +08:00 |
|
hiyouga
|
5d7d8bd55c
|
update hardware requirements
Former-commit-id: 604b3d10fc1448f702943114b66b97bded21e080
|
2024-03-09 03:58:18 +08:00 |
|