hiyouga
|
413a9fec67
|
fix #3352
Former-commit-id: f315f8e8ec916b82bac94a159e55839ff155c6b5
|
2024-04-19 22:40:01 +08:00 |
|
hiyouga
|
cc2892e891
|
fix llama3 template
Former-commit-id: 20e95250168fbe081c779b2e1ff23f5df3ce02f7
|
2024-04-19 15:46:51 +08:00 |
|
Marco
|
64c883b9f8
|
fix small typo
Former-commit-id: 5638a03cd0cf8119ff366b3b3e303b5a2351b065
|
2024-04-18 20:33:29 +02:00 |
|
Marco
|
68dbd5d220
|
Added Mixture of Depths
Former-commit-id: 75dd98b9abc847e22cb263c17ebcd2ca5dd98345
|
2024-04-18 20:31:24 +02:00 |
|
hoshi-hiyouga
|
39f0cc7d8b
|
support llama3
Former-commit-id: c1eabb751a5fd73b710714451b146732e0ed4558
|
2024-04-19 01:13:50 +08:00 |
|
hiyouga
|
0d0c6612a5
|
fix #3324
Former-commit-id: 5e710c4ac331f3400534d33b2646c4108c898d98
|
2024-04-18 15:34:45 +08:00 |
|
hiyouga
|
dcc34ab729
|
tiny fix
Former-commit-id: 86399ca8c06273c42c2b184664ae25d3405b3bf6
|
2024-04-18 00:22:17 +08:00 |
|
hiyouga
|
ca2c480736
|
add mixtral 8x22B models
Former-commit-id: eccbeecff0909e1fa124b5439ffbbfbc5607e1d6
|
2024-04-17 23:35:59 +08:00 |
|
hiyouga
|
7b0da78222
|
add CodeQwen models
Former-commit-id: 9f6094241391f8f717818c8ba94e11d1791b4a5c
|
2024-04-17 23:27:22 +08:00 |
|
hiyouga
|
dd992dcce9
|
fix #3316
Former-commit-id: 7395e9e90a209228ff563ab54319955608850fc3
|
2024-04-17 22:54:34 +08:00 |
|
hiyouga
|
d3adeed72a
|
fix #3317
Former-commit-id: 7dce1763be4374cf616d96db95ae964ff510a9d6
|
2024-04-17 22:17:19 +08:00 |
|
hiyouga
|
1a41d2a4e7
|
lint
Former-commit-id: 917d65ce65024d17a5030bc57083a427cfae16d7
|
2024-04-16 18:21:09 +08:00 |
|
hoshi-hiyouga
|
b636ac24c0
|
Merge pull request #3291 from codemayq/main
support for previewing custom dataset in directory format
Former-commit-id: 40d89152282101a7c08f53e72c2ad7124a0595f3
|
2024-04-16 18:12:09 +08:00 |
|
hiyouga
|
f7a751bb6f
|
Update parser.py
Former-commit-id: 92c2133896c20054db86dd53508c982e39bd5ca0
|
2024-04-16 18:09:31 +08:00 |
|
hiyouga
|
16e20ffa8f
|
update readme and gradio version
Former-commit-id: 4029b60ddcbd15b5354503c51178f0f5e7e9aedf
|
2024-04-16 18:09:16 +08:00 |
|
hiyouga
|
d41793228e
|
support badam for all stages
Former-commit-id: 7a1380646119bfe6855f73dd90570defcea05281
|
2024-04-16 17:44:48 +08:00 |
|
hoshi-hiyouga
|
e8667f9c90
|
Merge pull request #3287 from Ledzy/badam
[Feature] Add BAdam algorithm
Former-commit-id: 10a5e1e65b34b03e5ca2a41bf6ded09a3fb25f0c
|
2024-04-16 17:32:16 +08:00 |
|
hoshi-hiyouga
|
71743d4726
|
Update utils.py
Former-commit-id: 01147536b2bb507e87e033fa696e9eb39fe96bbe
|
2024-04-16 17:30:12 +08:00 |
|
hoshi-hiyouga
|
3f1f729d88
|
Update trainer.py
Former-commit-id: c6163be1444c00dd000f288e2f834968bd932981
|
2024-04-16 17:29:52 +08:00 |
|
hoshi-hiyouga
|
b5c3d23a22
|
Update utils.py
Former-commit-id: 7edf4dbed88b8034282f14fd6e0cb6f7f9e5f805
|
2024-04-16 17:29:30 +08:00 |
|
hoshi-hiyouga
|
88a080ced4
|
Update patcher.py
Former-commit-id: 494e6a1e05b38f5ff61d83327303614f53c92e64
|
2024-04-16 17:29:19 +08:00 |
|
hoshi-hiyouga
|
12f43694be
|
Update adapter.py
Former-commit-id: 8f7b75b26f020d8ae85baab7b082475c3bfeb512
|
2024-04-16 17:28:12 +08:00 |
|
hoshi-hiyouga
|
e9f351b1d5
|
Update parser.py
Former-commit-id: 898239883afc79f03abd0dc276eef901662a9591
|
2024-04-16 17:27:25 +08:00 |
|
hoshi-hiyouga
|
5d330d3c32
|
Update parser.py
Former-commit-id: 2f3da8169d18b026760cc0ac7dd6141bdd08c932
|
2024-04-16 17:27:02 +08:00 |
|
hoshi-hiyouga
|
e56b77b1ca
|
Update finetuning_args.py
Former-commit-id: 3a23d900aea74078f0bc8cf73fac860a4ce3df67
|
2024-04-16 17:26:30 +08:00 |
|
Jonery
|
2ba03e6ef3
|
resolve gradient checkpointing issue.
Former-commit-id: 6df9135d063bb6102f0cbcdf0d702076f5febbae
|
2024-04-16 12:05:27 +08:00 |
|
codingma
|
57a093525d
|
add check
Former-commit-id: 008f6498977c243c80e87242f05c9cf9573541ac
|
2024-04-16 10:56:39 +08:00 |
|
codingma
|
507d725b4b
|
support for previewing custom dataset in directory format
Former-commit-id: 501cff38c819f06f15194907ce7e052d5f28025a
|
2024-04-16 10:43:14 +08:00 |
|
hiyouga
|
c5510439d0
|
add empty template
Former-commit-id: a325ffa8a668bec354d2636683806acef105e196
|
2024-04-16 03:10:02 +08:00 |
|
hiyouga
|
ba4efe3ff6
|
support unsloth 2024.4
Former-commit-id: 14a83f8bc4fe44783252378fce59198194a96bb8
|
2024-04-16 00:25:03 +08:00 |
|
hiyouga
|
2aa1d1476e
|
add codegemma
Former-commit-id: 9324176525c2eda22962b0ca1895009b6237e6e3
|
2024-04-16 00:11:15 +08:00 |
|
hiyouga
|
19874e39ee
|
support cohere commandR #3184
Former-commit-id: e077c36872740f6b2ac255aee9da6c4c70f28977
|
2024-04-15 23:26:42 +08:00 |
|
Jonery
|
22188f1fa3
|
Feature BAdam
Former-commit-id: d8d2807fbcf587c37f7fd34a23e9397d2775ceed
|
2024-04-15 23:15:27 +08:00 |
|
hoshi-hiyouga
|
41783ae083
|
Merge pull request #3254 from marko1616/feature/Add-support-for-CohereForAI/c4ai-command-r-plus
Add template&support for c4ai-command-r/plus (tested)
Former-commit-id: 41d39ec4889abad050820bf153133ac3a11228a3
|
2024-04-15 22:59:35 +08:00 |
|
hoshi-hiyouga
|
56339caf4d
|
Update template.py
Former-commit-id: 00b8be7dafa65e13b344724a8d3855919ee4f631
|
2024-04-15 22:58:01 +08:00 |
|
hoshi-hiyouga
|
03c83838b1
|
Update constants.py
Former-commit-id: 39199f712aa7b7a1c66080d9c84651fd2eb0b425
|
2024-04-15 22:56:55 +08:00 |
|
hiyouga
|
be206df674
|
update examples
Former-commit-id: 369294b31c8a03a1cafcee83eb31a817007d3c49
|
2024-04-15 22:14:34 +08:00 |
|
marko1616
|
91f8a8248c
|
change default_system accroding to official template
Former-commit-id: 7ad9029c5e77a87a7c324b8f90b4f80a31a5c78b
|
2024-04-15 20:45:46 +08:00 |
|
marko1616
|
227b973961
|
Revert "Add support for function call(Not strictly following origin)"
This reverts commit 65ef864f660c00c96c5b28d127bece5aaf0bf42e [formerly 44f3ada4e394c06b0d972329ed2a62d2be2ea0c6].
Former-commit-id: fac9cc6e01dd8f3bc449b656804476e1871326f0
|
2024-04-15 20:27:09 +08:00 |
|
marko1616
|
65ef864f66
|
Add support for function call(Not strictly following origin)
Former-commit-id: 44f3ada4e394c06b0d972329ed2a62d2be2ea0c6
|
2024-04-15 20:16:52 +08:00 |
|
hoshi-hiyouga
|
740d89e9df
|
Merge pull request #3276 from liu-zichen/fix_mixtral
fix: turn on output_router_logits of mixtral
Former-commit-id: 07bbaf5c67d00a152e5304e81b15fd9189e7bb99
|
2024-04-15 15:38:16 +08:00 |
|
hiyouga
|
506276c9cb
|
fix #3273
Former-commit-id: 3b20c89b342a068356ffc29c3724b645775c65db
|
2024-04-15 15:32:58 +08:00 |
|
liuzc
|
44c86150c9
|
fix: mixtral output_router_logits
Former-commit-id: ab3171ea97ec968b972287287ef9ee2502c6d37c
|
2024-04-15 12:11:49 +08:00 |
|
marko1616
|
bbded4412f
|
Typo fix
Former-commit-id: 607625497738b2c8be736be7b0bd5c6f4cbaad5e
|
2024-04-13 17:30:21 +08:00 |
|
marko1616
|
7d36d19aaa
|
Typo fix
Former-commit-id: 51b1e49e288e66c1b0c24ac070201c988fb2a389
|
2024-04-13 07:52:11 +08:00 |
|
marko1616
|
07d01ed16a
|
Add c4ai-command-r-plus link
Former-commit-id: acaf953ca46eca8fb378067f4ada133654e4f088
|
2024-04-13 07:32:40 +08:00 |
|
marko1616
|
ba9fc46712
|
Add template&support(Not tested)
Former-commit-id: 60bb60c4dc30a9641ddb57a44ef126f0768566c4
|
2024-04-13 04:31:33 +08:00 |
|
hiyouga
|
64da8145bf
|
fix model card
Former-commit-id: 920e7149bf2b559c9829aa4b11cfb6d00bbb2f9e
|
2024-04-12 17:11:59 +08:00 |
|
hiyouga
|
4cb8cc563d
|
fix #3238
Former-commit-id: 4d7e81ab4722d13bec6ca1af141f94bdc74d0883
|
2024-04-12 14:28:11 +08:00 |
|
hiyouga
|
5d7887fd5d
|
set dev version
Former-commit-id: f6cc76571d2c789675883a18e0db3d0c61f33808
|
2024-04-11 20:27:34 +08:00 |
|