hiyouga
|
6dde0508e9
|
improve fix tokenizer
Former-commit-id: 57b138abad6397596bc47be94e092e8fabedc06f
|
2024-02-09 14:53:14 +08:00 |
|
hoshi-hiyouga
|
4e20904d44
|
Merge pull request #2423 from mayflower/main
Support for german sft and dpo
Former-commit-id: 8e282e4e6bee6493b1bd38ba239ca49a6a840a92
|
2024-02-07 15:58:20 +08:00 |
|
hiyouga
|
2a6a0b7d4d
|
support qwen1.5
Former-commit-id: 8a03a572b058c5cc4ff598670dc8595b2b97e374
|
2024-02-06 00:10:51 +08:00 |
|
Johann-Peter Hartmann
|
912bb5cb03
|
Add support for german datasets
Former-commit-id: bbc038aa236952597e97d1ccf1ae2d64a16339b5
|
2024-01-30 10:18:01 +01:00 |
|
hiyouga
|
7af334f89e
|
release v0.5.0 (real)
Former-commit-id: 2146e1d9195c179fa8f92144ec2b7034e1a9f942
|
2024-01-21 01:54:49 +08:00 |
|
hiyouga
|
83ad6d25b2
|
update readme
Former-commit-id: 11e0c732c4968b083f60a0bb6f7bb5dd5ca2ba56
|
2024-01-18 14:30:48 +08:00 |
|
hiyouga
|
2ae45b0708
|
tiny fix
Former-commit-id: 6b1e9207e988c253a808e6bb26e3af9d071b77bc
|
2024-01-15 23:34:23 +08:00 |
|
Junu Moon(Fran)
|
51216668fc
|
fix: typo on README.md
Former-commit-id: 372066b559305a1428c88fbd6b01e332bfd5e3e1
|
2024-01-15 19:50:35 +09:00 |
|
hiyouga
|
dff7c87397
|
support deepseek moe
Former-commit-id: 07fbb32496b9b81c4cfe67cb9a15a6b2c43852c3
|
2024-01-14 00:14:49 +08:00 |
|
hiyouga
|
9f9b1b4632
|
fix phi modules
Former-commit-id: 68d7e925ec51b6ee277513de8f61ac18a8378b98
|
2024-01-13 23:12:47 +08:00 |
|
JessyTsu1
|
d76b2acd3f
|
Update README.md
Former-commit-id: 547d4df5c7a1d6dd95cfed37229701ce507b421c
|
2024-01-11 23:18:29 +08:00 |
|
JessyTsu1
|
63d05d3fcc
|
Update README.md
Former-commit-id: dcd4858fd2c2ac4d3cce8a369dc9991108c03821
|
2024-01-11 23:17:00 +08:00 |
|
hiyouga
|
9537bc7f3e
|
fix #1789
Former-commit-id: d86455f685fa531e651333e00b4fe54d895cf2e4
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
e730c45072
|
add yuan model
Former-commit-id: 6a0377e2e51633bd5fb10fa8628e554565c5ee3e
|
2023-12-29 13:50:24 +08:00 |
|
hiyouga
|
512a086221
|
fix args
Former-commit-id: ff18f327a3dc96d9677ef32841e8f29ab2eeb7ef
|
2023-12-28 18:47:19 +08:00 |
|
hiyouga
|
6298f4779c
|
tiny update
Former-commit-id: 4417b8ee20b381c964f452f52081667dfa33cd7b
|
2023-12-25 18:29:34 +08:00 |
|
hiyouga
|
52a348cac2
|
update patcher
Former-commit-id: d6d7b6670847ce4ea10353c5b126214542b45c2b
|
2023-12-23 15:24:27 +08:00 |
|
hiyouga
|
ebf4786979
|
update readme
Former-commit-id: d3dea7a926e9d356a39ca2033b03be7f559cc143
|
2023-12-23 02:17:41 +08:00 |
|
hiyouga
|
9cdaa43d1c
|
support unsloth
Former-commit-id: b857f00234b90b785d82ca7cdb29af3d948b1a7b
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
c51d29c36f
|
update readme
Former-commit-id: 36cd747e6a1a568e1a03e6c6611fec48e6ab9df7
|
2023-12-18 22:29:45 +08:00 |
|
hiyouga
|
ae5f71588b
|
update readme
Former-commit-id: 01267eee0da0bffb3f0c0378e2e60d14e05585c4
|
2023-12-18 15:46:45 +08:00 |
|
hiyouga
|
cedf58978e
|
support autogptq in llama board #246
Former-commit-id: fea01226703d1534b5cf511bcb6a49e73bc86ce1
|
2023-12-16 16:31:30 +08:00 |
|
hiyouga
|
a08089f449
|
support quantization in export model
Former-commit-id: f32500ae6edccab7d14df4c92467e15986866def
|
2023-12-15 23:44:50 +08:00 |
|
hiyouga
|
ef730c23e2
|
update dc link
Former-commit-id: f6789e50e17a377b6d9b434d8e12ad99d8eecfeb
|
2023-12-15 22:11:31 +08:00 |
|
hiyouga
|
8432e50396
|
refactor adapter hparam
Former-commit-id: f82aece9ebd6df83a7a005cc7cbbcec07fa6e14d
|
2023-12-15 20:53:11 +08:00 |
|
hiyouga
|
8dc554a56b
|
remove loftq
Former-commit-id: e175c0a1c631296117abda2403a4b87bbdd35a66
|
2023-12-13 01:53:46 +08:00 |
|
hiyouga
|
071c78d5e2
|
update readme
Former-commit-id: e81037d766f89f7e2b6539596397983eba52b492
|
2023-12-12 23:30:29 +08:00 |
|
hiyouga
|
0507637911
|
support loftq
Former-commit-id: e7ac2eb7f7daae17525a278ffbe2f82c0fbd8093
|
2023-12-12 22:47:06 +08:00 |
|
hiyouga
|
d9f621be13
|
support system column #1765
Former-commit-id: f425584a511c5e42bae8b3ba090eaa898b28adad
|
2023-12-12 19:45:59 +08:00 |
|
hiyouga
|
e88b100ce2
|
update readme
Former-commit-id: 42e042a4206aeb5177ddde56386e9655b0c06460
|
2023-12-12 11:44:30 +08:00 |
|
hiyouga
|
82592501be
|
support mixtral
Former-commit-id: 75b5b8e36ab1933b2625f11b645f56cbc805fd85
|
2023-12-12 11:39:04 +08:00 |
|
hiyouga
|
7250a6db54
|
update readme
Former-commit-id: a15f8cf19cac42acfb9917a2d7c9fa36a838b360
|
2023-12-04 11:22:01 +08:00 |
|
hiyouga
|
b7cdc1d4e9
|
update readme
Former-commit-id: d3c46cb126a9182be765341fe31c860d71430712
|
2023-12-04 11:02:29 +08:00 |
|
hiyouga
|
a58d846a29
|
add logo
Former-commit-id: 597894ad31c186120335252ccc0cc48fcea701b4
|
2023-12-02 01:31:24 +08:00 |
|
hiyouga
|
dc62b998b9
|
update readme
Former-commit-id: a0a9408e11f6b4cfb39af3f28402353b7cf48fa6
|
2023-12-01 22:58:29 +08:00 |
|
hiyouga
|
caf4fa46e0
|
patch modelscope
Former-commit-id: 8888cf53f040f5a2d8c0e59cddf79b252449bf58
|
2023-12-01 22:53:15 +08:00 |
|
hoshi-hiyouga
|
6f3e3174c1
|
Merge branch 'main' into feat/support_ms
Former-commit-id: b8954342611e24bc3af972747fd016cde89eee3f
|
2023-12-01 20:23:46 +08:00 |
|
yuze.zyz
|
b8ee512ec2
|
add readme
Former-commit-id: 3d5ec6f12b4ae7d04520e6865516a9a6dd4f7efe
|
2023-12-01 16:11:30 +08:00 |
|
hiyouga
|
1602fe2350
|
fix #1696
Former-commit-id: 722ae14a652af34d9b91f9459e613d7959ecaa7e
|
2023-12-01 15:34:50 +08:00 |
|
hiyouga
|
46066ed801
|
add models
Former-commit-id: b9eaadde8b5f4b9f89fa7bb910b325fcf9c84434
|
2023-11-30 19:16:13 +08:00 |
|
hiyouga
|
9e509b7613
|
add gpu requirement #1657
Former-commit-id: 8581a9133790573031d9615a551fb677eb3be461
|
2023-11-29 12:05:03 +08:00 |
|
hiyouga
|
36fbe36b10
|
update readme
Former-commit-id: 561481a8008fde5a3273558460193864a09866ed
|
2023-11-21 13:15:46 +08:00 |
|
hiyouga
|
da30d9ba02
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: fdccc6cc9b68890199e9250cabdb996ff2f853b9
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
78e6ac0156
|
update ppo trainer
Former-commit-id: caa525a5c6f228b9ad71387d1fe4f1c2ffa2479e
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
05fd97c637
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 1b64678fa4979485f67c3bb1420dfdff6fcbc6e7
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
47d921f9f0
|
update benchmark
Former-commit-id: 1cd2ae910e3ffca92978772d000de6fde2f6bb13
|
2023-11-18 11:30:01 +08:00 |
|
hiyouga
|
633a1da456
|
update readme
Former-commit-id: a4d86a4bea1cce2219a54def9dfd3fd732d48e72
|
2023-11-18 11:15:56 +08:00 |
|
hiyouga
|
2f593a7d66
|
add benchmark
Former-commit-id: 85a09cb649be740a47359371499d821ee0d5c81e
|
2023-11-18 11:09:52 +08:00 |
|
Yuchen Han
|
d455ca2391
|
Update README.md
Former-commit-id: c1532dc6fe5d5b427011bd5509a2bc44ee16d951
|
2023-11-17 00:17:36 -08:00 |
|
hiyouga
|
33302970f3
|
update readme
Former-commit-id: 4018aabc5d1623033d27a8aced25804de79b7e7b
|
2023-11-16 15:58:37 +08:00 |
|