hiyouga
|
b641e9e97e
|
fix #1784
Former-commit-id: 28d5de7e78
|
2023-12-09 20:53:18 +08:00 |
|
yuze.zyz
|
c523613f0a
|
support ms dataset
Former-commit-id: 9c2247d700
|
2023-12-08 18:00:57 +08:00 |
|
hiyouga
|
89cf856776
|
fix #1771 and temporarily fix #1764
Former-commit-id: d42c0b1d34
|
2023-12-08 16:26:20 +08:00 |
|
hiyouga
|
9b84a706af
|
add models
Former-commit-id: e25f7bae16
|
2023-12-06 13:33:18 +08:00 |
|
hiyouga
|
027caabbb6
|
fix ppo trainer save logic
Former-commit-id: d3dccd0693
|
2023-12-04 19:00:19 +08:00 |
|
hiyouga
|
cd2b0a024b
|
fix #1715
Former-commit-id: c9b166615c
|
2023-12-03 22:35:47 +08:00 |
|
hiyouga
|
16b7296ae1
|
release v0.3.3
Former-commit-id: 438dea679b
|
2023-12-03 21:59:45 +08:00 |
|
hiyouga
|
6493558c3b
|
fix bug
Former-commit-id: 8b681ee273
|
2023-12-03 21:40:40 +08:00 |
|
hiyouga
|
64eead3fb1
|
ppo support rm server
Former-commit-id: 747db40172
|
2023-12-03 21:38:51 +08:00 |
|
hiyouga
|
1cb390b9b2
|
implement rm server #1543
Former-commit-id: 7df4f3ab20
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
2279b1948e
|
fix #1707 #1710
Former-commit-id: 03d05991f8
|
2023-12-03 11:33:12 +08:00 |
|
hiyouga
|
6720189f3f
|
fix #1642
Former-commit-id: b69763ff92
|
2023-12-02 00:37:53 +08:00 |
|
hiyouga
|
f8376b228a
|
add xuanyuan models
Former-commit-id: 6e7af11b98
|
2023-12-02 00:35:29 +08:00 |
|
hiyouga
|
4cc08e00c7
|
fix gptq training
Former-commit-id: f57445c7a0
|
2023-12-02 00:27:15 +08:00 |
|
hiyouga
|
c8eff09c7c
|
tiny fix
Former-commit-id: a973ce6e89
|
2023-12-01 23:37:10 +08:00 |
|
hiyouga
|
e0da912f8e
|
fix gptq model inference
Former-commit-id: 01e6c539b0
|
2023-12-01 23:34:14 +08:00 |
|
hiyouga
|
a388af4adc
|
fix #1703
Former-commit-id: 662d9a3a4e
|
2023-12-01 22:55:41 +08:00 |
|
hiyouga
|
c60e79c12e
|
patch modelscope
Former-commit-id: bd42c229b0
|
2023-12-01 22:53:15 +08:00 |
|
hoshi-hiyouga
|
9a26819a58
|
Merge branch 'main' into feat/support_ms
Former-commit-id: 00f5c9ee16
|
2023-12-01 20:23:46 +08:00 |
|
yuze.zyz
|
fcd61657ee
|
remove useless code
Former-commit-id: 5a2392f105
|
2023-12-01 17:28:23 +08:00 |
|
tastelikefeet
|
eb835b693d
|
fix bug
Former-commit-id: d9e52957e2
|
2023-12-01 17:27:00 +08:00 |
|
hiyouga
|
e964fa7df7
|
fix err hint
Former-commit-id: a5a248d569
|
2023-12-01 17:13:22 +08:00 |
|
hiyouga
|
dbb8342ec0
|
add err hint
Former-commit-id: a51b8ec620
|
2023-12-01 17:04:37 +08:00 |
|
yuze.zyz
|
b2200409f5
|
add readme
Former-commit-id: 5aa6751e52
|
2023-12-01 16:11:30 +08:00 |
|
hiyouga
|
a44ba7a2b8
|
tiny fix
Former-commit-id: e597d3c084
|
2023-12-01 15:58:50 +08:00 |
|
hoshi-hiyouga
|
6e094e491e
|
Merge pull request #1695 from Samge0/dev
Improve:"CUDA_VISIBLE_DEVICES" read from the env
Former-commit-id: fbc6220692
|
2023-12-01 15:56:18 +08:00 |
|
hoshi-hiyouga
|
752b3dd58d
|
Merge pull request #1690 from billvsme/main
Improve get_current_device
Former-commit-id: d043a4e7ba
|
2023-12-01 15:44:35 +08:00 |
|
hiyouga
|
9a6b694e12
|
fix #1696
Former-commit-id: bf6f6aeefe
|
2023-12-01 15:34:50 +08:00 |
|
tastelikefeet
|
63e12226a0
|
add model
Former-commit-id: 8ce4d11e38
|
2023-12-01 15:06:17 +08:00 |
|
samge
|
7cf4e3b9c6
|
Improve:"CUDA_VISIBLE_DEVICES" read from the env
Former-commit-id: 421d4de604
|
2023-12-01 11:35:02 +08:00 |
|
billvsme
|
e400f2e8ad
|
improve get_current_device
Former-commit-id: 40dfcbc3d4
|
2023-11-30 22:40:35 +08:00 |
|
hiyouga
|
3d291a82d3
|
fix #1597
Former-commit-id: 327d7f7efe
|
2023-11-30 21:47:06 +08:00 |
|
hiyouga
|
ba6d290d0b
|
fix #1668
Former-commit-id: 1585962eb7
|
2023-11-30 21:02:00 +08:00 |
|
hiyouga
|
bb6b4823ad
|
fix #1682
Former-commit-id: a38dbf55e3
|
2023-11-30 20:03:32 +08:00 |
|
hiyouga
|
1c43fb6a41
|
add models
Former-commit-id: 509abe8864
|
2023-11-30 19:16:13 +08:00 |
|
yuze.zyz
|
45925e4a9c
|
fix
Former-commit-id: fb2204c183
|
2023-11-29 21:43:58 +08:00 |
|
yuze.zyz
|
e08e0e5814
|
support ms
Former-commit-id: d38a2e7341
|
2023-11-29 20:36:55 +08:00 |
|
hiyouga
|
ecfc7d1b50
|
fix #1658
Former-commit-id: 77d1b14fc2
|
2023-11-28 20:57:24 +08:00 |
|
hiyouga
|
ae1048db6d
|
fix #1659
Former-commit-id: 475a3fa0f4
|
2023-11-28 20:52:28 +08:00 |
|
hiyouga
|
b015ac35d8
|
support export size setting
Former-commit-id: 859a6ea942
|
2023-11-26 18:34:09 +08:00 |
|
hiyouga
|
5f2943dc84
|
support Yi-34B-Chat models
Former-commit-id: ff1c289229
|
2023-11-23 19:31:49 +08:00 |
|
hiyouga
|
9697c3e970
|
set version
Former-commit-id: 35c2da3eba
|
2023-11-20 22:57:44 +08:00 |
|
hiyouga
|
4966bd7911
|
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: 9ea9380145
|
2023-11-20 22:52:11 +08:00 |
|
hiyouga
|
f06c4c8f7a
|
update ppo trainer
Former-commit-id: 5021062493
|
2023-11-20 21:39:15 +08:00 |
|
hoshi-hiyouga
|
d72f123851
|
Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training
Former-commit-id: 48211e3799
|
2023-11-20 20:32:55 +08:00 |
|
hiyouga
|
a7b1632ace
|
fix value head model resuming
Former-commit-id: 2a36fd5064
|
2023-11-20 19:01:37 +08:00 |
|
hiyouga
|
682d81caa9
|
fix #1567
Former-commit-id: 99a3f06377
|
2023-11-20 18:46:36 +08:00 |
|
hiyouga
|
32545bd6d9
|
better data streaming
Former-commit-id: 00baaa990e
|
2023-11-19 23:32:47 +08:00 |
|
hiyouga
|
d1e03512f4
|
fix model card network issue
Former-commit-id: 211b2db5a8
|
2023-11-19 23:03:19 +08:00 |
|
hiyouga
|
8d82d7e994
|
fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547
Former-commit-id: bfb9433165
|
2023-11-19 16:29:30 +08:00 |
|