xingjun.wang
277790d868
update dataset info
...
Former-commit-id: 73b50a26b9
2023-12-12 14:53:59 +08:00
xingjun.wang
6cb2c99e7d
add use_streaming
...
Former-commit-id: adc98c86da
2023-12-12 14:23:05 +08:00
xingjun.wang
1bd75afae8
fix cache dir
...
Former-commit-id: 1909f0d117
2023-12-12 14:21:33 +08:00
xingjun.wang
c1974c91e5
add print info for test
...
Former-commit-id: 168321a4da
2023-12-12 14:14:40 +08:00
xingjun.wang
e17f2a3f7f
update cache dir
...
Former-commit-id: edc82b923a
2023-12-12 13:08:18 +08:00
xingjun.wang
879209829e
update args for MsDataset.load
...
Former-commit-id: 09533e95ed
2023-12-12 13:02:54 +08:00
xingjun.wang
9f17d36ccf
add new datasets
...
Former-commit-id: fe4acc66b0
2023-12-12 12:44:15 +08:00
xingjun.wang
92fb73abd4
add open orca
...
Former-commit-id: 0ce18a3782
2023-12-12 12:34:04 +08:00
xingjun.wang
6520aecef1
update
...
Former-commit-id: cfba1009d0
2023-12-12 12:03:23 +08:00
xingjun.wang
1d65d24071
for test
...
Former-commit-id: 5b979147f0
2023-12-12 11:52:59 +08:00
xingjun.wang
2918743520
for test
...
Former-commit-id: 8a908a8c64
2023-12-12 11:47:59 +08:00
yuze.zyz
9c30cdb53d
fix typo
...
Former-commit-id: e4cf2a75ca
2023-12-08 18:13:26 +08:00
yuze.zyz
c523613f0a
support ms dataset
...
Former-commit-id: 9c2247d700
2023-12-08 18:00:57 +08:00
hoshi-hiyouga
9a26819a58
Merge branch 'main' into feat/support_ms
...
Former-commit-id: 00f5c9ee16
2023-12-01 20:23:46 +08:00
yuze.zyz
fcd61657ee
remove useless code
...
Former-commit-id: 5a2392f105
2023-12-01 17:28:23 +08:00
tastelikefeet
eb835b693d
fix bug
...
Former-commit-id: d9e52957e2
2023-12-01 17:27:00 +08:00
hiyouga
e964fa7df7
fix err hint
...
Former-commit-id: a5a248d569
2023-12-01 17:13:22 +08:00
hiyouga
dbb8342ec0
add err hint
...
Former-commit-id: a51b8ec620
2023-12-01 17:04:37 +08:00
hoshi-hiyouga
947192900a
Merge pull request #1699 from Samge0/patch-1
...
Update .gitignore
Former-commit-id: aec946b119
2023-12-01 16:52:57 +08:00
SamgeShao
c5842fcd96
Update .gitignore
...
Former-commit-id: 7cabb9903d
2023-12-01 16:37:41 +08:00
yuze.zyz
b2200409f5
add readme
...
Former-commit-id: 5aa6751e52
2023-12-01 16:11:30 +08:00
hiyouga
a44ba7a2b8
tiny fix
...
Former-commit-id: e597d3c084
2023-12-01 15:58:50 +08:00
hoshi-hiyouga
6e094e491e
Merge pull request #1695 from Samge0/dev
...
Improve:"CUDA_VISIBLE_DEVICES" read from the env
Former-commit-id: fbc6220692
2023-12-01 15:56:18 +08:00
hoshi-hiyouga
752b3dd58d
Merge pull request #1690 from billvsme/main
...
Improve get_current_device
Former-commit-id: d043a4e7ba
2023-12-01 15:44:35 +08:00
hiyouga
9a6b694e12
fix #1696
...
Former-commit-id: bf6f6aeefe
2023-12-01 15:34:50 +08:00
tastelikefeet
63e12226a0
add model
...
Former-commit-id: 8ce4d11e38
2023-12-01 15:06:17 +08:00
hoshi-hiyouga
43ff245d88
Merge pull request #1689 from mlinmg/patch-2
...
Update dataset_info.json - Added Nectar
Former-commit-id: a0fde6e421
2023-12-01 14:29:36 +08:00
samge
7cf4e3b9c6
Improve:"CUDA_VISIBLE_DEVICES" read from the env
...
Former-commit-id: 421d4de604
2023-12-01 11:35:02 +08:00
Marco
a26f68ba47
Update dataset_info.json
...
Added the Nectar dataset, already preprocessed and split into SFT and RL subsets. I added a preprompt to each instruction, since this has been seen to improve instruction following.
Former-commit-id: 9468ee9012
2023-11-30 16:21:34 +01:00
billvsme
e400f2e8ad
improve get_current_device
...
Former-commit-id: 40dfcbc3d4
2023-11-30 22:40:35 +08:00
hiyouga
3d291a82d3
fix #1597
...
Former-commit-id: 327d7f7efe
2023-11-30 21:47:06 +08:00
hiyouga
ba6d290d0b
fix #1668
...
Former-commit-id: 1585962eb7
2023-11-30 21:02:00 +08:00
hiyouga
bb6b4823ad
fix #1682
...
Former-commit-id: a38dbf55e3
2023-11-30 20:03:32 +08:00
hiyouga
1c43fb6a41
add models
...
Former-commit-id: 509abe8864
2023-11-30 19:16:13 +08:00
yuze.zyz
45925e4a9c
fix
...
Former-commit-id: fb2204c183
2023-11-29 21:43:58 +08:00
yuze.zyz
e08e0e5814
support ms
...
Former-commit-id: d38a2e7341
2023-11-29 20:36:55 +08:00
hiyouga
8ec3617f19
add gpu requirement #1657
...
Former-commit-id: 9d38e5687d
2023-11-29 12:05:03 +08:00
hiyouga
ecfc7d1b50
fix #1658
...
Former-commit-id: 77d1b14fc2
2023-11-28 20:57:24 +08:00
hiyouga
ae1048db6d
fix #1659
...
Former-commit-id: 475a3fa0f4
2023-11-28 20:52:28 +08:00
hiyouga
33dc25e24f
Update wechat.jpg
...
Former-commit-id: c2d4300ac4
2023-11-28 17:27:23 +08:00
hiyouga
b015ac35d8
support export size setting
...
Former-commit-id: 859a6ea942
2023-11-26 18:34:09 +08:00
hiyouga
5f2943dc84
support Yi-34B-Chat models
...
Former-commit-id: ff1c289229
2023-11-23 19:31:49 +08:00
hiyouga
c4f6cf1270
update readme
...
Former-commit-id: 5085b00a1d
2023-11-21 13:15:46 +08:00
hiyouga
9697c3e970
set version
...
Former-commit-id: 35c2da3eba
2023-11-20 22:57:44 +08:00
hiyouga
4966bd7911
support GPTQ tuning #729 #1481 #1545, fix chatglm template #1453 #1480 #1569
...
Former-commit-id: 9ea9380145
2023-11-20 22:52:11 +08:00
hiyouga
f06c4c8f7a
update ppo trainer
...
Former-commit-id: 5021062493
2023-11-20 21:39:15 +08:00
hoshi-hiyouga
d72f123851
Merge pull request #1553 from hannlp/hans
...
Change the default argument settings for PPO training
Former-commit-id: 48211e3799
2023-11-20 20:32:55 +08:00
hiyouga
a7b1632ace
fix value head model resuming
...
Former-commit-id: 2a36fd5064
2023-11-20 19:01:37 +08:00
hiyouga
682d81caa9
fix #1567
...
Former-commit-id: 99a3f06377
2023-11-20 18:46:36 +08:00
hiyouga
32545bd6d9
better data streaming
...
Former-commit-id: 00baaa990e
2023-11-19 23:32:47 +08:00