Commit Graph

1205 Commits

Author SHA1 Message Date
hiyouga
dbb8342ec0 add err hint
Former-commit-id: a51b8ec620
2023-12-01 17:04:37 +08:00
hoshi-hiyouga
947192900a Merge pull request #1699 from Samge0/patch-1
Update .gitignore

Former-commit-id: aec946b119
2023-12-01 16:52:57 +08:00
SamgeShao
c5842fcd96 Update .gitignore
Former-commit-id: 7cabb9903d
2023-12-01 16:37:41 +08:00
yuze.zyz
b2200409f5 add readme
Former-commit-id: 5aa6751e52
2023-12-01 16:11:30 +08:00
hiyouga
a44ba7a2b8 tiny fix
Former-commit-id: e597d3c084
2023-12-01 15:58:50 +08:00
hoshi-hiyouga
6e094e491e Merge pull request #1695 from Samge0/dev
Improve:"CUDA_VISIBLE_DEVICES" read from the env

Former-commit-id: fbc6220692
2023-12-01 15:56:18 +08:00
hoshi-hiyouga
752b3dd58d Merge pull request #1690 from billvsme/main
Improve get_current_device

Former-commit-id: d043a4e7ba
2023-12-01 15:44:35 +08:00
hiyouga
9a6b694e12 fix #1696
Former-commit-id: bf6f6aeefe
2023-12-01 15:34:50 +08:00
tastelikefeet
63e12226a0 add model
Former-commit-id: 8ce4d11e38
2023-12-01 15:06:17 +08:00
hoshi-hiyouga
43ff245d88 Merge pull request #1689 from mlinmg/patch-2
Update dataset_info.json - Added Nectar

Former-commit-id: a0fde6e421
2023-12-01 14:29:36 +08:00
samge
7cf4e3b9c6 Improve:"CUDA_VISIBLE_DEVICES" read from the env
Former-commit-id: 421d4de604
2023-12-01 11:35:02 +08:00
Marco
a26f68ba47 Update dataset_info.json
Added the Nectar dataset already preprocessed and divided in sft and rl to which I added a preprompt to each instruction since it has been seen that this increase instruction following

Former-commit-id: 9468ee9012
2023-11-30 16:21:34 +01:00
billvsme
e400f2e8ad improve get_current_device
Former-commit-id: 40dfcbc3d4
2023-11-30 22:40:35 +08:00
hiyouga
3d291a82d3 fix #1597
Former-commit-id: 327d7f7efe
2023-11-30 21:47:06 +08:00
hiyouga
ba6d290d0b fix #1668
Former-commit-id: 1585962eb7
2023-11-30 21:02:00 +08:00
hiyouga
bb6b4823ad fix #1682
Former-commit-id: a38dbf55e3
2023-11-30 20:03:32 +08:00
hiyouga
1c43fb6a41 add models
Former-commit-id: 509abe8864
2023-11-30 19:16:13 +08:00
yuze.zyz
45925e4a9c fix
Former-commit-id: fb2204c183
2023-11-29 21:43:58 +08:00
yuze.zyz
e08e0e5814 support ms
Former-commit-id: d38a2e7341
2023-11-29 20:36:55 +08:00
hiyouga
8ec3617f19 add gpu requirement #1657
Former-commit-id: 9d38e5687d
2023-11-29 12:05:03 +08:00
hiyouga
ecfc7d1b50 fix #1658
Former-commit-id: 77d1b14fc2
2023-11-28 20:57:24 +08:00
hiyouga
ae1048db6d fix #1659
Former-commit-id: 475a3fa0f4
2023-11-28 20:52:28 +08:00
hiyouga
33dc25e24f Update wechat.jpg
Former-commit-id: c2d4300ac4
2023-11-28 17:27:23 +08:00
hiyouga
b015ac35d8 support export size setting
Former-commit-id: 859a6ea942
2023-11-26 18:34:09 +08:00
hiyouga
5f2943dc84 support Yi-34B-Chat models
Former-commit-id: ff1c289229
2023-11-23 19:31:49 +08:00
hiyouga
c4f6cf1270 update readme
Former-commit-id: 5085b00a1d
2023-11-21 13:15:46 +08:00
hiyouga
9697c3e970 set version
Former-commit-id: 35c2da3eba
2023-11-20 22:57:44 +08:00
hiyouga
4966bd7911 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
Former-commit-id: 9ea9380145
2023-11-20 22:52:11 +08:00
hiyouga
f06c4c8f7a update ppo trainer
Former-commit-id: 5021062493
2023-11-20 21:39:15 +08:00
hoshi-hiyouga
d72f123851 Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training

Former-commit-id: 48211e3799
2023-11-20 20:32:55 +08:00
hiyouga
a7b1632ace fix value head model resuming
Former-commit-id: 2a36fd5064
2023-11-20 19:01:37 +08:00
hiyouga
682d81caa9 fix #1567
Former-commit-id: 99a3f06377
2023-11-20 18:46:36 +08:00
hiyouga
32545bd6d9 better data streaming
Former-commit-id: 00baaa990e
2023-11-19 23:32:47 +08:00
hiyouga
d1e03512f4 fix model card network issue
Former-commit-id: 211b2db5a8
2023-11-19 23:03:19 +08:00
hiyouga
8d82d7e994 fix Mistral template
https://github.com/lm-sys/FastChat/pull/2547

Former-commit-id: bfb9433165
2023-11-19 16:29:30 +08:00
hiyouga
a53afb27eb fix #1263
Former-commit-id: 065bfaeed4
2023-11-19 16:05:18 +08:00
hiyouga
48d6d925f7 fix #1558
Former-commit-id: 1740131d63
2023-11-19 14:15:47 +08:00
hiyouga
112108d564 fix evaluator and cached_file in 4.31.0
Former-commit-id: ff6056405d
2023-11-18 19:39:23 +08:00
hiyouga
6d8d8509da update benchmark
Former-commit-id: a2019c8b61
2023-11-18 11:30:01 +08:00
hiyouga
60fe410f05 update readme
Former-commit-id: 90212280d6
2023-11-18 11:15:56 +08:00
hiyouga
3dc7c0a88c add benchmark
Former-commit-id: 329134f58c
2023-11-18 11:09:52 +08:00
hiyouga
303956cbb9 update dataset
Former-commit-id: 7b1aa6f63c
2023-11-17 23:19:12 +08:00
hiyouga
0d98d1a28c fix quantization
Former-commit-id: ccb0f58e22
2023-11-17 22:21:29 +08:00
hiyouga
f9df6c17ed fix #1550
Former-commit-id: 1bbc1be95e
2023-11-17 17:23:13 +08:00
Yuchen Han
b92259ae02 Update README_zh.md
Former-commit-id: 7cab47b822
2023-11-17 00:18:07 -08:00
Yuchen Han
c761a9987c Update README.md
Former-commit-id: c9b499fa7e
2023-11-17 00:17:36 -08:00
Yuchen Han
a419122179 Update workflow.py
Former-commit-id: eeb5249d0b
2023-11-17 00:16:27 -08:00
Yuchen Han
ec910a87c0 Update finetuning_args.py
Former-commit-id: b24635d22b
2023-11-17 00:15:51 -08:00
hiyouga
d3c4881ccb fix packages
Former-commit-id: 999bc0ed93
2023-11-17 16:11:48 +08:00
hoshi-hiyouga
0d2262ffd0 Merge #1544 from Outsider565/main, fix #1548
Fix: Change rouge-chinese package name to rouge_chinese
Former-commit-id: 7f9770b2c6
2023-11-17 16:09:42 +08:00