Commit Graph

122 Commits

Author SHA1 Message Date
hiyouga
4581d09fa6 fix #944
Former-commit-id: 338b8664ed
2023-09-21 19:51:02 +08:00
hiyouga
8ab5566dc0 support FlashAttention2
Former-commit-id: d8aa1404be
2023-09-10 20:43:56 +08:00
hiyouga
9ed4bb63d4 change to right-padding, update reward score #803
Former-commit-id: 8ea32e4046
2023-09-08 20:04:31 +08:00
hiyouga
a4fd976048 refactor dataset_attr, add eos in pt, fix #757
Former-commit-id: a9d1fb72f7
2023-09-01 19:00:45 +08:00
codemayq
2b979d39f2 add stage in DatasetAttr
Former-commit-id: ba94c8729d
2023-08-23 20:54:53 +08:00
hiyouga
802494e20a update template
Former-commit-id: 4318347d3f
2023-08-22 19:46:09 +08:00
hiyouga
b88f0b396c support ppo score norm (trl 0.5.1.dev required)
Former-commit-id: 53e33418d0
2023-08-18 12:02:42 +08:00
hiyouga
03edfd07e7 fix PPO trainer #551 , update readme
Former-commit-id: 9020524418
2023-08-18 11:43:10 +08:00
hiyouga
edc15c62fa fix system prompt
Former-commit-id: 7407d9daa1
2023-08-16 01:35:52 +08:00
hiyouga
3f0a2d6adc support rope scaling, fix #475 #476 #478
Former-commit-id: fa940c17b8
2023-08-12 20:46:27 +08:00
hiyouga
79f4ba0d26 Release v0.1.6
Former-commit-id: a48cb0d474
2023-08-11 23:25:57 +08:00
hiyouga
abdfa26d06 support DPO training (2305.18290)
Former-commit-id: 3ec4351cfd
2023-08-11 03:02:53 +08:00
jiongxuc
7ffd961b8b huggingface login for projects must login while running
Former-commit-id: 3e000c2b60
2023-08-10 14:57:12 +08:00
hiyouga
6404167ab7 support val set in streaming mode
Former-commit-id: d86ea314a1
2023-08-09 23:00:26 +08:00
hiyouga
921778a7cf update args spec
Former-commit-id: 5453b93db0
2023-08-07 15:23:35 +08:00
hiyouga
b32ed1d7be support interleave probs
Former-commit-id: 69744c17e8
2023-08-04 21:27:35 +08:00
hiyouga
9c84c4ed5d support Qwen-7B, fix InternLM-7B inference
Former-commit-id: 87f8f830e2
2023-08-03 15:53:32 +08:00
hiyouga
e80b75b560 support streaming data, fix #284 #274 #268
Former-commit-id: 0411a4b3e1
2023-07-31 23:33:00 +08:00
hiyouga
2a664783c3 Update data_args.py
Former-commit-id: 513e1f1ec9
2023-07-28 17:42:41 +08:00
hiyouga
2e0342dc54 fix #194
Former-commit-id: 8f7819fcaa
2023-07-19 17:07:33 +08:00
hiyouga
b8b38a9ade create chat model
Former-commit-id: 657cf0f55a
2023-07-15 19:26:20 +08:00
hiyouga
a696148d6b modity code structure
Former-commit-id: f751376613
2023-07-15 16:54:28 +08:00