311 Commits

Author SHA1 Message Date
hoshi-hiyouga
a80cd5d86b Merge pull request #619 from hiyouga/feature-templateTest
add template encode test

Former-commit-id: bc7795655ffa5527020cf3d808b4ab46342e4585
2023-08-21 20:56:34 +08:00
codemayq
4b83d692b1 add template encode test
Former-commit-id: cbbee7933e80df9a9af45160c8e6c076df00b4f8
2023-08-21 20:51:24 +08:00
hiyouga
d05d535a58 fix #617
Former-commit-id: 5235b15c9181f2b68f7d6caa9a6324b8570d3d0c
2023-08-21 18:16:11 +08:00
hiyouga
e6f4eab4ab fix #608
Former-commit-id: 02d69b6fdefa6b303b84fb8195a159006fe3f50a
2023-08-21 17:49:36 +08:00
hiyouga
d3bef03dc6 fix baichuan template for training #597 #616
Former-commit-id: 0a3f6984259526775b0efdb8a1b0b24f564a7239
2023-08-21 17:41:51 +08:00
hiyouga
fa15416024 fix #595
Former-commit-id: 5c052836a0a19afd556418f1f6da8db8fc3cd37d
2023-08-20 16:40:00 +08:00
hoshi-hiyouga
d24ece4a57 Merge pull request #596 from beat4ocean/beat
fix KeyError: 'lang' bug

Former-commit-id: 1968d9d1d0f64d2d332c29d3c05c87d2e79b828f
2023-08-20 16:37:40 +08:00
beat4ocean
c81d1d40fa fix KeyError: 'lang' bug
Former-commit-id: 7b45de6b9f5a0159335614e78a106b595d726fc9
2023-08-20 15:32:36 +08:00
hiyouga
570ccc3618 fix ppo trainer #551
Former-commit-id: 0676497104eccc8a737d27890eabf1ca8713c235
2023-08-20 14:07:11 +08:00
hiyouga
8f9f618bcc Update wechat.jpg
Former-commit-id: 290be836b7f4c36020684259a0a0d31ef30cab80
2023-08-19 18:03:36 +08:00
hiyouga
acaac6df9e Release v0.1.7
Former-commit-id: 9c9009f49fdff83a29b83d8f97eb5c99e2574256
v0.1.7
2023-08-18 17:21:27 +08:00
hiyouga
9f1688924d tiny fix
Former-commit-id: d75e377b0f6f3fd7c034676b81ddef3aab1d6901
2023-08-18 13:07:35 +08:00
hiyouga
b88f0b396c support ppo score norm (trl 0.5.1.dev required)
Former-commit-id: 53e33418d02ee0f34c783e30ae510b811308c598
2023-08-18 12:02:42 +08:00
hiyouga
03edfd07e7 fix PPO trainer #551 , update readme
Former-commit-id: 90205244186df558cd6b0000728d638348db3a10
2023-08-18 11:43:10 +08:00
hiyouga
e93e9641f5 update readme
Former-commit-id: e4eec9ddfd3a9688733e018a96274dff0d5d9962
2023-08-18 01:51:55 +08:00
hiyouga
429a9e2512 Update .gitignore
Former-commit-id: 10cd6c9171d25ea2407a62cfa2b8adddddaf91b0
2023-08-18 01:43:42 +08:00
hiyouga
fceca0bb6a update training resuming
Former-commit-id: 58f13e22da18babed0d2d4348474e07745da8fa5
2023-08-18 01:41:17 +08:00
hoshi-hiyouga
f128cfe880 Merge pull request #434 from niuba/main
add last_checkpoint support

Former-commit-id: 7926432d2796c44649d27f515b2b69fcd9967dfa
2023-08-18 01:38:31 +08:00
hoshi-hiyouga
49d4ae3704 Merge branch 'main' into main
Former-commit-id: 725290324562e093565fae79a05341ebf64486d5
2023-08-18 01:37:23 +08:00
hiyouga
66771352bb support bf16 ppo #551
Former-commit-id: d125218cde893c7c8527ab27b4d2dfb2474c384d
2023-08-18 00:40:32 +08:00
hiyouga
caf4a61e21 fix ChatGLM2 ppo #527 #528
Former-commit-id: 9f4c2adc9a9ca8e458d3868805e077182e0d336a
2023-08-18 00:34:59 +08:00
hiyouga
623a34b16f fix generation bug #532
Former-commit-id: be21fc83f9aed0af1e5a2f83f5d5eeb36f1d283c
2023-08-17 22:21:34 +08:00
hiyouga
a46f277477 fix streaming in pt stage #548 #549
Former-commit-id: b0ed0dec5e6788a0344c09a6cc58d1116265fd68
2023-08-17 17:59:26 +08:00
hiyouga
327e14d3ea update readme
Former-commit-id: ff0aa793b6750830b3865c439ef64ed129ec9406
2023-08-17 11:00:22 +08:00
hiyouga
3021a01b71 fix baichuan and intern template
Former-commit-id: 892fd39373b816cf079e0decc9cb57dfb5565242
2023-08-17 01:27:20 +08:00
hiyouga
048f99354f fix generation
Former-commit-id: d9e62711a3349d7c6fd3512fb25c709bdfbb311a
2023-08-16 22:39:54 +08:00
hiyouga
edc15c62fa fix system prompt
Former-commit-id: 7407d9daa16bf6b3cd5002e16b2c53e402d2bc39
2023-08-16 01:35:52 +08:00
hiyouga
2ceaecfb42 fix baichuan template #481
Former-commit-id: 273135f59500a36cc30333ef2dd3689c6030e2ef
2023-08-15 11:38:21 +08:00
hoshi-hiyouga
c73e4de046 Merge pull request #516 from liuyanyi/add_gitignore
[Enhance] Add .gitignore file

Former-commit-id: 7f35487c4af8b4e1773fbfd3b0cdbb906c64b5aa
2023-08-15 11:25:40 +08:00
hiyouga
a9ab8f71d7 fix ChatGLM RLHF
Former-commit-id: af6c011fcb8ea9e5cf2eb4699da33d8668df04b4
2023-08-15 11:19:20 +08:00
hiyouga
603cb30887 Update wechat.jpg
Former-commit-id: a7dd9611dbd53f8cc4bb471c4f335558a349ea52
2023-08-15 11:13:46 +08:00
Yanyi Liu
d37be06aa0 Add .gitignore
Former-commit-id: 448478f9384a65f48895d5e0983a39a1296ebc36
2023-08-15 11:13:45 +08:00
hiyouga
d15fe288df alert pad_token source
Former-commit-id: 80b4053602c02aec724ecf980f8a279ffdf9f975
2023-08-15 00:07:56 +08:00
hiyouga
02a61b08b1 update webui
Former-commit-id: 9d0f6214b68a653c0a67632437b227ab8f589bed
2023-08-14 22:45:26 +08:00
hoshi-hiyouga
d021d31a9c Merge pull request #511 from hiyouga/feature-autoTemplate
add template match and stage in webui

Former-commit-id: adb0f186e9715e2772211b52463933a741d4ab70
2023-08-14 22:44:04 +08:00
codemayq
4a4623cf2d auto match template when change model_name
Former-commit-id: 0bf892ff1a4b9253ee569a7b0dc5270762af13d3
2023-08-14 20:56:05 +08:00
codemayq
ee7da14f81 add template match and stage in webui
Former-commit-id: 79c68e552722079faf2ab0858870b481844d66ae
2023-08-14 20:42:59 +08:00
hiyouga
2aa2c363ad fix ChatGLM lm_head #494
Former-commit-id: d0199568081f46e5d338ea511266eb420dd43594
2023-08-14 14:14:48 +08:00
hiyouga
2f70f96463 fix bug in webui
Former-commit-id: 20a29297b1c55678e7a56b2610b35edec063253b
2023-08-14 11:38:42 +08:00
hiyouga
307c814f38 fix webui cache
Former-commit-id: ca08e5efd3ccf155c6a0baa0a08bb7879f6b9b9b
2023-08-14 11:37:01 +08:00
hiyouga
5feb2eda89 update readme_zh
Former-commit-id: 2391a84e26d7acb3afc18617fefdfb1b3ab6f45d
2023-08-14 11:13:25 +08:00
hiyouga
6c9b035c0e web UI integrating RLHF
Former-commit-id: ec94274ca155300aee27621c018dd1bbaf78194b
2023-08-14 10:48:47 +08:00
hiyouga
e75024fde3 fix #480
Former-commit-id: 2f2fd55d8175eb3c6ce94bc821ab4e6331f79d8e
2023-08-14 00:23:56 +08:00
hiyouga
7984ae8b62 fix webui
Former-commit-id: d69b1388e61e5e867ec5c9a9a223677c5b5860ce
2023-08-12 23:52:07 +08:00
hiyouga
e1b43dfc7f tiny fix
Former-commit-id: 9dc6a296e327c5ff27cbd1697437d9d3145e3d9a
2023-08-12 22:02:43 +08:00
hiyouga
bd611e0090 fix rope scaling
Former-commit-id: 8545c11c45906b33c78e144c2338963eaf0406b8
2023-08-12 22:00:01 +08:00
hiyouga
2bcf0025d6 update readme
Former-commit-id: 8a79ded55d6e696368c96a6d9958e7c8cdaf977b
2023-08-12 21:29:06 +08:00
hiyouga
6af7fe15ad update readme
Former-commit-id: 3ea1fa35d1f951ae411248c03a8549b9714d876a
2023-08-12 21:25:19 +08:00
hiyouga
8686e62dfa update readme
Former-commit-id: 2618e0b5a7ad88f68971f21d0e7eb4560866400f
2023-08-12 21:23:05 +08:00
hiyouga
ba65dcb15e update readme
Former-commit-id: 1836c020c514e7a94aaa48abdf19ea8accbc1a2a
2023-08-12 21:00:11 +08:00