hiyouga
|
8ba0996a53
|
Update callbacks.py
Former-commit-id: d640c5545f6b2fcf7840df2b35a80f49a2de4ba5
|
2023-07-15 17:39:16 +08:00 |
|
hiyouga
|
70b5232f9a
|
fix callback
Former-commit-id: 22d9a9c2af6674eb832ae4aee80d679f19b7006f
|
2023-07-15 17:18:16 +08:00 |
|
hiyouga
|
a696148d6b
|
modity code structure
Former-commit-id: f75137661358f9070bc70c341dfa2cc5fd69cf94
|
2023-07-15 16:54:28 +08:00 |
|
hiyouga
|
7ea072536b
|
fix eval and pred loss
Former-commit-id: c30db9f1f0db5a6a660cdc60016755241762aae7
|
2023-07-14 13:11:57 +08:00 |
|
hiyouga
|
32ec440d2e
|
fix pretrain
Former-commit-id: a04115ec278e646347a6fec12f562494abd66e51
|
2023-07-13 23:41:54 +08:00 |
|
hiyouga
|
2b0fced03b
|
fix Baichuan-13B
Former-commit-id: 08439d29b2031ffbe77fe581c148e6d94e68bfc4
|
2023-07-13 23:08:45 +08:00 |
|
Jinghuan Shang
|
d42d175670
|
Fix typo in common.py
lastest -> latest
Former-commit-id: dc1e8b7181ba86f028dc22bea9d3c419814c1e19
|
2023-07-11 18:03:53 -04:00 |
|
hiyouga
|
925a026853
|
fix sft encode
Former-commit-id: b2f7cb446591e3722b5be8d250ddfe0caa226384
|
2023-07-11 19:50:33 +08:00 |
|
hiyouga
|
685ba1e02e
|
add baichuan template
Former-commit-id: 1af031c02b3871341cc4d719b5d62cfd02bc126a
|
2023-07-11 18:57:50 +08:00 |
|
hiyouga
|
c812429011
|
update api to match langchain
Former-commit-id: 84a06318d40fb595f3aa6d1141c107ef7710376c
|
2023-07-07 20:35:39 +08:00 |
|
hiyouga
|
8b4ef062b7
|
support InternLM
Former-commit-id: a2f507c56238d7fb2670edbab52d6b275f245e27
|
2023-07-07 11:02:28 +08:00 |
|
hiyouga
|
50a27509a3
|
fix rouge score
Former-commit-id: caa00d3ac24c989d56887af9d7ca79244e72714a
|
2023-07-06 14:28:34 +08:00 |
|
hiyouga
|
22fa516bd5
|
fix streaming response in API
Former-commit-id: 4abd2485e10e7198b63a396ce5432ac89c9f2d98
|
2023-07-05 22:42:31 +08:00 |
|
hiyouga
|
bcd7493212
|
fix freeze tuning
Former-commit-id: e6603977f695b71f6a36640256f6653ff513d783
|
2023-07-05 21:18:28 +08:00 |
|
hiyouga
|
265dc1b6a0
|
fix bug in PPO stage
Former-commit-id: a2ba69183b8e72c09242317a34545ab966ea8991
|
2023-07-05 19:14:10 +08:00 |
|
hiyouga
|
7ba52f5b6e
|
fix compute dtype
Former-commit-id: 8e3540c62d748dcecbcb3a29d57abe665e7b31c8
|
2023-07-05 15:13:00 +08:00 |
|
hiyouga
|
c87910ada3
|
support falcon model #72
Former-commit-id: c136f362c1aa75d3374b151188ba4a55d9313a59
|
2023-07-05 15:00:06 +08:00 |
|
hiyouga
|
9e8854535b
|
fix bleu score
Former-commit-id: cac87fd553500e6fc76430909a469e8b037a46ef
|
2023-07-05 00:11:21 +08:00 |
|
hiyouga
|
023f9367fc
|
set use_cache before saving model
Former-commit-id: 395ed1cf1b811a21fe61dd6b57040653ce98ba49
|
2023-07-04 23:18:20 +08:00 |
|
hiyouga
|
b9d56b2ac5
|
fix seq2seq predictions
Former-commit-id: 65e9ce2cdda3924170774f7b3c6e3fbdbdd87b7f
|
2023-07-04 22:56:51 +08:00 |
|
hiyouga
|
4ec1bff116
|
fix typo
Former-commit-id: 92fa515e977889bf1003f174785fe600db14f70e
|
2023-06-30 10:09:59 +08:00 |
|
hiyouga
|
1ce7b5e0f3
|
update loading logic
Former-commit-id: 4d0fddba213beaa55146b047a78963d1d18185a1
|
2023-06-28 12:07:16 +08:00 |
|
hiyouga
|
2ff577810a
|
fix loading best model
Former-commit-id: 0a46313ccaee91b51bec9f9f92e3111a4a04ce2e
|
2023-06-28 01:55:12 +08:00 |
|
hiyouga
|
c3cd2067b2
|
fix RM accuracy
Former-commit-id: 7826a8ca7722b138e79b13c42b1070771f6d5994
|
2023-06-28 01:40:13 +08:00 |
|
hiyouga
|
c5ec4eaef5
|
tiny fix
Former-commit-id: 450910c1db969533c5268022cb064cbc2c9cb7e6
|
2023-06-27 23:54:24 +08:00 |
|
hiyouga
|
e877b8d55b
|
fix initializing data arguments
Former-commit-id: 18f87c1b25c7d1bbc06ea2260a1473b7f296e0ff
|
2023-06-27 22:50:23 +08:00 |
|
hiyouga
|
307f5866e9
|
support save full model, replace BOS token
Former-commit-id: 2e01abfda5706d8913860f52ce3bab98739eae55
|
2023-06-27 21:40:11 +08:00 |
|
hiyouga
|
86a7b2dc5d
|
fix decoding in seq2seq
Former-commit-id: 1c732e2537cce56f537c3dcabf96f9245706f4bb
|
2023-06-27 19:33:08 +08:00 |
|
hiyouga
|
c145ca4ad6
|
fix generation in seq2seq.py
Former-commit-id: 117594802921177272032e58eba7012ae4805b99
|
2023-06-26 18:07:06 +08:00 |
|
hiyouga
|
8f1d99c926
|
support prefixes, loading multiple local files
Former-commit-id: cec9760eb890d37b733d8da73d0f3dbf924ca4ef
|
2023-06-26 15:32:40 +08:00 |
|
hiyouga
|
e4a869dc42
|
update api
Former-commit-id: f030b09924f0fb07305c244115759ac295e957c7
|
2023-06-26 13:39:57 +08:00 |
|
hiyouga
|
cf29a9af35
|
update readme
Former-commit-id: 0697643358ade295f3c6eb239765d231b46afe0b
|
2023-06-23 00:17:05 +08:00 |
|
hiyouga
|
0c7eb90f6b
|
update API
Former-commit-id: 614d3a996cd7a9444605b174d302ef9edd3c66c0
|
2023-06-22 20:46:24 +08:00 |
|
hiyouga
|
620cd2eb7e
|
match api with OpenAI format
Former-commit-id: 76ecb8c222cec34fa6dbcef71e3907c95f67c22f
|
2023-06-22 20:27:00 +08:00 |
|
Bun
|
cd066afa7b
|
Compatible with OpenAI API.
Former-commit-id: 6e4db0903fc1cdf57096a27b91fe904239719c9f
|
2023-06-21 14:45:04 +08:00 |
|
hiyouga
|
eeb78bd75c
|
add default template
Former-commit-id: f621f7631a4a9db4a927a6aeb8fefd3a94f14467
|
2023-06-16 21:12:17 +08:00 |
|
hiyouga
|
653ce9397e
|
fix freeze layers
Former-commit-id: a6c4b141cd5e75a411277a0b43d9967a8abdaae6
|
2023-06-16 17:38:21 +08:00 |
|
hiyouga
|
36ea46e85c
|
add source prefix
Former-commit-id: fc4d8155b35dcc453a64a50b21ce59050a15be99
|
2023-06-16 16:32:17 +08:00 |
|
hiyouga
|
c6d56e7109
|
support loading lora from hub
Former-commit-id: 0574b590ef3c4e317f7e2da25b0e5084dcef42a1
|
2023-06-16 00:02:17 +08:00 |
|
hiyouga
|
a68808d6d9
|
support baichuan model
Former-commit-id: 0cee6ad67ffb06f0d7165a0284e39f510a2abc36
|
2023-06-15 16:02:01 +08:00 |
|
hiyouga
|
50494db8d6
|
fix bug in template vanilla
Former-commit-id: c527399424d027a49d8584f4f7884eeabe5ea0df
|
2023-06-15 14:36:55 +08:00 |
|
hiyouga
|
dd1e7ed3cf
|
add BOS token in pre-training
Former-commit-id: d668f8b501c367276ef4be372f2eb1753a1b7e86
|
2023-06-15 01:46:17 +08:00 |
|
hiyouga
|
3419396945
|
support multiturn training like FastChat
Former-commit-id: b6faf0207d5b637722a1fd45984d27b3ac095fd4
|
2023-06-14 22:27:39 +08:00 |
|
hiyouga
|
ca90a1e6d9
|
fix loading valuehead
Former-commit-id: 875e8e23498f6933d657ad154b53611310327e3e
|
2023-06-13 11:13:06 +08:00 |
|
hiyouga
|
c92bfb158f
|
fix generating args
Former-commit-id: 531a3764d99ab00a0d217ce2ced0347b263dfe68
|
2023-06-13 01:33:56 +08:00 |
|
hiyouga
|
1fbda5d139
|
support RM metrics, add generating Args
Former-commit-id: cec6524d6b1be65c5d171a5b3dcaae7818132bc5
|
2023-06-12 15:48:48 +08:00 |
|
BUAADreamer
|
b1c6ee9cf5
|
add code for reading from multi files in one directory
Former-commit-id: a2af9df5a99ad529d0a280099b115cde69e02973
|
2023-06-10 16:27:30 +08:00 |
|
BUAADreamer
|
53727aee3e
|
add code for reading from multi files in one directory
Former-commit-id: 3dd5f9a874d66353bb4379bbe39a89cd425dac3d
|
2023-06-10 15:53:47 +08:00 |
|
hiyouga
|
587d7a907f
|
tiny fix
Former-commit-id: 2ba5d69c7f6e00e348c88b95331af9a80ede9561
|
2023-06-07 16:42:31 +08:00 |
|
hiyouga
|
fb9dedcb36
|
tiny fix
Former-commit-id: 16c2860d56581b90b20ad88631ddc3659ab7b56f
|
2023-06-07 16:02:07 +08:00 |
|