hiyouga
|
c9a477322d
|
fix #3316
|
2024-04-17 22:54:34 +08:00 |
|
hoshi-hiyouga
|
38a56706e0
|
Update utils.py
|
2024-04-16 17:29:30 +08:00 |
|
Jonery
|
7ecb61822b
|
resolve gradient checkpointing issue.
|
2024-04-16 12:05:27 +08:00 |
|
Jonery
|
06c8908d3f
|
Feature BAdam
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
8c77b10912
|
update trainers
|
2024-03-28 18:16:27 +08:00 |
|
hiyouga
|
72367307df
|
improve lora+ impl.
|
2024-03-13 23:32:51 +08:00 |
|
hiyouga
|
d07ad5cc1c
|
support vllm
|
2024-03-07 20:26:31 +08:00 |
|
hiyouga
|
9aeb404a94
|
support lora for llama pro
|
2024-02-21 02:17:22 +08:00 |
|
hiyouga
|
7924ffc55d
|
support llama pro #2338 , add rslora
|
2024-02-15 02:27:36 +08:00 |
|
hiyouga
|
19d33ede13
|
fix #2420
|
2024-02-04 15:51:47 +08:00 |
|
hiyouga
|
638234ceee
|
format style
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
d9f1cae351
|
support function calling
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
|
hiyouga
|
ebee4f6a2a
|
fix #2127
|
2024-01-09 14:49:13 +08:00 |
|
hiyouga
|
1696698eb9
|
fix dispatch
|
2024-01-03 16:33:16 +08:00 |
|
hiyouga
|
55021097d5
|
fix rm server
|
2024-01-03 15:30:46 +08:00 |
|
hiyouga
|
6629087e12
|
update loader
|
2023-12-24 19:10:23 +08:00 |
|
hiyouga
|
e44b82ee24
|
update patcher
|
2023-12-23 15:24:27 +08:00 |
|
hiyouga
|
c4a3977ad7
|
add max_memory for gptq #1923
|
2023-12-20 18:15:17 +08:00 |
|
hiyouga
|
f86857bd9e
|
fix mixtral inference #1821
|
2023-12-20 15:11:15 +08:00 |
|
hiyouga
|
870426ff70
|
fix #1742
|
2023-12-16 20:50:45 +08:00 |
|
hiyouga
|
a66186b872
|
add noisy mean initialization #1815
|
2023-12-16 19:47:51 +08:00 |
|
hiyouga
|
2740aa9cbb
|
add configurer
|
2023-12-15 21:46:40 +08:00 |
|
hiyouga
|
0716f5e470
|
refactor adapter hparam
|
2023-12-15 20:53:11 +08:00 |
|
hoshi-hiyouga
|
81167cd19d
|
tiny fix
|
2023-12-13 17:32:36 +08:00 |
|
hoshi-hiyouga
|
6953096c9d
|
tiny fix
|
2023-12-13 10:21:29 +08:00 |
|
hoshi-hiyouga
|
1fcd545c3d
|
fix #1819
|
2023-12-13 10:14:01 +08:00 |
|
hiyouga
|
8cace77808
|
update readme
|
2023-12-12 11:44:30 +08:00 |
|
hiyouga
|
f4657de7d5
|
fix baichuan resize
|
2023-12-11 20:55:50 +08:00 |
|
hiyouga
|
0239d29fa0
|
tiny fix
|
2023-12-11 18:09:40 +08:00 |
|
hiyouga
|
64744dde89
|
support resize embeddings #1786
|
2023-12-11 17:50:02 +08:00 |
|
hiyouga
|
b69763ff92
|
fix #1642
|
2023-12-02 00:37:53 +08:00 |
|
hiyouga
|
f57445c7a0
|
fix gptq training
|
2023-12-02 00:27:15 +08:00 |
|
hiyouga
|
a973ce6e89
|
tiny fix
|
2023-12-01 23:37:10 +08:00 |
|
hiyouga
|
01e6c539b0
|
fix gptq model inference
|
2023-12-01 23:34:14 +08:00 |
|
hiyouga
|
211b2db5a8
|
fix model card network issue
|
2023-11-19 23:03:19 +08:00 |
|
hiyouga
|
1740131d63
|
fix #1558
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
ff6056405d
|
fix evaluator and cached_file in 4.31.0
|
2023-11-18 19:39:23 +08:00 |
|
hiyouga
|
1bbc1be95e
|
fix #1550
|
2023-11-17 17:23:13 +08:00 |
|
hiyouga
|
35b91ea34c
|
fix import bug
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
ce78303600
|
support full-parameter PPO
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
|