43 Commits

Author | SHA1 | Message | Date
hiyouga | c9a477322d | fix #3316 | 2024-04-17 22:54:34 +08:00
hoshi-hiyouga | 38a56706e0 | Update utils.py | 2024-04-16 17:29:30 +08:00
Jonery | 7ecb61822b | resolve gradient checkpointing issue. | 2024-04-16 12:05:27 +08:00
Jonery | 06c8908d3f | Feature BAdam | 2024-04-15 23:15:27 +08:00
hiyouga | 92dab8a90b | simplify readme | 2024-04-02 20:07:43 +08:00
hiyouga | 8c77b10912 | update trainers | 2024-03-28 18:16:27 +08:00
hiyouga | 72367307df | improve lora+ impl. | 2024-03-13 23:32:51 +08:00
hiyouga | d07ad5cc1c | support vllm | 2024-03-07 20:26:31 +08:00
hiyouga | 9aeb404a94 | support lora for llama pro | 2024-02-21 02:17:22 +08:00
hiyouga | 7924ffc55d | support llama pro #2338 , add rslora | 2024-02-15 02:27:36 +08:00
hiyouga | 19d33ede13 | fix #2420 | 2024-02-04 15:51:47 +08:00
hiyouga | 638234ceee | format style | 2024-01-20 20:15:56 +08:00
hiyouga | d9f1cae351 | support function calling | 2024-01-18 09:54:23 +08:00
hiyouga | 4571068e1e | fix #1789 | 2024-01-09 18:31:27 +08:00
hiyouga | ebee4f6a2a | fix #2127 | 2024-01-09 14:49:13 +08:00
hiyouga | 1696698eb9 | fix dispatch | 2024-01-03 16:33:16 +08:00
hiyouga | 55021097d5 | fix rm server | 2024-01-03 15:30:46 +08:00
hiyouga | 6629087e12 | update loader | 2023-12-24 19:10:23 +08:00
hiyouga | e44b82ee24 | update patcher | 2023-12-23 15:24:27 +08:00
hiyouga | c4a3977ad7 | add max_memory for gptq #1923 | 2023-12-20 18:15:17 +08:00
hiyouga | f86857bd9e | fix mixtral inference #1821 | 2023-12-20 15:11:15 +08:00
hiyouga | 870426ff70 | fix #1742 | 2023-12-16 20:50:45 +08:00
hiyouga | a66186b872 | add noisy mean initialization #1815 | 2023-12-16 19:47:51 +08:00
hiyouga | 2740aa9cbb | add configurer | 2023-12-15 21:46:40 +08:00
hiyouga | 0716f5e470 | refactor adapter hparam | 2023-12-15 20:53:11 +08:00
hoshi-hiyouga | 81167cd19d | tiny fix | 2023-12-13 17:32:36 +08:00
hoshi-hiyouga | 6953096c9d | tiny fix | 2023-12-13 10:21:29 +08:00
hoshi-hiyouga | 1fcd545c3d | fix #1819 | 2023-12-13 10:14:01 +08:00
hiyouga | 8cace77808 | update readme | 2023-12-12 11:44:30 +08:00
hiyouga | f4657de7d5 | fix baichuan resize | 2023-12-11 20:55:50 +08:00
hiyouga | 0239d29fa0 | tiny fix | 2023-12-11 18:09:40 +08:00
hiyouga | 64744dde89 | support resize embeddings #1786 | 2023-12-11 17:50:02 +08:00
hiyouga | b69763ff92 | fix #1642 | 2023-12-02 00:37:53 +08:00
hiyouga | f57445c7a0 | fix gptq training | 2023-12-02 00:27:15 +08:00
hiyouga | a973ce6e89 | tiny fix | 2023-12-01 23:37:10 +08:00
hiyouga | 01e6c539b0 | fix gptq model inference | 2023-12-01 23:34:14 +08:00
hiyouga | 211b2db5a8 | fix model card network issue | 2023-11-19 23:03:19 +08:00
hiyouga | 1740131d63 | fix #1558 | 2023-11-19 14:15:47 +08:00
hiyouga | ff6056405d | fix evaluator and cached_file in 4.31.0 | 2023-11-18 19:39:23 +08:00
hiyouga | 1bbc1be95e | fix #1550 | 2023-11-17 17:23:13 +08:00
hiyouga | 35b91ea34c | fix import bug | 2023-11-16 02:27:03 +08:00
hiyouga | ce78303600 | support full-parameter PPO | 2023-11-16 02:08:04 +08:00
hiyouga | 4736344eb1 | disentangle model from tuner and rename modules | 2023-11-15 16:29:09 +08:00