hoshi-hiyouga
|
5226c4fa97
|
Update trainer.py
Former-commit-id: 6700a1b9fa
|
2024-04-16 17:29:52 +08:00 |
|
Jonery
|
b3260c7456
|
resolve gradient checkpointing issue.
Former-commit-id: 7ecb61822b
|
2024-04-16 12:05:27 +08:00 |
|
Jonery
|
025f329445
|
Feature BAdam
Former-commit-id: 06c8908d3f
|
2024-04-15 23:15:27 +08:00 |
|
hiyouga
|
fb385b8c26
|
update examples
Former-commit-id: cce52351b5
|
2024-04-15 22:14:34 +08:00 |
|
hiyouga
|
89c400633a
|
update trainers
Former-commit-id: 8c77b10912
|
2024-03-28 18:16:27 +08:00 |
|
hoshi-hiyouga
|
ae9ad13f2a
|
fix ds optimizer
Former-commit-id: 3bcd41b639
|
2024-03-26 23:39:56 +08:00 |
|
hiyouga
|
8717e98200
|
fix #2777 #2895
Former-commit-id: 9bec3c98a2
|
2024-03-20 17:59:45 +08:00 |
|
hiyouga
|
3d483e0914
|
fix packages
Former-commit-id: 8e04794b2d
|
2024-03-17 22:32:03 +08:00 |
|
hiyouga
|
8b8671817f
|
improve lora+ impl.
Former-commit-id: 72367307df
|
2024-03-13 23:32:51 +08:00 |
|
齐保元
|
24c9277488
|
[FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: a0965cd62c
|
2024-03-13 19:43:27 +08:00 |
|
hiyouga
|
4a4e4b4354
|
support layerwise galore
Former-commit-id: 8664262cde
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
2c010c72b8
|
support galore
Former-commit-id: 28f7862188
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
31c618f1f7
|
tiny fix
Former-commit-id: 0048a2021e
|
2024-03-06 17:25:08 +08:00 |
|
hiyouga
|
d1e6e02461
|
fix #2649
Former-commit-id: 4e5fae2fac
|
2024-03-01 13:02:41 +08:00 |
|
hiyouga
|
b27e91222c
|
format style
Former-commit-id: 638234ceee
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
2f7684a8ee
|
fix tests
Former-commit-id: f6d6e00337
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
4e3bfb799d
|
support function calling
Former-commit-id: d9f1cae351
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
82a79e9fdf
|
fix #1073 #1462 #1735 #1908
Former-commit-id: 31165a9822
|
2023-12-20 17:15:40 +08:00 |
|
hiyouga
|
1cb390b9b2
|
implement rm server #1543
Former-commit-id: 7df4f3ab20
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
a53afb27eb
|
fix #1263
Former-commit-id: 065bfaeed4
|
2023-11-19 16:05:18 +08:00 |
|
hiyouga
|
48d6d925f7
|
fix #1558
Former-commit-id: 1740131d63
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
f441932bd1
|
support full-parameter PPO
Former-commit-id: ce78303600
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
06a4820836
|
disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1
|
2023-11-15 16:29:09 +08:00 |
|