| Author | Commit | Message | Date |
|---|---|---|---|
| hiyouga | fecde5c13f | tiny fix (Former-commit-id: 2d8d47f6126d68db1701ed18fc31310c6f14dd49) | 2024-06-20 22:56:05 +08:00 |
| hiyouga | 0680f18633 | update patcher (Former-commit-id: afb365e515d615dd62f791622450debab60ce5cc) | 2024-06-19 21:27:00 +08:00 |
| hiyouga | 650bb45954 | fix #4357 (Former-commit-id: a6741bba8cebd16a6a3f97a2dc81057d0e27eb39) | 2024-06-18 22:42:45 +08:00 |
| hiyouga | bb8c7e7048 | fix #4326 (Former-commit-id: 3c2c45812a720d92f7f5b15b9f03370fe6bf069e) | 2024-06-17 18:17:48 +08:00 |
| ancv | 84e1f06e45 | update packing with sdpa and eager attention mode (Former-commit-id: 285636ba3a57a1038b2f2fd4cf909a1ca07708d4) | 2024-06-16 02:25:47 +07:00 |
| hiyouga | 0b571f84b4 | support pissa (Former-commit-id: ef8e45f2eaf466c54e9a671512a2974575677b08) | 2024-06-16 01:08:12 +08:00 |
| hiyouga | 640372cb66 | tiny fix (Former-commit-id: f7f440986b0ae3b38ea9f2da80789629d4f79ea1) | 2024-06-16 01:06:41 +08:00 |
| ancv | c5e1dfb3a0 | remove some unused params (Former-commit-id: fef8132c50505a5fb6a246bd024491bd31798a3c) | 2024-06-15 23:00:55 +07:00 |
| hiyouga | acfae2e677 | add license (Former-commit-id: 69cfc98d7c81756a5ab6bf962240e393e449fef0) | 2024-06-15 17:54:33 +08:00 |
| hiyouga | bbeb3b10aa | add test cases (Former-commit-id: 731176ff34cdf0cbf6b41c40c69f4ceb54c2daf6) | 2024-06-15 04:05:54 +08:00 |
| hiyouga | 344d1192ac | clean code (Former-commit-id: f54cafd5c7f0383370d1a2f357834a61a97397ce) | 2024-06-13 01:58:16 +08:00 |
| ancv | 4463a5227a | implement efficient packing without cross-contamination attention (Former-commit-id: a64a5305c0da5ef092d4cc26faf829bb44de65d1) | 2024-06-12 11:56:01 +07:00 |
| hiyouga | a7233181f2 | fix deepspeed version (Former-commit-id: 938a69bb07d4de7d82928ff01c582032162c1480) | 2024-06-11 16:52:36 +08:00 |
| hiyouga | 95f95bef60 | fix #4198 (Former-commit-id: 945d2c6cc73542adf9272ebd9aa332ea2c1c7361) | 2024-06-11 15:38:38 +08:00 |
| hiyouga | 8c7943c4de | tiny fix (Former-commit-id: b5e9711ef375cc323fc083e742cccfc974550416) | 2024-06-11 01:04:16 +08:00 |
| hiyouga | 68df064c1f | fix #4160: The split heads should be concatenated in dim=2 (Former-commit-id: 4b3f247f270d44df9fe226cfe0dabfb7fcd2deda) | 2024-06-11 00:37:17 +08:00 |
| hiyouga | 7474e8035f | fix #2666 (Former-commit-id: f121d5c4f94af9f165132c4309cb9bdc8217d985) | 2024-06-10 21:24:15 +08:00 |
| hiyouga | 35a36d96e5 | reorganize adapter code (Former-commit-id: b26c2df9d97f4efffccbf7d28de13619b43f10dd) | 2024-06-08 00:47:23 +08:00 |
| hoshi-hiyouga | 17c66e9502 | fix #4139 (Former-commit-id: c025a4d74f293c14c2705e68af20a82a84608520) | 2024-06-08 00:45:02 +08:00 |
| hiyouga | 5606780ab6 | add resume args in webui (Former-commit-id: 1d86ad768b1f36e54b4c2a9f18f6ea5a7df04c90) | 2024-06-08 00:22:16 +08:00 |
| hiyouga | 0b1f4a34f8 | rename files (Former-commit-id: e1a8431770fc36c0c9ee7fed4abbc3d7fdcc5efd) | 2024-06-07 00:09:06 +08:00 |
| hiyouga | d3a378ffea | fix torch gc (Former-commit-id: e173799d057598e5692a407601c30d8ce1513461) | 2024-06-06 20:30:25 +08:00 |
| hiyouga | c91655e952 | support train from scratch #4033 #4075 (Former-commit-id: 1290b9d01077e62f8de7a23637daa2586cc82bfa) | 2024-06-06 02:43:19 +08:00 |
| hiyouga | a3dd6f887c | fix full/freeze tuning for mllm (Former-commit-id: df5860ddb593d5b82163a585d12160b41dbce0f3) | 2024-05-27 20:37:57 +08:00 |
| BUAADreamer | 765cd370da | add regex of only tune lm and mm_proj (Former-commit-id: 38d540b3e69bceabafafab524fcfc78aeb05612d) | 2024-05-27 18:59:00 +08:00 |
| BUAADreamer | 97484fa020 | Merge branch 'hiyouga:main' into main (Former-commit-id: fd5420c43e1414bcd3fadb6239f4e5d42e6ac10e) | 2024-05-25 14:18:49 +08:00 |
| hiyouga | 8f233e50ef | fix #3853 (Former-commit-id: 465a5500bae1f30744d4b9b3db40aaf9171da2cb) | 2024-05-24 23:29:45 +08:00 |
| BUAADreamer | aaadaa18f6 | support pretraining of llava (Former-commit-id: 6a4c8cf0a6a1674c693b9337f018ff8df7477f8f) | 2024-05-21 08:57:14 +08:00 |
| hiyouga | 94887b296b | fix zero2 high ram usage (Former-commit-id: 01797126eb173250250e31f8e76b69ae0047745d) | 2024-05-19 21:53:54 +08:00 |
| hiyouga | 4114e67a5f | fix #3807 (Former-commit-id: 08b695969049de8bf9bd3e90b9700736d90385ee) | 2024-05-19 17:07:57 +08:00 |
| hiyouga | 621d755a53 | fix jetmoe z3 block (Former-commit-id: cb00a14d905395c4b8fadb955f0424a4c56668de) | 2024-05-18 22:28:45 +08:00 |
| hiyouga | 18b2f23f4f | better dtype handle in loading (Former-commit-id: 663f0577dd61a1a31191db2c6fbb0c7cea533b21) | 2024-05-17 02:14:56 +08:00 |
| hiyouga | ee759aa0d8 | rename package (Former-commit-id: a07ff0c083558cfe6f474d13027642d3052fee08) | 2024-05-16 18:39:08 +08:00 |