-.-
|
acde60b6d8
|
fix src/llamafactory/train/callbacks.py
Former-commit-id: c79a21aeaa5462770790887a6826d335e1ded5a2
|
2024-07-10 12:05:51 +08:00 |
|
hiyouga
|
88cf5c3cc2
|
fix #4731
Former-commit-id: 99e016ee552a551b52b6fcf3616cb57a5b927715
|
2024-07-10 11:32:36 +08:00 |
|
hiyouga
|
b778f3f949
|
fix ppo trainer
Former-commit-id: a03b2e5ef0d5d6b1b27753438745385d290cb211
|
2024-07-10 11:05:45 +08:00 |
|
hiyouga
|
970031b25c
|
fix #4742
Former-commit-id: ae9cf84347878fcc462f35db941c14e1df104276
|
2024-07-09 23:24:24 +08:00 |
|
hoshi-hiyouga
|
2e11c6ecdc
|
Update packages.py
Former-commit-id: c61ee780f3aed51c31a81e912f25fbfd11dc7edd
|
2024-07-07 15:48:29 +08:00 |
|
Lian Junhong
|
25e086e02d
|
chore: Update vllm_engine.py to support vllm version >= 0.5.1
Former-commit-id: b73c23a88cef237db626a16ab2a30261afd36564
|
2024-07-07 15:08:12 +08:00 |
|
hiyouga
|
22409d7ee9
|
fix #4705
Former-commit-id: cfd25c6463bcc263c8672d1de365dd81a028b66a
|
2024-07-07 13:10:06 +08:00 |
|
marko1616
|
dfedd43464
|
Update utils.py
In windows mutiline command should like
command --arg1 xxx `
--arg2 xxx `
Former-commit-id: b189750520af1fccd0485052792eda269692df89
|
2024-07-06 20:40:13 +08:00 |
|
hiyouga
|
1fe104fd2c
|
add codegeex4, internlm2.5
Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c
|
2024-07-06 16:16:47 +08:00 |
|
codingma
|
82e941ff61
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
a0fd90ce05
|
fix processors
Former-commit-id: 7215f3a8612b570cd322802d14db532927900117
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
a49956efd9
|
fix #4683
Former-commit-id: cbff0ea0db6971f8ced503a2f0cb6bc43e7037ac
|
2024-07-05 00:58:05 +08:00 |
|
hiyouga
|
feb3b09081
|
fix #4674
Former-commit-id: c4f35627b4f0aeb6d4337c3d0e58318c46449f65
|
2024-07-05 00:41:03 +08:00 |
|
hiyouga
|
c5c91a364c
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory
Former-commit-id: f0b54254b43e93063232f633cdcf1e31d1419bfe
|
2024-07-04 14:23:37 +08:00 |
|
hiyouga
|
5dced9c740
|
fix #4677
Former-commit-id: d4b6715cab2e475dee2ff9f75c637f7611549ec7
|
2024-07-04 14:22:07 +08:00 |
|
hzhaoy
|
9ea80d83b8
|
tiny fix
Former-commit-id: 8f43ad988a4fd518a708fba53a173596ce2c59dd
|
2024-07-04 10:20:28 +08:00 |
|
hiyouga
|
ab24bde597
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
4a590180d5
|
tiny fix
Former-commit-id: 935703b46d2871ce1014832da067dfe4a50c0610
|
2024-07-04 03:02:23 +08:00 |
|
hiyouga
|
a718f0eb51
|
fix data map for packing
Former-commit-id: ee6f8f926f084a195b2dbbd074e041e6c62c6ef4
|
2024-07-04 03:01:31 +08:00 |
|
hiyouga
|
a0df8be4e8
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
9dcdaee09c
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
bd294e7cc3
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
d124ce001b
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
f849d03533
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
7c08a4a82a
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
fe888a9073
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
hiyouga
|
1c8d199740
|
update ui
Former-commit-id: b1522a3c0951e2e57f873dc6c758aaed33ca374e
|
2024-07-03 23:13:49 +08:00 |
|
hiyouga
|
767aae4b72
|
fix #4609
unwrap_model_for_generation(reward_model) is necessary for zero3 training
Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
|
2024-07-03 19:45:51 +08:00 |
|
hiyouga
|
e8a1dc2785
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|
hiyouga
|
3f2b9e9326
|
tiny fix
Former-commit-id: 98c4a0af6b3e27ae393d2847f48a01d23d9c8780
|
2024-07-02 23:06:13 +08:00 |
|
hiyouga
|
1aaee45a94
|
remove rlhf support for chatglm2&3
Former-commit-id: bcbb5b71961b89719bffb0d202c431c82e6067cc
|
2024-07-02 23:03:17 +08:00 |
|
hiyouga
|
6b755749b9
|
upcast logits
Former-commit-id: df61660351c8af30591471807a20869a45bb055a
|
2024-07-02 22:32:05 +08:00 |
|
hiyouga
|
ca106d1f1b
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|
ancv
|
260f55ea47
|
move efficient_packing from data_args to model_args
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
|
2024-07-02 18:37:55 +07:00 |
|
hoshi-hiyouga
|
5a1f8a7888
|
Merge pull request #4651 from hzhaoy/add-telechat-1b
Add TeleChat-1B
Former-commit-id: 2da64665d3da9dc0084bb782c65e88bac21f45a1
|
2024-07-02 17:56:43 +08:00 |
|
hzhaoy
|
1df3f02aca
|
add TeleChat-1B
Former-commit-id: 1b81b43fc483a21e0c2985b98459ecf5137aa4c4
|
2024-07-02 17:49:04 +08:00 |
|
hiyouga
|
8c3b285da2
|
fix ppo callbacks
Former-commit-id: 54f1c67c2a802b1d8368a6d1837d4c9a729f2695
|
2024-07-02 17:34:56 +08:00 |
|
hoshi-hiyouga
|
9174675ba9
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
14b37e1e03
|
tiny fix
Former-commit-id: 5dd2e5c3323f56420b5845a5ed28bcd9d4da5e41
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
711ffd0aaf
|
tiny fix
Former-commit-id: 19e43c3a9ed771e991cb273d394ab28fb923f868
|
2024-07-01 03:55:20 +08:00 |
|
hiyouga
|
8baf04d772
|
add eval acc
Former-commit-id: 7ffde76fbfb6192e3aac31ccc098f31ce89181ae
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
a43f518389
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 4b6568984c0be4b31e7aa91b7c0d52b7f7b12b0b
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
35c65ddf8c
|
fix #4398 #4592
Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
f7a4f3d9c0
|
loose gemma2 attention
Former-commit-id: a0b645017a2de3d58b6cbc71bd91ec96fc7a818b
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
6ce0b5891b
|
bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
Former-commit-id: da66c32c7be0adc28d2185b23e9f62d56acb961c
|
2024-06-28 06:00:26 +08:00 |
|
hiyouga
|
0bd6bcd95f
|
increase pissa_iter for stability
Former-commit-id: 03f8d9b0fb10ae58e7f68508197330d616957899
|
2024-06-28 03:18:54 +08:00 |
|
hiyouga
|
81094dc09a
|
add Gemma2 models
Former-commit-id: 8fc5a248ecfd6861cb90dac6c14fe89cdeaf8921
|
2024-06-28 01:26:50 +08:00 |
|
hiyouga
|
884a4a33ee
|
refactor pissa, improve llamaboard
Former-commit-id: 619556e46c19718f702c97df5d570a2a4c5fb13a
|
2024-06-28 01:04:24 +08:00 |
|
hoshi-hiyouga
|
bf8855f90b
|
Merge pull request #4580 from hzhaoy/bugfix-deepspeed-pissa
Fix bug when using pissa method with deepspeed
Former-commit-id: f260d458f91d6d2b4ed141f64844cded11d5aaad
|
2024-06-28 00:46:51 +08:00 |
|
hiyouga
|
b588a099db
|
fix #4549
Former-commit-id: c9fdef10de737d1f433209812ef73e29cb60490a
|
2024-06-28 00:41:58 +08:00 |
|