hiyouga
|
22995af79a
|
tiny fix
Former-commit-id: 952807b16cd85fa193a05a83b1a735a6b06abc82
|
2024-07-15 23:09:50 +08:00 |
|
hoshi-hiyouga
|
e2404a49d0
|
Merge pull request #4821 from codemayq/feature-eval-split
add "split" as suffix in eval task name
Former-commit-id: 5b6033eef3c2cfd5b47bb67e0d803d8de68f3ff0
|
2024-07-15 22:59:44 +08:00 |
|
hiyouga
|
473b77f6a0
|
fix #4820
Former-commit-id: 8c0f8357e1eebee32010fe715554f1136b68b4ba
|
2024-07-15 22:32:07 +08:00 |
|
codingma
|
bfadfdd27b
|
1. change the task name format
2. delete split param in data_args.py
Former-commit-id: 309d30efe24785912ff751fc573677875fc5819e
|
2024-07-15 09:55:33 +08:00 |
|
hiyouga
|
392ac88d78
|
allow computing rouge in training
Former-commit-id: ac67d50673989e8137965f5f718fec67c184f55b
|
2024-07-15 01:16:26 +08:00 |
|
hiyouga
|
8ce43766c6
|
fix up
Former-commit-id: 43a56cb331fae899ca35b0c312730d4ab79d0c42
|
2024-07-15 01:04:56 +08:00 |
|
hoshi-hiyouga
|
07ea5796e5
|
Merge pull request #4691 from codemayq/feature-suppot-eval-dataset
add eval dataset support
Former-commit-id: 51eb379b44fad0336fc96c329ec98dc4528b9c2c
|
2024-07-15 01:00:34 +08:00 |
|
hoshi-hiyouga
|
9f16e33570
|
Update data_args.py
Former-commit-id: c3cee10294d56a1bc226871819b3a725b09aa67e
|
2024-07-15 00:56:03 +08:00 |
|
hoshi-hiyouga
|
3cf18efeef
|
Update preprocess.py
Former-commit-id: da92f4a1b9c12a8e2489b964baba5e2c8e739ef1
|
2024-07-15 00:55:36 +08:00 |
|
hoshi-hiyouga
|
51265f81e7
|
Update parser.py
Former-commit-id: 145687997c86b8785e37dd60fbb9f3a5986730a6
|
2024-07-15 00:55:21 +08:00 |
|
hoshi-hiyouga
|
1e47c4a9d0
|
Update data_utils.py
Former-commit-id: 5c2a0e3b1d1afd2a9219d935d3421fffffc3a2c9
|
2024-07-15 00:54:34 +08:00 |
|
hoshi-hiyouga
|
f78cd9f9da
|
Update loader.py
Former-commit-id: 860e3eb374947b72dcae88cab0a93ef561e3bfb3
|
2024-07-15 00:50:06 +08:00 |
|
hoshi-hiyouga
|
6978a9838f
|
Update parser.py
Former-commit-id: b9760df588e64270a140d9111241c62c1cefe781
|
2024-07-14 23:04:34 +08:00 |
|
hiyouga
|
95f47490f9
|
fix #4699
slow tokenizer for yi models
Former-commit-id: 4d23a0bcda0c15a903a62eec72d14c584ce020dd
|
2024-07-14 15:34:22 +08:00 |
|
hiyouga
|
71275c49f8
|
tiny fix
Former-commit-id: 220d7c1ce15e8013a900e59fe0c7937e38b5c3b5
|
2024-07-14 10:56:45 +08:00 |
|
hiyouga
|
9cd850c3b9
|
fix gemma2 attention
Former-commit-id: aeafc68e169ae0ea5939cc81cb0cf89f0ca044b6
|
2024-07-13 23:33:45 +08:00 |
|
hoshi-hiyouga
|
3f6feba1ff
|
Merge pull request #4781 from hzhaoy/fix-dockerfile-cuda
Fix cuda Dockerfile
Former-commit-id: 56696f6c112f82d514dc3bf93182707297642639
|
2024-07-13 22:25:32 +08:00 |
|
hiyouga
|
0a56404dd2
|
fix #4792
Former-commit-id: d7547d6b9e4c660897e3ce0f4022e08686c172d5
|
2024-07-13 22:07:58 +08:00 |
|
hzhaoy
|
7043219f00
|
tiny fix
Former-commit-id: 48be67c41eb394d276b41ca22b28e1ef10af4920
|
2024-07-12 00:28:44 +08:00 |
|
hoshi-hiyouga
|
4c636b34e1
|
Merge pull request #4700 from marko1616/patch-1
Fix Windows command preview
Former-commit-id: bc49af1e8bde9c396ca4b1e608b7fad02b016ce6
|
2024-07-10 13:51:50 +08:00 |
|
hoshi-hiyouga
|
a4e3b21b9e
|
Update callbacks.py
Former-commit-id: 526376967deaad73b7ca11063a2e3f0c9a0add98
|
2024-07-10 13:32:20 +08:00 |
|
-.-
|
acde60b6d8
|
fix src/llamafactory/train/callbacks.py
Former-commit-id: c79a21aeaa5462770790887a6826d335e1ded5a2
|
2024-07-10 12:05:51 +08:00 |
|
hiyouga
|
88cf5c3cc2
|
fix #4731
Former-commit-id: 99e016ee552a551b52b6fcf3616cb57a5b927715
|
2024-07-10 11:32:36 +08:00 |
|
hiyouga
|
b778f3f949
|
fix ppo trainer
Former-commit-id: a03b2e5ef0d5d6b1b27753438745385d290cb211
|
2024-07-10 11:05:45 +08:00 |
|
hiyouga
|
970031b25c
|
fix #4742
Former-commit-id: ae9cf84347878fcc462f35db941c14e1df104276
|
2024-07-09 23:24:24 +08:00 |
|
hoshi-hiyouga
|
2e11c6ecdc
|
Update packages.py
Former-commit-id: c61ee780f3aed51c31a81e912f25fbfd11dc7edd
|
2024-07-07 15:48:29 +08:00 |
|
Lian Junhong
|
25e086e02d
|
chore: Update vllm_engine.py to support vllm version >= 0.5.1
Former-commit-id: b73c23a88cef237db626a16ab2a30261afd36564
|
2024-07-07 15:08:12 +08:00 |
|
hiyouga
|
22409d7ee9
|
fix #4705
Former-commit-id: cfd25c6463bcc263c8672d1de365dd81a028b66a
|
2024-07-07 13:10:06 +08:00 |
|
marko1616
|
dfedd43464
|
Update utils.py
In windows mutiline command should like
command --arg1 xxx `
--arg2 xxx `
Former-commit-id: b189750520af1fccd0485052792eda269692df89
|
2024-07-06 20:40:13 +08:00 |
|
hiyouga
|
1fe104fd2c
|
add codegeex4, internlm2.5
Former-commit-id: 349a5fbc934ac289cad44b4e3eb16f458b94710c
|
2024-07-06 16:16:47 +08:00 |
|
codingma
|
82e941ff61
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
a0fd90ce05
|
fix processors
Former-commit-id: 7215f3a8612b570cd322802d14db532927900117
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
a49956efd9
|
fix #4683
Former-commit-id: cbff0ea0db6971f8ced503a2f0cb6bc43e7037ac
|
2024-07-05 00:58:05 +08:00 |
|
hiyouga
|
feb3b09081
|
fix #4674
Former-commit-id: c4f35627b4f0aeb6d4337c3d0e58318c46449f65
|
2024-07-05 00:41:03 +08:00 |
|
hiyouga
|
c5c91a364c
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory
Former-commit-id: f0b54254b43e93063232f633cdcf1e31d1419bfe
|
2024-07-04 14:23:37 +08:00 |
|
hiyouga
|
5dced9c740
|
fix #4677
Former-commit-id: d4b6715cab2e475dee2ff9f75c637f7611549ec7
|
2024-07-04 14:22:07 +08:00 |
|
hzhaoy
|
9ea80d83b8
|
tiny fix
Former-commit-id: 8f43ad988a4fd518a708fba53a173596ce2c59dd
|
2024-07-04 10:20:28 +08:00 |
|
hiyouga
|
ab24bde597
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
4a590180d5
|
tiny fix
Former-commit-id: 935703b46d2871ce1014832da067dfe4a50c0610
|
2024-07-04 03:02:23 +08:00 |
|
hiyouga
|
a718f0eb51
|
fix data map for packing
Former-commit-id: ee6f8f926f084a195b2dbbd074e041e6c62c6ef4
|
2024-07-04 03:01:31 +08:00 |
|
hiyouga
|
a0df8be4e8
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
9dcdaee09c
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
bd294e7cc3
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
d124ce001b
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
f849d03533
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
7c08a4a82a
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
fe888a9073
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
hiyouga
|
1c8d199740
|
update ui
Former-commit-id: b1522a3c0951e2e57f873dc6c758aaed33ca374e
|
2024-07-03 23:13:49 +08:00 |
|
hiyouga
|
767aae4b72
|
fix #4609
unwrap_model_for_generation(reward_model) is necessary for zero3 training
Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
|
2024-07-03 19:45:51 +08:00 |
|
hiyouga
|
e8a1dc2785
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|