codingma
|
5f2bd04799
|
1. add custom eval dataset support
2. merge load dataset and split dataset function
Former-commit-id: 963d97ba07e7efa3a4544c4d077283d9e112b3ad
|
2024-07-05 15:52:10 +08:00 |
|
hiyouga
|
9a1a5f9778
|
fix processors
Former-commit-id: 7215f3a8612b570cd322802d14db532927900117
|
2024-07-05 08:33:22 +08:00 |
|
hiyouga
|
edc8aefa59
|
fix #4683
Former-commit-id: cbff0ea0db6971f8ced503a2f0cb6bc43e7037ac
|
2024-07-05 00:58:05 +08:00 |
|
hiyouga
|
ee1c786a12
|
fix #4674
Former-commit-id: c4f35627b4f0aeb6d4337c3d0e58318c46449f65
|
2024-07-05 00:41:03 +08:00 |
|
hiyouga
|
a3e4f2b716
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Factory
Former-commit-id: f0b54254b43e93063232f633cdcf1e31d1419bfe
|
2024-07-04 14:23:37 +08:00 |
|
hiyouga
|
6685f1fb9e
|
fix #4677
Former-commit-id: d4b6715cab2e475dee2ff9f75c637f7611549ec7
|
2024-07-04 14:22:07 +08:00 |
|
hoshi-hiyouga
|
c89ff328f6
|
Merge pull request #4673 from hzhaoy/main
tiny fix
Former-commit-id: e0ef32fc3a5469cdd854288c4bb9eb78bb7e27f1
|
2024-07-04 10:40:41 +08:00 |
|
hzhaoy
|
c6f1bc65c0
|
tiny fix
Former-commit-id: 8f43ad988a4fd518a708fba53a173596ce2c59dd
|
2024-07-04 10:20:28 +08:00 |
|
hiyouga
|
0f43c61229
|
update tests
Former-commit-id: 8c479a4f7fc97dedc9ca9ceea9e0dd3c4d573253
|
2024-07-04 04:00:12 +08:00 |
|
hiyouga
|
8567dab167
|
tiny fix
Former-commit-id: 9b211861eba19ae9fc360bc96eeb8ad67ba40c49
|
2024-07-04 03:47:05 +08:00 |
|
hiyouga
|
0517d7bee5
|
tiny fix
Former-commit-id: 935703b46d2871ce1014832da067dfe4a50c0610
|
2024-07-04 03:02:23 +08:00 |
|
hiyouga
|
5bc0b9b31c
|
fix data map for packing
Former-commit-id: ee6f8f926f084a195b2dbbd074e041e6c62c6ef4
|
2024-07-04 03:01:31 +08:00 |
|
hiyouga
|
3d219b91b9
|
fix packing for eager/sdpa attn
Former-commit-id: 735a033ceb7f2da6da71d138ea091d8a665411a9
|
2024-07-04 01:52:43 +08:00 |
|
hoshi-hiyouga
|
a90c6306f8
|
Merge pull request #4224 from chuan298/main
Implement efficient packing without cross-contamination attention
Former-commit-id: ac382cc9fe4ec483658fd54f07f9a123788ce1b1
|
2024-07-04 01:18:54 +08:00 |
|
hiyouga
|
60558388ec
|
update packing
Former-commit-id: f3d9c31efa0e64317bdd5b4ed6f78653cf3b5ba4
|
2024-07-04 01:10:55 +08:00 |
|
hoshi-hiyouga
|
b29a7f8cd6
|
Update packing.py
Former-commit-id: 3cc11aa88839c5b99cfd83d9225770a33d0eb6fd
|
2024-07-03 23:36:01 +08:00 |
|
hiyouga
|
a1501591e8
|
update func name
Former-commit-id: ed93ac0829fa656194fd32e1ac063843f475746f
|
2024-07-03 23:29:33 +08:00 |
|
hiyouga
|
1408aa078d
|
update arg name
Former-commit-id: 1509ed550b2060f946ce20e3c5a9e5c49e86e3ab
|
2024-07-03 23:23:24 +08:00 |
|
hiyouga
|
5acaa476d6
|
update hparams
Former-commit-id: 1c4feac44192b1f540208837f5a530b0d3f5fb37
|
2024-07-03 23:18:58 +08:00 |
|
hiyouga
|
8ac4f87c91
|
update ui
Former-commit-id: b1522a3c0951e2e57f873dc6c758aaed33ca374e
|
2024-07-03 23:13:49 +08:00 |
|
hiyouga
|
14d3001824
|
test
Former-commit-id: 610eea0c0a0069fdc9148620b15ffffcfef731ea
|
2024-07-03 23:05:39 +08:00 |
|
hiyouga
|
1ac9389ddc
|
update scripts
Former-commit-id: 6dd6bae598d4d0b7b7d80341e88e313e49a49c00
|
2024-07-03 20:07:44 +08:00 |
|
hiyouga
|
0b0e27c2f1
|
fix #4609
unwrap_model_for_generation(reward_model) is necessary for zero3 training
Former-commit-id: c8d5b21700577cae8d6ca03359bcf1762c8b7cb8
|
2024-07-03 19:45:51 +08:00 |
|
hiyouga
|
fd1199cce4
|
update readme
Former-commit-id: 4b5f05b791fce9fdc4678598d7be8dc954f9ff73
|
2024-07-03 19:39:05 +08:00 |
|
hoshi-hiyouga
|
3c9eda8265
|
Merge pull request #4662 from wzh1994/wzh/readme
Add `LazyLLM` to `Projects using LLaMA Factory` in `README.md`
Former-commit-id: 5ac6334cc40cefda91f5344f60ec0d4757d76df4
|
2024-07-03 15:51:02 +08:00 |
|
wangzhihong
|
6622cdb43f
|
Update README_zh.md
Former-commit-id: d4036add433989ad88d54895b6f5af90b393c009
|
2024-07-03 14:59:09 +08:00 |
|
wangzhihong
|
49c28a7dab
|
add LazyLLM to Projects using LLaMA Factory in README.md
Former-commit-id: e1d8587ea120ad356df35431f84af92193fcbaf3
|
2024-07-03 11:12:20 +08:00 |
|
hiyouga
|
a42671c2d7
|
tiny fix
Former-commit-id: d944020257f363f38e62de6279b337e399b7c65e
|
2024-07-03 02:31:50 +08:00 |
|
hiyouga
|
f17ab6ad92
|
tiny fix
Former-commit-id: 98c4a0af6b3e27ae393d2847f48a01d23d9c8780
|
2024-07-02 23:06:13 +08:00 |
|
hiyouga
|
ca548af2a2
|
remove rlhf support for chatglm2&3
Former-commit-id: bcbb5b71961b89719bffb0d202c431c82e6067cc
|
2024-07-02 23:03:17 +08:00 |
|
hiyouga
|
579997688f
|
upcast logits
Former-commit-id: df61660351c8af30591471807a20869a45bb055a
|
2024-07-02 22:32:05 +08:00 |
|
hiyouga
|
e6ba7ef3e6
|
improve rlhf
Former-commit-id: e441780e3db256ca09a442ea9254e7ce16898a07
|
2024-07-02 22:23:08 +08:00 |
|
ancv
|
20fdf177e8
|
move efficient_packing from data_args to model_args
Former-commit-id: 7b61659c707480bcf8c802c73e10d12ad5b9b965
|
2024-07-02 18:37:55 +07:00 |
|
hiyouga
|
f0b01803ea
|
Update bug-report.yml
Former-commit-id: b92636feff19f144850d7741d8f3fa9fcfdb0580
|
2024-07-02 19:18:56 +08:00 |
|
hiyouga
|
f5c4841ff2
|
Update bug-report.yml
Former-commit-id: dc04e33b17dfb798eaee137eef08879a0b7114c7
|
2024-07-02 19:16:12 +08:00 |
|
hoshi-hiyouga
|
1e01283d81
|
Merge pull request #4651 from hzhaoy/add-telechat-1b
Add TeleChat-1B
Former-commit-id: 2da64665d3da9dc0084bb782c65e88bac21f45a1
|
2024-07-02 17:56:43 +08:00 |
|
hzhaoy
|
2196448c21
|
add TeleChat-1B
Former-commit-id: 1b81b43fc483a21e0c2985b98459ecf5137aa4c4
|
2024-07-02 17:49:04 +08:00 |
|
hiyouga
|
96a81ce89d
|
fix ppo callbacks
Former-commit-id: 54f1c67c2a802b1d8368a6d1837d4c9a729f2695
|
2024-07-02 17:34:56 +08:00 |
|
hoshi-hiyouga
|
a715490c2a
|
Merge branch 'main' into main
Former-commit-id: 7be442f37d53a0c6324728fa1fa8e2c84d7f0fa5
|
2024-07-01 21:01:09 +08:00 |
|
hiyouga
|
973cf8e980
|
tiny fix
Former-commit-id: 5dd2e5c3323f56420b5845a5ed28bcd9d4da5e41
|
2024-07-01 05:43:17 +08:00 |
|
hiyouga
|
4357e42391
|
tiny fix
Former-commit-id: 19e43c3a9ed771e991cb273d394ab28fb923f868
|
2024-07-01 03:55:20 +08:00 |
|
hiyouga
|
884b49e662
|
add eval acc
Former-commit-id: 7ffde76fbfb6192e3aac31ccc098f31ce89181ae
|
2024-07-01 03:51:20 +08:00 |
|
hiyouga
|
38c94d2e9c
|
Update label_issue.yml
Former-commit-id: fffa3defdda02ad579cb703c0704f94bad94f21a
|
2024-07-01 01:29:09 +08:00 |
|
hiyouga
|
67d2eb6b2a
|
fix #4402 #4617
Deprecate reserved_label_len arg
Former-commit-id: 4b6568984c0be4b31e7aa91b7c0d52b7f7b12b0b
|
2024-07-01 01:19:27 +08:00 |
|
hiyouga
|
b670fb57db
|
update readme
Former-commit-id: 7998d969bf942c91cf41a189e3941f6e04c81c6f
|
2024-07-01 00:22:52 +08:00 |
|
hiyouga
|
188b4be64d
|
fix #4398 #4592
Former-commit-id: 8c92d268903c00392c8bd75a731daa1f107d6202
|
2024-06-30 21:28:51 +08:00 |
|
hiyouga
|
889c042ecd
|
update npu docker
Former-commit-id: 2f4d5174205605b8821d4fb626283e07694ecf80
|
2024-06-30 21:05:31 +08:00 |
|
hiyouga
|
3c4f8eaa55
|
loose gemma2 attention
Former-commit-id: a0b645017a2de3d58b6cbc71bd91ec96fc7a818b
|
2024-06-29 01:42:14 +08:00 |
|
hiyouga
|
6a75d57060
|
update readme
Former-commit-id: 9f809c311af373508cb51b204ae54b047729a9dc
|
2024-06-28 06:55:19 +08:00 |
|
hiyouga
|
fda2cf677b
|
bf16 by default, gemma2 attns
Gemma2 finetuning cannot work until merging https://github.com/huggingface/transformers/pull/31674
Former-commit-id: da66c32c7be0adc28d2185b23e9f62d56acb961c
|
2024-06-28 06:00:26 +08:00 |
|