hiyouga
bf5ffeeae0
simplify readme
...
Former-commit-id: 92dab8a90bdd82a72a06559943467b56dde12c71
2024-04-02 20:07:43 +08:00
hiyouga
f4be51f356
add moe aux loss control #3085
...
Former-commit-id: b267aeb53fc49d2eeb0f3fc5ebe55e643f5db377
2024-04-02 14:26:31 +08:00
hiyouga
c7104f8fab
fix #3022
...
Former-commit-id: 9ddbe2866a4a4433d7635659a5635d16c59800b1
2024-04-02 13:58:39 +08:00
hiyouga
ab6476a9a4
set dev version
...
Former-commit-id: dd73a0c2489a69e32afc2131407ee62be7779d8b
2024-04-01 23:24:08 +08:00
hiyouga
829cf6458a
fix #3083
...
Former-commit-id: 4a6ca621c09d179561acc5957c8c911a4e44184c
2024-04-01 22:53:52 +08:00
hiyouga
8d987b7af7
add qwen1.5 moe
...
Former-commit-id: 54b7d349088a828d32551fde56b3467a82df1b9b
2024-04-01 21:49:40 +08:00
hiyouga
34f1de0574
fix #3077
...
Former-commit-id: aee634cd20e6dfdfbe2fbb47ae57f62b2da2bf9a
2024-04-01 21:35:18 +08:00
hiyouga
b7468ea0a8
support infer 4bit model on GPUs #3023
...
Former-commit-id: eb259cc5738dfb383e4cc5d32579501c580e11b1
2024-04-01 17:34:04 +08:00
hiyouga
cefe7f7bcf
update webui
...
Former-commit-id: d0842f682897cb227cda9e9747f42a7281970463
2024-04-01 16:23:28 +08:00
hiyouga
bd52e2b404
fix ORPO loss
...
Former-commit-id: 816d71414617590f95de89a49f38358e597ed121
2024-04-01 14:42:41 +08:00
hiyouga
69e1d39832
fix IPO and ORPO loss
...
Former-commit-id: 5b9b40403d59982431a526e337f31d394f8b882b
2024-04-01 14:37:53 +08:00
hiyouga
e7ade84bba
fix plots
...
Former-commit-id: 5907216a1cc7a75a43d681ede410c2fba7fb7b92
2024-03-31 19:43:48 +08:00
hiyouga
b873dcb09d
use log1p in orpo loss
...
https://github.com/huggingface/trl/pull/1491
Former-commit-id: 68aaa4904b8dfb6cc791fdcee613edc681a8a198
2024-03-31 19:27:08 +08:00
hiyouga
ddad9be81d
update readme
...
Former-commit-id: 099db6acc08f9a0da18f642455f29012607a920f
2024-03-31 18:46:34 +08:00
hiyouga
35b3516812
support orpo in webui
...
Former-commit-id: 5195add324194d2583db40365522e5e2916592b6
2024-03-31 18:34:59 +08:00
hiyouga
2f878bde11
support ORPO
...
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
2024-03-31 18:29:50 +08:00
hiyouga
3cf35e57db
tiny fix
...
Former-commit-id: 27776c34741ca0c58ed793bcdf1acd5e4a81fb39
2024-03-31 00:10:29 +08:00
marko1616
5721074af1
fix blank line contains whitespace
...
Former-commit-id: d9a5134617d494ef13ba73f9c540123e89a8c29c
2024-03-30 23:46:55 +08:00
marko1616
67c05c2031
Fix Llama model save for full param train
...
Former-commit-id: eb178eaff390a1dc342cc35ab8c7820d654f3717
2024-03-30 23:45:04 +08:00
hiyouga
3bf6dde3a5
support save args in webui #2807 #3046
...
some ideas are borrowed from @marko1616
Former-commit-id: 7a086ed33388551e0f835adf61fac638f96ed188
2024-03-30 23:09:12 +08:00
hiyouga
808ad2071f
upgrade gradio to 4.21.0
...
Former-commit-id: 831c5321ac9b5ec91d9cc1dbcc87967dcdc486f5
2024-03-30 20:37:08 +08:00
hiyouga
fc066cad7f
release v0.6.1
...
Former-commit-id: ca793028c69433eae405009c5ebb790c6c2d40c4
2024-03-29 11:36:08 +08:00
hiyouga
e4f3d583df
fix #2982
...
Former-commit-id: 8d603f8820efd1617557f2bc5d9674143abe7c57
2024-03-28 20:22:31 +08:00
hiyouga
eac2a5b1d3
fix #3010
...
Former-commit-id: b19c14870d30c57fbea81e9cfa737d762922c54b
2024-03-28 18:31:17 +08:00
hiyouga
89c400633a
update trainers
...
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
2024-03-28 18:16:27 +08:00
hoshi-hiyouga
ae9ad13f2a
fix ds optimizer
...
Former-commit-id: 3bcd41b639899e72bcabc51d59bac8967af19899
2024-03-26 23:39:56 +08:00
hiyouga
c311375b50
fix bug
...
Former-commit-id: 3164b4f11b72684c8aa2105037cb36c47b6acfd4
2024-03-26 17:30:12 +08:00
hiyouga
ec94e5e876
fix #2961
...
Former-commit-id: 511f6754026fbbf48bd481018015338a6a3ad92f
2024-03-26 17:26:14 +08:00
hiyouga
62312716d9
release v0.6.0 (real)
...
Former-commit-id: ba70aca8fb1275ed3b6af69a7e639303d7201da4
2024-03-25 23:37:48 +08:00
hiyouga
196a33cca4
tiny fix
...
Former-commit-id: 98a42cbdaa4a90dbe5edda1c412c17e628324f52
2024-03-25 23:28:52 +08:00
hiyouga
b18749fb01
add arg check
...
Former-commit-id: 1484f76a95bcf40e4c668d52fed68d68c9745a75
2024-03-25 22:42:58 +08:00
hiyouga
27151b8c65
release v0.6.0
...
Former-commit-id: 6f2b563f125fe51ee32753e58f902a4911ab757c
2024-03-25 22:38:56 +08:00
hiyouga
2d73831177
tiny fix
...
Former-commit-id: 558a538724db373319a6bba26c76943bac1b5063
2024-03-25 21:18:08 +08:00
marko1616
1d0e24549f
pass ruff check
...
Former-commit-id: c8f0d99704308ac1886b16e437dea601eb20658d
2024-03-24 16:12:10 +08:00
marko1616
a68101cbbb
fix Llama lora merge crash
...
Former-commit-id: 6f080fdba3f99145b7722964dd027179dc2eeb2b
2024-03-24 03:06:11 +08:00
marko1616
645c27e5e2
fix Llama lora merge crash
...
Former-commit-id: 51349ea1ccbf3e53b408037986abd850a0963468
2024-03-24 02:55:23 +08:00
marko1616
c083708433
fix Llama lora merge crash
...
Former-commit-id: c1e2c4ea45ad210e776a192e05e226b34d764135
2024-03-24 02:44:35 +08:00
hiyouga
84c3d509fa
fix #2936
...
Former-commit-id: 140ad4ad567de8817a14972175e668971bae6a0a
2024-03-24 00:43:21 +08:00
hiyouga
75829c8699
fix #2928
...
Former-commit-id: 7afbc85daee295cf38dcee9ded5afd87b2c4cfd1
2024-03-24 00:34:54 +08:00
hiyouga
58aa576ae5
fix #2941
...
Former-commit-id: a1c8c98c5fecfc0dd0ed1be33ee8dd2ade05b708
2024-03-24 00:28:44 +08:00
hiyouga
7999836fb6
support fsdp + qlora
...
Former-commit-id: 84082251621e1470b3b5406a56d0a967780a1804
2024-03-21 00:36:06 +08:00
hiyouga
8717e98200
fix #2777 #2895
...
Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641
2024-03-20 17:59:45 +08:00
hiyouga
cf149bf43c
fix #2346
...
Former-commit-id: 7b8f5029018f0481f7da83cc5ee4408d95c9beb2
2024-03-20 17:56:33 +08:00
hiyouga
3d483e0914
fix packages
...
Former-commit-id: 8e04794b2da067a4123b9d7091a54c5647f44244
2024-03-17 22:32:03 +08:00
hiyouga
a5537f3ee8
fix patcher
...
Former-commit-id: 85c376fc1e0bcc854ed6e70e6455a0b00b341655
2024-03-15 19:18:42 +08:00
hoshi-hiyouga
30765baa91
Merge pull request #2849 from S3Studio/DockerizeSupport
...
Improve Dockerize support
Former-commit-id: 113cc047198325b51dac50d8a7ea70396c51e0d9
2024-03-15 19:16:02 +08:00
hiyouga
06860e8f0f
fix export
...
Former-commit-id: 6bc2c23b6d26b52f54ac37fa6149e6eb3cc18ee6
2024-03-15 15:06:30 +08:00
S3Studio
46ef7416e6
Use official Nvidia base image
...
Note that the flash-attn library is installed in this image and the qwen model will use it automatically.
However, if the host machine's GPU is not compatible with the library, an exception will be raised during the training process as follows:
FlashAttention only supports Ampere GPUs or newer.
So if the --flash_attn flag is not set, an additional patch for the qwen model's config is necessary to set the default value of use_flash_attn from "auto" to False.
Former-commit-id: e75407febdec086f2bdca723a7f69a92b3b1d63f
2024-03-15 08:59:13 +08:00
hiyouga
7ef49586be
tiny fix
...
Former-commit-id: 6ebde4f23e761b8a3e3ea6ca6dff249e657608a1
2024-03-14 21:19:06 +08:00
hiyouga
2cf95d4efe
fix export
...
Former-commit-id: 3b4a59bfb1866a270b9934a4a2303197ffdab531
2024-03-14 18:17:01 +08:00