Commit Graph

29 Commits

Author SHA1 Message Date
hiyouga
0a94fab357 support badam for all stages
Former-commit-id: e3d8fc75eb
2024-04-16 17:44:48 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90b
2024-04-02 20:07:43 +08:00
hiyouga
829cf6458a fix #3083
Former-commit-id: 4a6ca621c0
2024-04-01 22:53:52 +08:00
hiyouga
69e1d39832 fix IPO and ORPO loss
Former-commit-id: 5b9b40403d
2024-04-01 14:37:53 +08:00
hiyouga
e7ade84bba fix plots
Former-commit-id: 5907216a1c
2024-03-31 19:43:48 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a
2024-03-31 18:29:50 +08:00
hiyouga
fc066cad7f release v0.6.1
Former-commit-id: ca793028c6
2024-03-29 11:36:08 +08:00
hiyouga
89c400633a update trainers
Former-commit-id: 8c77b10912
2024-03-28 18:16:27 +08:00
hoshi-hiyouga
ae9ad13f2a fix ds optimizer
Former-commit-id: 3bcd41b639
2024-03-26 23:39:56 +08:00
hiyouga
ec94e5e876 fix #2961
Former-commit-id: 511f675402
2024-03-26 17:26:14 +08:00
hiyouga
8717e98200 fix #2777 #2895
Former-commit-id: 9bec3c98a2
2024-03-20 17:59:45 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde
2024-03-10 00:24:11 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644c
2024-03-09 22:21:46 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f7862188
2024-03-07 22:41:36 +08:00
hiyouga
d1e6e02461 fix #2649
Former-commit-id: 4e5fae2fac
2024-03-01 13:02:41 +08:00
hiyouga
2f738a1db6 fix #2532
Former-commit-id: 3cc10a01a7
2024-02-21 21:55:14 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee
2024-01-20 20:15:56 +08:00
hiyouga
2f7684a8ee fix tests
Former-commit-id: f6d6e00337
2024-01-20 19:58:04 +08:00
hiyouga
69e8925249 support longlora for main branch
Former-commit-id: 38af076a75
2024-01-20 19:25:22 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae351
2024-01-18 09:54:23 +08:00
hiyouga
69d966eb1f fix #2164
Former-commit-id: 4b2d11ec28
2024-01-12 00:27:57 +08:00
hiyouga
938c4cb132 fix dpo trainer
Former-commit-id: 074745b170
2023-12-23 01:51:55 +08:00
hiyouga
f0d405f392 support unsloth
Former-commit-id: 7aad0b889d
2023-12-23 00:14:33 +08:00
hiyouga
4e75ca1222 support dpo-ftx
Former-commit-id: b87c74289d
2023-12-16 19:21:41 +08:00
hiyouga
1cb390b9b2 implement rm server #1543
Former-commit-id: 7df4f3ab20
2023-12-03 20:52:54 +08:00
hiyouga
48d6d925f7 fix #1558
Former-commit-id: 1740131d63
2023-11-19 14:15:47 +08:00
hiyouga
eb5a852dd5 fix import bug
Former-commit-id: 35b91ea34c
2023-11-16 02:27:03 +08:00
hiyouga
f441932bd1 support full-parameter PPO
Former-commit-id: ce78303600
2023-11-16 02:08:04 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1
2023-11-15 16:29:09 +08:00