15 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| hiyouga | 89c400633a | update trainers (Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd) | 2024-03-28 18:16:27 +08:00 |
| hoshi-hiyouga | ae9ad13f2a | fix ds optimizer (Former-commit-id: 3bcd41b639899e72bcabc51d59bac8967af19899) | 2024-03-26 23:39:56 +08:00 |
| hiyouga | ec94e5e876 | fix #2961 (Former-commit-id: 511f6754026fbbf48bd481018015338a6a3ad92f) | 2024-03-26 17:26:14 +08:00 |
| hiyouga | 8717e98200 | fix #2777 #2895 (Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641) | 2024-03-20 17:59:45 +08:00 |
| hiyouga | 868444e124 | allow non-packing pretraining (Former-commit-id: bdb496644ce2c18806fc4fdae1fedcb3e5b5f808) | 2024-03-09 22:21:46 +08:00 |
| hiyouga | 2f738a1db6 | fix #2532 (Former-commit-id: 3cc10a01a792a92b99b952a45bb21c25097fccf6) | 2024-02-21 21:55:14 +08:00 |
| hiyouga | b27e91222c | format style (Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df) | 2024-01-20 20:15:56 +08:00 |
| hiyouga | 69e8925249 | support longlora for main branch (Former-commit-id: 38af076a75c33da26d641780820694e4b7342d92) | 2024-01-20 19:25:22 +08:00 |
| hiyouga | 4e3bfb799d | support function calling (Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2) | 2024-01-18 09:54:23 +08:00 |
| hiyouga | 69d966eb1f | fix #2164 (Former-commit-id: 4b2d11ec28130ee6c21dc85614ffcee61a4a5847) | 2024-01-12 00:27:57 +08:00 |
| hiyouga | 938c4cb132 | fix dpo trainer (Former-commit-id: 074745b1707f98e092749f57041d866c5d55bc04) | 2023-12-23 01:51:55 +08:00 |
| hiyouga | f0d405f392 | support unsloth (Former-commit-id: 7aad0b889d9a316fffd65f32a419078418fc0986) | 2023-12-23 00:14:33 +08:00 |
| hiyouga | 4e75ca1222 | support dpo-ftx (Former-commit-id: b87c74289d523ef88611b376074199ffd03cf103) | 2023-12-16 19:21:41 +08:00 |
| hiyouga | f441932bd1 | support full-parameter PPO (Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2) | 2023-11-16 02:08:04 +08:00 |
| hiyouga | 06a4820836 | disentangle model from tuner and rename modules (Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f) | 2023-11-15 16:29:09 +08:00 |