hiyouga
|
0a94fab357
|
support badam for all stages
Former-commit-id: e3d8fc75eb2cfc54efd35bfd9ad6c4ac5acc458c
|
2024-04-16 17:44:48 +08:00 |
|
hiyouga
|
bf5ffeeae0
|
simplify readme
Former-commit-id: 92dab8a90bdd82a72a06559943467b56dde12c71
|
2024-04-02 20:07:43 +08:00 |
|
hiyouga
|
829cf6458a
|
fix #3083
Former-commit-id: 4a6ca621c09d179561acc5957c8c911a4e44184c
|
2024-04-01 22:53:52 +08:00 |
|
hiyouga
|
69e1d39832
|
fix IPO and ORPO loss
Former-commit-id: 5b9b40403d59982431a526e337f31d394f8b882b
|
2024-04-01 14:37:53 +08:00 |
|
hiyouga
|
e7ade84bba
|
fix plots
Former-commit-id: 5907216a1cc7a75a43d681ede410c2fba7fb7b92
|
2024-03-31 19:43:48 +08:00 |
|
hiyouga
|
2f878bde11
|
support ORPO
Former-commit-id: 17bf8a2c3a7bb5b83071c8659cfd8751e894e692
|
2024-03-31 18:29:50 +08:00 |
|
hiyouga
|
fc066cad7f
|
release v0.6.1
Former-commit-id: ca793028c69433eae405009c5ebb790c6c2d40c4
|
2024-03-29 11:36:08 +08:00 |
|
hiyouga
|
89c400633a
|
update trainers
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
|
2024-03-28 18:16:27 +08:00 |
|
hoshi-hiyouga
|
ae9ad13f2a
|
fix ds optimizer
Former-commit-id: 3bcd41b639899e72bcabc51d59bac8967af19899
|
2024-03-26 23:39:56 +08:00 |
|
hiyouga
|
ec94e5e876
|
fix #2961
Former-commit-id: 511f6754026fbbf48bd481018015338a6a3ad92f
|
2024-03-26 17:26:14 +08:00 |
|
hiyouga
|
8717e98200
|
fix #2777 #2895
Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641
|
2024-03-20 17:59:45 +08:00 |
|
hiyouga
|
4a4e4b4354
|
support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
|
2024-03-10 00:24:11 +08:00 |
|
hiyouga
|
868444e124
|
allow non-packing pretraining
Former-commit-id: bdb496644ce2c18806fc4fdae1fedcb3e5b5f808
|
2024-03-09 22:21:46 +08:00 |
|
hiyouga
|
2c010c72b8
|
support galore
Former-commit-id: 28f78621883917425fabe49f5473778111012127
|
2024-03-07 22:41:36 +08:00 |
|
hiyouga
|
d1e6e02461
|
fix #2649
Former-commit-id: 4e5fae2fac85227641bd16159cf296a32e0b18b4
|
2024-03-01 13:02:41 +08:00 |
|
hiyouga
|
2f738a1db6
|
fix #2532
Former-commit-id: 3cc10a01a792a92b99b952a45bb21c25097fccf6
|
2024-02-21 21:55:14 +08:00 |
|
hiyouga
|
b27e91222c
|
format style
Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df
|
2024-01-20 20:15:56 +08:00 |
|
hiyouga
|
2f7684a8ee
|
fix tests
Former-commit-id: f6d6e00337ebef8740d180836dcb18d0e6a3c59a
|
2024-01-20 19:58:04 +08:00 |
|
hiyouga
|
69e8925249
|
support longlora for main branch
Former-commit-id: 38af076a75c33da26d641780820694e4b7342d92
|
2024-01-20 19:25:22 +08:00 |
|
hiyouga
|
4e3bfb799d
|
support function calling
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
|
2024-01-18 09:54:23 +08:00 |
|
hiyouga
|
69d966eb1f
|
fix #2164
Former-commit-id: 4b2d11ec28130ee6c21dc85614ffcee61a4a5847
|
2024-01-12 00:27:57 +08:00 |
|
hiyouga
|
938c4cb132
|
fix dpo trainer
Former-commit-id: 074745b1707f98e092749f57041d866c5d55bc04
|
2023-12-23 01:51:55 +08:00 |
|
hiyouga
|
f0d405f392
|
support unsloth
Former-commit-id: 7aad0b889d9a316fffd65f32a419078418fc0986
|
2023-12-23 00:14:33 +08:00 |
|
hiyouga
|
4e75ca1222
|
support dpo-ftx
Former-commit-id: b87c74289d523ef88611b376074199ffd03cf103
|
2023-12-16 19:21:41 +08:00 |
|
hiyouga
|
1cb390b9b2
|
implement rm server #1543
Former-commit-id: 7df4f3ab206fddb462f6ed865eaf04234fd72ed6
|
2023-12-03 20:52:54 +08:00 |
|
hiyouga
|
48d6d925f7
|
fix #1558
Former-commit-id: 1740131d63d32aefc0370441baf4716ddb5ebcfe
|
2023-11-19 14:15:47 +08:00 |
|
hiyouga
|
eb5a852dd5
|
fix import bug
Former-commit-id: 35b91ea34caade45dd51813b94da5177b852aa4c
|
2023-11-16 02:27:03 +08:00 |
|
hiyouga
|
f441932bd1
|
support full-parameter PPO
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
|
2023-11-16 02:08:04 +08:00 |
|
hiyouga
|
06a4820836
|
disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
|
2023-11-15 16:29:09 +08:00 |
|