37 Commits

Author SHA1 Message Date
hiyouga
d2df4c22ab support mllm hf inference
Former-commit-id: e057c8de486bfbc829240924f9238d6212c917f1
2024-04-26 05:34:58 +08:00
hiyouga
0a94fab357 support badam for all stages
Former-commit-id: e3d8fc75eb2cfc54efd35bfd9ad6c4ac5acc458c
2024-04-16 17:44:48 +08:00
hiyouga
88d9f47a0b fix #3116
Former-commit-id: ce77d98872fa377fd4bc961701b07982f4b51491
2024-04-03 14:47:59 +08:00
hiyouga
829cf6458a fix #3083
Former-commit-id: 4a6ca621c09d179561acc5957c8c911a4e44184c
2024-04-01 22:53:52 +08:00
hiyouga
fc066cad7f release v0.6.1
Former-commit-id: ca793028c69433eae405009c5ebb790c6c2d40c4
2024-03-29 11:36:08 +08:00
hiyouga
e4f3d583df fix #2982
Former-commit-id: 8d603f8820efd1617557f2bc5d9674143abe7c57
2024-03-28 20:22:31 +08:00
hiyouga
89c400633a update trainers
Former-commit-id: 8c77b1091296e204dc3c8c1f157c288ca5b236bd
2024-03-28 18:16:27 +08:00
hiyouga
8717e98200 fix #2777 #2895
Former-commit-id: 9bec3c98a22c91b1c28fda757db51eb780291641
2024-03-20 17:59:45 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde3919e10eaecbd66e8c5d356856362e
2024-03-10 00:24:11 +08:00
hiyouga
868444e124 allow non-packing pretraining
Former-commit-id: bdb496644ce2c18806fc4fdae1fedcb3e5b5f808
2024-03-09 22:21:46 +08:00
hiyouga
c561b268ef fix #2756 , patch #2746
Former-commit-id: e8dd38b7fdf8e172745d2538eb103895f2839c38
2024-03-09 02:01:26 +08:00
hiyouga
2c010c72b8 support galore
Former-commit-id: 28f78621883917425fabe49f5473778111012127
2024-03-07 22:41:36 +08:00
hiyouga
d1e6e02461 fix #2649
Former-commit-id: 4e5fae2fac85227641bd16159cf296a32e0b18b4
2024-03-01 13:02:41 +08:00
stephen
1b4d54b873 update project_kwargs for ppo config
Former-commit-id: 42c23798f27977af777587ded7f4845010f0184a
2024-02-21 13:47:38 +08:00
hiyouga
7cc0721028 fix #2189
Former-commit-id: b988ce0a0c164213ad2e52efadd6aa5b71fd39c5
2024-02-04 00:47:37 +08:00
hiyouga
b27e91222c format style
Former-commit-id: 638234ceee1b19716e45b6e5f4ea54d9122da4df
2024-01-20 20:15:56 +08:00
hiyouga
2f7684a8ee fix tests
Former-commit-id: f6d6e00337ebef8740d180836dcb18d0e6a3c59a
2024-01-20 19:58:04 +08:00
hiyouga
4e3bfb799d support function calling
Former-commit-id: d9f1cae35150cce594a7abd96dd2beb811fa33f2
2024-01-18 09:54:23 +08:00
hiyouga
6378864390 fix #2161
Former-commit-id: 898ec3696a4d2db48485fb7263f866599437d626
2024-01-11 17:04:13 +08:00
hiyouga
61960189b2 fix #1789
Former-commit-id: 4571068e1e00dc234c9131185fe0924c726add84
2024-01-09 18:31:27 +08:00
hiyouga
d0946f08db fix ppo trainer
Former-commit-id: 5431be42f9c43095d478f2250fac64ef189eb3ad
2023-12-28 18:09:28 +08:00
hiyouga
8154b4bdf6 fix #1742
Former-commit-id: 870426ff70c060213ac283b10a9b1f4bf71679ef
2023-12-16 20:50:45 +08:00
hiyouga
027caabbb6 fix ppo trainer save logic
Former-commit-id: d3dccd0693ede18a99f04780f2fd6e3a89810405
2023-12-04 19:00:19 +08:00
hiyouga
6493558c3b fix bug
Former-commit-id: 8b681ee273c28813c599d9d55b2a3540c8ac257d
2023-12-03 21:40:40 +08:00
hiyouga
64eead3fb1 ppo support rm server
Former-commit-id: 747db4017291b0eb91946f57011bb31659056037
2023-12-03 21:38:51 +08:00
hiyouga
1cb390b9b2 implement rm server #1543
Former-commit-id: 7df4f3ab206fddb462f6ed865eaf04234fd72ed6
2023-12-03 20:52:54 +08:00
hiyouga
3d291a82d3 fix #1597
Former-commit-id: 327d7f7efe1fefe4bf4646c07fc4917a42c13383
2023-11-30 21:47:06 +08:00
hiyouga
ba6d290d0b fix #1668
Former-commit-id: 1585962eb7ed042890d4c56422aae749c669dda8
2023-11-30 21:02:00 +08:00
hiyouga
ecfc7d1b50 fix #1658
Former-commit-id: 77d1b14fc2d9703d15bbd879f67df037db9fbb28
2023-11-28 20:57:24 +08:00
hiyouga
f06c4c8f7a update ppo trainer
Former-commit-id: 5021062493ed63ad1f6133cfb543e4e7f528d2cc
2023-11-20 21:39:15 +08:00
hoshi-hiyouga
d72f123851 Merge pull request #1553 from hannlp/hans
Change the default argument settings for PPO training

Former-commit-id: 48211e3799a16de946360930d3d92f5a40e9d12d
2023-11-20 20:32:55 +08:00
hiyouga
682d81caa9 fix #1567
Former-commit-id: 99a3f06377d2886c4000ce7e3583b12ca965534d
2023-11-20 18:46:36 +08:00
Yuchen Han
a419122179 Update workflow.py
Former-commit-id: eeb5249d0b6ce0816e1fa47afc3a853c7b267cbf
2023-11-17 00:16:27 -08:00
hiyouga
678052a7ef fix rlhf callback
Former-commit-id: 1817ffc86fe3463ea91e9359c0e3611979a9d53e
2023-11-16 03:26:19 +08:00
hiyouga
eb5a852dd5 fix import bug
Former-commit-id: 35b91ea34caade45dd51813b94da5177b852aa4c
2023-11-16 02:27:03 +08:00
hiyouga
f441932bd1 support full-parameter PPO
Former-commit-id: ce783036001397a20b0b4c5da2fea6d0c03389d2
2023-11-16 02:08:04 +08:00
hiyouga
06a4820836 disentangle model from tuner and rename modules
Former-commit-id: 4736344eb1595ee023a50d49e8118f4eee46305f
2023-11-15 16:29:09 +08:00