This website requires JavaScript.
Explore
Help
Register
Sign In
423A35C7
/
LLaMA-Factory
Watch
1
Star
0
Fork
0
You've already forked LLaMA-Factory
mirror of
https://github.com/hiyouga/LLaMA-Factory.git
synced
2025-08-05 05:02:50 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
LLaMA-Factory
/
src
/
llmtuner
/
tuner
History
hiyouga
b88f0b396c
support ppo score norm (trl 0.5.1.dev required)
...
Former-commit-id: 53e33418d02ee0f34c783e30ae510b811308c598
2023-08-18 12:02:42 +08:00
..
core
fix PPO trainer
#551
, update readme
2023-08-18 11:43:10 +08:00
dpo
update training resuming
2023-08-18 01:41:17 +08:00
ppo
support ppo score norm (trl 0.5.1.dev required)
2023-08-18 12:02:42 +08:00
pt
update training resuming
2023-08-18 01:41:17 +08:00
rm
fix ChatGLM2 ppo
#527
#528
2023-08-18 00:34:59 +08:00
sft
update training resuming
2023-08-18 01:41:17 +08:00
__init__.py
modify code structure
2023-08-02 23:17:36 +08:00
tune.py
support rope scaling,
fix
#475
#476
#478
2023-08-12 20:46:27 +08:00