6 Commits

Author SHA1 Message Date
hoshi-hiyouga
21df5f0bd0 Update supervised.py
Former-commit-id: 8cecade7082a52f413517ea20b1c5dd812db8e53
2024-06-07 03:38:04 +08:00
ylfeng
62d55b71a3 remove empty line
Former-commit-id: b47e3174472f458a3a8b84a66b475da8fce6db79
2024-05-31 21:43:08 +08:00
ylfeng
0feb2ad35c fix eos
Former-commit-id: 84aee579013f0c095a918a8c61611ccbb1d7fc84
2024-05-31 21:40:41 +08:00
ylfeng
8350e508d3 supervised packing with greedy knapsack algorithm
Former-commit-id: f9db439cb7511b12aa3524d5fdcc45864aebda91
2024-05-31 15:33:54 +08:00
hiyouga
df33548b39 update readme
Former-commit-id: 5581cb2e4e59f3f8109e2acd4611789f9e50bfca
2024-05-27 18:14:02 +08:00
hiyouga
3e729798df refactor data preprocessing, fix mllm rlhf
Former-commit-id: 3a023bca2a502810a436cfba7708df164754ea62
2024-05-24 04:08:25 +08:00