yinpu aa7c07caf0 fix: avoid redundant normalization in DPO's SFT loss calculation (#6722)
Former-commit-id: 0f45982bac6b65533a94054ea5f792cb0f9e5a1f
2025-01-21 13:38:02 +08:00
..
2024-12-17 19:13:26 +00:00
2025-01-14 18:40:07 +08:00
2025-01-18 13:56:09 +08:00
2025-01-15 01:42:50 +08:00
2025-01-18 13:56:09 +08:00
2025-01-18 13:56:09 +08:00
2025-01-20 19:46:38 +08:00
2024-12-17 10:26:19 +00:00
2024-09-01 20:52:47 +08:00