37 Commits

Author SHA1 Message Date
hiyouga
23f738fda5 update examples
Former-commit-id: 2efd9b6ba0
2024-04-23 18:29:46 +08:00
hiyouga
d8deb0f99e update readme and examples
Former-commit-id: a1f1fac33b
2024-04-22 00:37:32 +08:00
hiyouga
92e24a73cb remove extras
Former-commit-id: ddbd29d777
2024-04-22 00:35:41 +08:00
hiyouga
9e45f82be7 fix bug in galore optimizer
Former-commit-id: 5c62881c5a
2024-04-21 18:53:22 +08:00
hiyouga
ec81d45d27 fix mod stuff
Former-commit-id: f58425ab45
2024-04-21 18:11:10 +08:00
Marco
639297a5ef Added Mixture of Depths
Former-commit-id: 620add7b9f
2024-04-18 20:31:24 +02:00
hiyouga
0a94fab357 support badam for all stages
Former-commit-id: e3d8fc75eb
2024-04-16 17:44:48 +08:00
hoshi-hiyouga
507ab397f5 Update sft.sh
Former-commit-id: 57dcd91e17
2024-04-16 17:25:40 +08:00
Jonery
b3260c7456 resolve gradient checkpointing issue.
Former-commit-id: 7ecb61822b
2024-04-16 12:05:27 +08:00
Jonery
025f329445 Feature BAdam
Former-commit-id: 06c8908d3f
2024-04-15 23:15:27 +08:00
hiyouga
fb385b8c26 update examples
Former-commit-id: cce52351b5
2024-04-15 22:14:34 +08:00
khazic
652caa3cbd Upgrade README.md
Former-commit-id: fe5d3bb8f0
2024-04-13 20:50:49 +08:00
khazic
21c4a43085 Added specimens for single-card full parameter prediction
Former-commit-id: 47111ce506
2024-04-13 20:45:19 +08:00
hiyouga
5dd3c1ab79 update examples
Former-commit-id: b87f8f1519
2024-04-04 14:48:21 +08:00
hiyouga
86513f28dc update examples
Former-commit-id: fc7f1cc365
2024-04-02 21:09:25 +08:00
hiyouga
03c538ebb3 add zh readme
Former-commit-id: 7765f337c7
2024-04-02 20:58:45 +08:00
hiyouga
e341fa59fe update examples
Former-commit-id: f22eaeb5bc
2024-04-02 20:51:21 +08:00
hiyouga
9df316931b update examples
Former-commit-id: 31ffbde24d
2024-04-02 20:41:49 +08:00
hiyouga
135c4e3512 update readme
Former-commit-id: 11a6c1bad6
2024-04-02 20:37:37 +08:00
hiyouga
bf5ffeeae0 simplify readme
Former-commit-id: 92dab8a90b
2024-04-02 20:07:43 +08:00
hiyouga
cefe7f7bcf update webui
Former-commit-id: d0842f6828
2024-04-01 16:23:28 +08:00
hiyouga
2f878bde11 support ORPO
Former-commit-id: 17bf8a2c3a
2024-03-31 18:29:50 +08:00
hiyouga
89c400633a update trainers
Former-commit-id: 8c77b10912
2024-03-28 18:16:27 +08:00
hiyouga
300437a5e9 fix #2981
Former-commit-id: b29d5560f1
2024-03-26 17:53:04 +08:00
hiyouga
7999836fb6 support fsdp + qlora
Former-commit-id: 8408225162
2024-03-21 00:36:06 +08:00
hiyouga
8b8671817f improve lora+ impl.
Former-commit-id: 72367307df
2024-03-13 23:32:51 +08:00
齐保元
24c9277488 [FEATURE]: ADD LORA+ ALGORITHM
Former-commit-id: a0965cd62c
2024-03-13 19:43:27 +08:00
hiyouga
4a4e4b4354 support layerwise galore
Former-commit-id: 8664262cde
2024-03-10 00:24:11 +08:00
hiyouga
eb363b04b9 update examples
Former-commit-id: 4c00bcdcae
2024-03-09 02:30:37 +08:00
hiyouga
398c261c7c fix aqlm version
Former-commit-id: 10be2f0ecc
2024-03-09 00:09:09 +08:00
hiyouga
ccec17f773 fix example params
Former-commit-id: 8a45213440
2024-03-08 20:41:43 +08:00
hiyouga
5b50458acf fix galore
Former-commit-id: 33a4c24a8a
2024-03-08 00:44:51 +08:00
hiyouga
cb2bf680c9 add galore examples
Former-commit-id: 7230e1177d
2024-03-07 22:53:45 +08:00
hiyouga
34533b2f35 support vllm
Former-commit-id: d07ad5cc1c
2024-03-07 20:26:31 +08:00
hiyouga
8d386775f2 update examples
Former-commit-id: d1587c80de
2024-03-06 13:14:57 +08:00
hiyouga
8cf9842f7a add examples
Former-commit-id: 76f31b18eb
2024-03-05 03:16:35 +08:00
hiyouga
845e750abd add examples
Former-commit-id: 804c1e7083
2024-02-28 23:19:25 +08:00