HN Reader
Top
New
Best
Ask
Show
Jobs
Top
New
Best
Ask
Show
Jobs
Show HN: RLHF and Lora finetuning to mistralai 7B with DeepSpeed learning
1
genji970
0
7/30/2025, 8:10:09 PM
github.com ↗
Using multiple gpus, training 7B model with lora and RLHF with external dataset.
Comments (0)
No comments yet
No comments yet