Show HN: RLHF and LoRA fine-tuning of Mistral 7B with DeepSpeed

1 genji970 0 7/30/2025, 8:10:09 PM github.com ↗
Trains a 7B model across multiple GPUs using LoRA and RLHF on an external dataset.
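
For readers wondering why LoRA makes a 7B model practical to train, here is a rough back-of-the-envelope sketch, not code from the linked repo. It counts the trainable parameters LoRA adds to a single weight matrix. The function names, the 4096 x 4096 projection shape, and the rank of 8 are illustrative assumptions; the repo's actual configuration may differ.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix.

    LoRA freezes W and learns a low-rank update B @ A, where
    A has shape (rank, d_in) and B has shape (d_out, rank).
    """
    return rank * d_in + d_out * rank


def full_param_count(d_in: int, d_out: int) -> int:
    """Parameters in the frozen dense weight matrix itself."""
    return d_in * d_out


# Illustrative values only (assumptions, not taken from the repo):
# a square 4096 x 4096 projection and a LoRA rank of 8.
hidden = 4096
rank = 8
lora = lora_param_count(hidden, hidden, rank)
full = full_param_count(hidden, hidden)
print(f"LoRA: {lora:,} trainable vs {full:,} frozen ({lora / full:.2%})")
```

Under these assumed numbers the adapter is well under 1% of the size of the matrix it modifies, so gradients and optimizer state are only needed for that small fraction of the weights.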
