DUAL-GPO/phi-2-gpo-newSFT-b0.001-v7-lightai-i1-merged-renew Text Generation • Updated about 2 hours ago
S4nto/lora-dpo-finetuned-model-beta-0.5-rate-1e6-stage2-iter40000-sft Text Generation • Updated about 2 hours ago
simonycl/self-seq-Meta-Llama-3-8B-flancot_full_sit_llama_70b Text Generation • Updated about 1 hour ago
simonycl/self-seq-Meta-Llama-3-8B-flancot_full_it_llama_70b Text Generation • Updated about 1 hour ago