Active filters: dpo
tsavage68/IE_M2_1000steps_1e5rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_M2_1000steps_1e5rate_03beta_SFT • Text Generation • 7B • Updated • 3
DUAL-GPO/zephyr-7b-ipo-10k-40k-0.001-i1 • Updated • 10
Katayoon/VPO-Pess-Zephyr-7B-iter-1 • 7B • Updated • 4
tsavage68/IE_L3_1000steps_1e5rate_01beta_cSFTDPO • Text Generation • 8B • Updated • 4
tsavage68/IE_M2_1000steps_1e5rate_05beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_M2_1000steps_1e6rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_M2_1000steps_1e6rate_03beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_M2_1000steps_1e6rate_05beta_cSFTDPO • Text Generation • 7B • Updated • 3
Katayoon/VPO-Pess-Zephyr-7B-iter-2 • 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e5rate_03beta_SFT • Text Generation • 8B • Updated • 3
tsavage68/IE_M2_1000steps_1e7rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_L3_1000steps_1e5rate_05beta_cSFTDPO • Text Generation • 8B • Updated • 3
tsavage68/IE_M2_1000steps_1e7rate_03beta_SFT • Text Generation • 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e6rate_01beta_cSFTDPO • Text Generation • 8B • Updated • 4
tsavage68/IE_L3_1000steps_1e6rate_03beta_cSFTDPO • Text Generation • 8B • Updated • 3
tsavage68/IE_M2_1000steps_1e7rate_05beta_cSFTDPO • Text Generation • 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e6rate_05beta_cSFTDPO • Text Generation • 8B • Updated • 3
tsavage68/IE_M2_1000steps_1e8rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_L3_1000steps_1e7rate_01beta_cSFTDPO • Text Generation • 8B • Updated • 4
tsavage68/IE_M2_1000steps_1e8rate_03beta_cSFTDPO • Text Generation • 7B • Updated • 4
tsavage68/IE_L3_1000steps_1e7rate_03beta_cSFTDPO • Text Generation • 8B • Updated • 4
tsavage68/IE_M2_1000steps_1e8rate_05beta_cSFTDPO • Text Generation • 7B • Updated • 4
Katayoon/VPO-Pess-Zephyr-7B-iter-3 • 7B • Updated • 4
tsavage68/IE_M2_100steps_1e7rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e7rate_05beta_cSFTDPO • Text Generation • 8B • Updated • 4
tsavage68/IE_M2_50steps_1e7rate_03beta_SFT • Text Generation • 7B • Updated • 4
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge • Text Generation • 3B • Updated • 5
tsavage68/IE_M2_350steps_1e8rate_01beta_cSFTDPO • Text Generation • 7B • Updated • 3
tsavage68/IE_M2_350steps_1e8rate_03beta_cSFTDPO • Text Generation • 7B • Updated • 4