Doc's Choice
Models that I personally recommend.
Text Generation • 357B • Updated • 62.8k • 1.02k
Note: My recommendation for the several-hundred-billion-parameter MoE size class. I run it in non-thinking mode with assistant prefill. It's a bit too slow to run in thinking mode locally on CPU+GPU or CPU alone.
deepseek-ai/DeepSeek-R1-0528
Text Generation • 685B • Updated • 566k • 2.39k
deepseek-ai/DeepSeek-V3-0324
Text Generation • 685B • Updated • 241k • 3.08k
Doctor-Shotgun/ML2-123B-Magnum-Diamond
Text Generation • 123B • Updated • 32 • 8
Note: Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!
Doctor-Shotgun/L3.3-70B-Magnum-Diamond
Text Generation • 71B • Updated • 381 • 3
Note: Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!
Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
Text Generation • 24B • Updated • 117 • 43
Note: Focusing on applying enough heat and pressure to dry, assistant-tuned models until they turn into creative writing gems!
Doctor-Shotgun/L3.3-70B-Magnum-Nexus
Text Generation • 71B • Updated • 217 • 9
Note: This is a merge of my various L3.3 Magnum tunes, leading to a more stable result. Works with or without prepending character names, and with or without prefill.
Doctor-Shotgun/L3.3-70B-Magnum-v5-SFT-Alpha
Text Generation • 71B • Updated • 2
Note: Out of the L3.3 Magnum tunes, this one is the mad and prompt-format-sensitive genius. It works best with prefill and no character names, which matches the formatting of most of the RP data in the set; it can be difficult to wrangle otherwise.
Doctor-Shotgun/L3.3-70B-Magnum-v4-SE
Text Generation • 71B • Updated • 333 • 16
Note: An epilogue, if you will, to the v4 series. With minor dataset updates and a new base, this model has been an upgrade over v4 72B in my own testing.
Sao10K/70B-L3.3-Cirrus-x1
Text Generation • 71B • Updated • 63 • 36
Note: Sao did it once again - a smart model with a refreshing prose style!