Why are you uploading fake models

#1
by screamingiraffe - opened

Why are you uploading fake models

I am very confused, what justifies a "fake model"?

I am very confused, what justifies a "fake model"?

^^^

@screamingiraffe

Why are you uploading fake models

They aren't. These are distillations of larger reasoning models, named with a convention similar to the one DeepSeek used, e.g.:

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
^ That is Qwen2.5-Math-7B fine-tuned on DeepSeek-R1 traces

TeichAI has a much clearer naming convention:

[Base-Model]-[Data-Source]-Distill-[number of samples]

E.g., this is Qwen3-8B trained on 1000 samples from Gemini-3-Pro-Preview:

TeichAI/Qwen3-8B-Gemini-3-Pro-Preview-Distill-1000x
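To make the convention concrete, here is a rough sketch of how such a name could be split into its parts. It is not anything TeichAI publishes: the function name `parse_distill_name` and the `known_bases` list are hypothetical, and it assumes the sample count is always written as digits followed by `x`, as in the example above. Since both the base model and the data source contain hyphens, the sketch needs a list of known base models to find the boundary.

```python
import re
from typing import NamedTuple, Optional, Sequence


class DistillName(NamedTuple):
    base_model: str
    data_source: str
    num_samples: int


# Matches the trailing "-Distill-<N>x" part, e.g. "-Distill-1000x".
# Assumption: the count is always digits followed by a literal "x".
_SUFFIX = re.compile(r"^(?P<prefix>.+)-Distill-(?P<count>\d+)x$")


def parse_distill_name(
    repo_id: str, known_bases: Sequence[str]
) -> Optional[DistillName]:
    """Split '[Base-Model]-[Data-Source]-Distill-[N]x' into its parts.

    `known_bases` is a hypothetical caller-supplied list of base model
    names; hyphens alone cannot tell where the base model ends.
    Returns None if the name does not follow the convention.
    """
    name = repo_id.split("/")[-1]  # drop the "Org/" prefix if present
    match = _SUFFIX.match(name)
    if match is None:
        return None
    prefix = match.group("prefix")
    # Try longer base names first so "Qwen3-8B-Base" would win over "Qwen3-8B".
    for base in sorted(known_bases, key=len, reverse=True):
        if prefix.startswith(base + "-"):
            source = prefix[len(base) + 1:]
            return DistillName(base, source, int(match.group("count")))
    return None


if __name__ == "__main__":
    print(parse_distill_name(
        "TeichAI/Qwen3-8B-Gemini-3-Pro-Preview-Distill-1000x",
        known_bases=["Qwen3-8B"],
    ))
```

A DeepSeek-style name such as deepseek-ai/DeepSeek-R1-Distill-Qwen-7B puts the source first and has no sample count, so this sketch returns `None` for it.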

And they're even sharing the datasets they create:
https://huggingface.co/TeichAI/datasets

What are you contributing to the scene?

Yeah, I think he is being misled by the DeepSeek distills.

I think DeepSeek v3.2 was trained on ChatGPT's responses at some point, because the model seems convinced that it is ChatGPT.
Either way, the models are anything but fake. They are weak models trying to imitate strong ones 🦾
