Adrien Gallouët
angt
AI & ML interests
None yet
Recent Activity
posted
an
update
about 21 hours ago
Just published: Nano-vLLM meets Inference Endpoints
I show how to bind Nano-vLLM (supporting Qwen3-0.6B) to a web service — and deploy it easily on Hugging Face Inference Endpoints.
Minimalist engine, maximum fun!
https://huggingface.co/blog/angt/nano-vllm-meets-inference-endpoints
published
an
article
about 22 hours ago
Nano-vLLM meets Inference Endpoints