I run Qwen3-Coder 480B locally on my Z8, with a 1-million token context window. It’s the equivalent of parallel-parking a Nimitz-class carrier in a kiddie pool. Thanks to whatever dark pact the llama.cpp, CUDA, and kernel folks signed, hybrid inferencing + VRAM↔RAM offload let me stream the model’s synapses across Xeon, RAM, and four lonely A6000s without summoning either the OOM killer or a small house fire.
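For the curious, the recipe is basically "put as many layers on the GPUs as will fit, leave the rest on the Xeon". Here's a minimal sketch of that with llama-cpp-python; the GGUF filename, layer count, and context size below are illustrative assumptions, not my exact settings.

```python
from llama_cpp import Llama

# Hypothetical GGUF quant of Qwen3-Coder-480B; path and numbers are placeholders.
llm = Llama(
    model_path="qwen3-coder-480b-a35b-q4_k_m.gguf",
    n_gpu_layers=40,                        # layers that fit in VRAM; the rest run from system RAM on the CPU
    tensor_split=[0.25, 0.25, 0.25, 0.25],  # spread the GPU share evenly across the four A6000s
    n_ctx=262144,                           # long context; the full 1M window needs a lot more memory than this
    n_threads=32,                           # CPU threads handling the offloaded layers
)

out = llm("Write a function that parses a CSV line.", max_tokens=256)
print(out["choices"][0]["text"])
```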
Qwen2.5-Omni is soooo good that people build multimodal reasoning models off of it 🥹
> KE-Team/Ke-Omni-R-3B is an open-source audio reasoning model, SOTA on average across benchmarks, based on Qwen/Qwen2.5-Omni-3B 🗣️
> Haoz0206/Omni-R1 is a video reasoning model with pixel-level grounding (see below), and it's super competitive ⏯️ based on Qwen/Qwen2.5-Omni-7B
I've got my hands on an AMD Instinct MI100. Used, it's about the same price as a V100, but on paper it has more compute (V100 14 TFLOPS vs MI100 23 TFLOPS FP32), and its HBM has a faster clock, so memory bandwidth is 1.2 TB/s. For quantized inference it's a beast (the MI50 was also surprisingly fast).
For LoRA training in this quick test I couldn't get the bnb (bitsandbytes) config to work, so I'm running the fine-tune on the full-size model.
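For reference, this is roughly what the LoRA setup looks like on the full-precision model, i.e. skipping the bitsandbytes quantization that wasn't cooperating. The base model name and hyperparameters are placeholder assumptions, not the exact config from this test.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

# Placeholder base model; swap in whatever checkpoint is actually being fine-tuned.
model_id = "Qwen/Qwen2.5-7B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # full-size weights, no bnb 4/8-bit quantization
    device_map="auto",
)

lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # typical attention projections
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # sanity check: only the adapter weights should be trainable
```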
Will share all the install, setup, and settings I've learned in a blog post, together with the 3D design for the cooling shroud.
This week we are releasing the first framework unit in the course and it’s on smolagents. This is what the unit covers:
- why should you use smolagents vs another library?
- how to build agents that use code (quick sketch below)
- how to build multi-agent systems
- how to use vision language models for browser use
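For a taste of the "agents that use code" part, here's a minimal sketch with smolagents. Exact class names and default tools vary a bit between smolagents versions (e.g. HfApiModel was later renamed InferenceClientModel), so treat this as illustrative rather than the course's reference code.

```python
from smolagents import CodeAgent, DuckDuckGoSearchTool, HfApiModel

# HfApiModel() defaults to a hosted model via the HF Inference API.
model = HfApiModel()

agent = CodeAgent(
    tools=[DuckDuckGoSearchTool()],  # the agent writes and executes Python that calls this tool
    model=model,
)

agent.run("How many seconds would it take a leopard at full speed to run through Pont des Arts?")
```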
The team has been working flat out on this for a few weeks. Led by @sergiopaniego and supported by smolagents author @m-ric.