H company

Team

company

Verified

https://www.hcompany.ai/

hcompany_ai

hcompai

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

hamza-hcompany updated a model 19 days ago

Hcompany/Holo2-30B-A3B

plcedoz38 authored a paper 20 days ago

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

plcedoz38 updated a model 26 days ago

Hcompany/Holo2-8B

View all activity

Papers

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

View all Papers

Articles

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Jun 3

•

sergiopaniego

posted an update about 6 hours ago

Post

We just released TRL v0.26.0!

It comes packed with updates:
> Agent training with tools in GRPO
> New CISPO & SAPO losses + reasoning rewards
> vLLM quantization in colocate mode
> Dataset shuffling in SFT
> Lots of NEW examples
> Tons of fixes and documentation improvements

2 replies

sergiopaniego

posted an update 1 day ago

Post

1864

NEW: @EssentialAI just released Rnj-1, their first 8B model.

You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact mode

Free Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynb

More free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks

sergiopaniego

posted an update 5 days ago

Post

2753

Want to get started with fine-tuning but don’t know where to begin? 🤓☝️

We’re expanding our collection of beginner-friendly free Colab notebooks so you can learn and fine-tune models using TRL at no cost

🔬 Check out the full list of free notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks

🔬 If you want more advanced content, we also have a lot to cover in the community tutorials: https://huggingface.co/docs/trl/community_tutorials

And now the obvious question: what would you like us to add next?

sergiopaniego

posted an update 7 days ago

Post

2301

NEW: @mistralai released a fantastic family of multimodal models, Ministral 3.

You can fine-tune them for free on Colab using TRL ⚡️, supporting both SFT and GRPO

Link to the notebooks:
- SFT: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_ministral3_vl.ipynb
- GRPO: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_ministral3_vl.ipynb
- TRL and more examples: https://huggingface.co/docs/trl/index

2 replies

sergiopaniego

posted an update 8 days ago

Post

2135

ICYMI, transformers v5 is out!

Grab a coffee ☕ and go read the announcement blog https://huggingface.co/blog/transformers-v5

sergiopaniego

posted an update 9 days ago

Post

3072

want to use open models easily through an API?

Inference Providers might be exactly what you’re looking for sooo here’s a complete beginner-friendly walkthrough 🧐

https://www.youtube.com/watch?v=oxwsizy1Spw

2 replies

sergiopaniego

posted an update 13 days ago

Post

1718

nanochat is now in transformers!

The LLM by @karpathy is officially in the library, and we wrote a blog covering: how did we port the model, differences from the original, and how to run or train it.

go read it 🤓

nanochat-students/transformers

sergiopaniego

posted an update 15 days ago

Post

3941

you gotta go fast and go read the latest blog by @ror et al. explaining Continuous Batching in depth

https://huggingface.co/blog/continuous_batching

sergiopaniego

posted an update 16 days ago

Post

1707

Interested in RL training environments?

We just released a beginner-friendly walkthrough notebook!

Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM.

happy learning! 🌱

Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb

OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv

hamza-hcompany

updated a model 19 days ago

Hcompany/Holo2-30B-A3B

Image-Text-to-Text • 31B • Updated 19 days ago • 1.66k • 36

sergiopaniego

posted an update 20 days ago

Post

325

Ya está disponible el vídeo de la charla del otro día en @nerdearla sobre IA abierta, por si queréis verla! 🤠

https://www.youtube.com/watch?v=p-JLn4xAkMw

1 reply

plcedoz38

authored a paper 20 days ago

Surfer 2: The Next Generation of Cross-Platform Computer Use Agents

Paper • 2510.19949 • Published Oct 22 • 38

sergiopaniego

posted an update 21 days ago

Post

2578

we've just added several example scripts to TRL showing how to train models with GRPO using some of the new OpenEnv environments

train a model to interact with a browser (🎮 BrowserGym Env), play Wordle (🎮 Wordle Env) and moooore!

TRL (GRPO + vLLM) + OpenEnv! ⚡️

📝 go play with them: https://github.com/huggingface/trl/tree/main/examples/scripts/openenv

📝 examples list: https://huggingface.co/docs/trl/main/en/example_overview#scripts