SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14 β’ 108
view article Article Welcome to Inference Providers on the Hub π₯ By julien-c and 6 others β’ Jan 28 β’ 484
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy By medmekk and 5 others β’ Sep 18, 2024 β’ 253
Building and better understanding vision-language models: insights and future directions Paper β’ 2408.12637 β’ Published Aug 22, 2024 β’ 132
Gemma 2: Improving Open Language Models at a Practical Size Paper β’ 2408.00118 β’ Published Jul 31, 2024 β’ 78
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published Apr 22, 2024 β’ 257
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper β’ 2402.17764 β’ Published Feb 27, 2024 β’ 619
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 121 items β’ Updated Jan 31, 2024 β’ 546
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis Paper β’ 2307.12856 β’ Published Jul 24, 2023 β’ 36
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper β’ 2307.09288 β’ Published Jul 18, 2023 β’ 242