Open to Work

22 15 17

Aritra Dutta

dutta18

https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

LLaVa-NeXT

updated a dataset about 1 month ago

dutta18/esnlive

published a dataset about 1 month ago

dutta18/esnlive

View all activity

Organizations

upvoted a collection about 1 month ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 34

upvoted an article about 1 month ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 59

upvoted a collection about 2 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 561

upvoted an article 4 months ago

Article

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Narsil

•

Feb 1, 2022

• 16

upvoted an article 5 months ago

Article

Running Large Transformer Models on Mobile and Edge Devices

tugrulkaya

•

Nov 3, 2025

• 13

upvoted 4 articles 6 months ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

prithivMLmods

•

Feb 17, 2025

• 29

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 293

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

Article

Vision Language Model Alignment in TRL ⚡️

sergiopaniego, merve, qgallouedec, kashif, ariG23498

•

Aug 7, 2025

• 111

upvoted an article 8 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 337

upvoted an article 9 months ago

Article

Fine-tune Llama 2 with DPO

kashif, ybelkada, lvwerra

•

Aug 8, 2023

• 69

upvoted an article 10 months ago

Article

Decoding Strategies in Large Language Models

mlabonne

•

Oct 29, 2024

• 113

upvoted a collection 10 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 673

upvoted an article 10 months ago

Article

Introducing Command A Vision: Multimodal AI built for Business

CohereLabs

•

Jul 31, 2025

• 64

upvoted an article over 1 year ago

Article

SmolVLM - small yet mighty Vision Language Model

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 418

Aritra Dutta

AI & ML interests

Recent Activity

Organizations

dutta18's activity

Multimodal Embedding & Reranker Models with Sentence Transformers

Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

Running Large Transformer Models on Mobile and Edge Devices

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Preference Optimization for Vision Language Models

Vision Language Model Alignment in TRL ⚡️

KV Caching Explained: Optimizing Transformer Inference Efficiency

Fine-tune Llama 2 with DPO

Decoding Strategies in Large Language Models

Introducing Command A Vision: Multimodal AI built for Business

SmolVLM - small yet mighty Vision Language Model