Together

Team

company

Verified

https://together.ai

togethercompute

togethercomputer

Inference Provider

508,871 monthly requests

AI & ML interests

Foundation Models, Decentralized Computing, Open Source AI.

Recent Activity

ionutmodo authored a paper 29 days ago

Error Feedback Can Accurately Compress Preconditioners

ionutmodo authored a paper 29 days ago

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence

ionutmodo authored a paper 29 days ago

SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models

View all activity

Articles

Welcome to Inference Providers on the Hub 🔥

ionutmodo

authored 3 papers 29 days ago

Error Feedback Can Accurately Compress Preconditioners

Paper • 2306.06098 • Published Jun 9, 2023

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence

Paper • 2405.15593 • Published May 24, 2024 • 1

SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models

Paper • 2505.17967 • Published May 23 • 17

JunxiongWang

authored a paper 2 months ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 12

mryab

authored a paper 5 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 64

kezhentogether

authored a paper 7 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

benathi

authored a paper 7 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

VirginiaAdams

authored a paper 7 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

mryab

authored a paper 7 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

pragaash

authored 4 papers 7 months ago

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

Paper • 1911.02557 • Published Nov 6, 2019

A Vocabulary-Free Multilingual Neural Tokenizer for End-to-End Task Learning

Paper • 2204.10815 • Published Apr 22, 2022

Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI

Paper • 2205.00029 • Published Apr 29, 2022

Training-Free Activation Sparsity in Large Language Models

Paper • 2408.14690 • Published Aug 26, 2024

xiaoxiawu123

authored a paper 9 months ago

GRIN: GRadient-INformed MoE

Paper • 2409.12136 • Published Sep 18, 2024 • 16

danielepaliotta

authored a paper 10 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

JunxiongWang

authored a paper 10 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42

rhubarbwu

authored 2 papers 12 months ago

NeuralArTS: Structuring Neural Architecture Search with Type Theory

Paper • 2110.08710 • Published Oct 17, 2021

Towards One Shot Search Space Poisoning in Neural Architecture Search

Paper • 2111.07138 • Published Nov 13, 2021

mryab

authored a paper about 1 year ago

Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees

Paper • 2110.03313 • Published Oct 7, 2021 • 1

rhubarbwu

authored a paper about 1 year ago

Poisoning the Search Space in Neural Architecture Search

Paper • 2106.14406 • Published Jun 28, 2021