LLM Model VRAM Calculator
Calculate VRAM requirements for running large language models
GGUF, Prompt gen, Repo tools, followed by "Bench" and "Leaderboards". Leader boards get more specific going down. See also: "Run LLMs ..." collection.
Calculate VRAM requirements for running large language models
Refine your prompts
Create and quantize Hugging Face models
Request custom GGML quantized models via email
Edit GGUF metadata on Hugging Face or locally
Convert your PEFT LoRA into GGUF
Copy a Hugging Face repository
Convert and upload Hugging Face models to MLX format
View EQ-Bench Leaderboard for LLMs
Note A specialized Bench for evaluation of the creativity of a model with testing outputs shown as well as judgements / ratings including a model's "emotional intelligence".
Uncensored General Intelligence Leaderboard
Note Uncensored General Intelligence. Another great source for creative and/or role play models.
Track, rank and evaluate open LLMs and chatbots
Compare model answers to questions
Compact LLM Battle Arena: Frugal AI Face-Off!
View LLM Performance Leaderboard
Compare Open LLM Leaderboard results
Run a Streamlit web app
Display chatbot leaderboard and stats
View and filter leaderboard scores for AI models
Display and filter model evaluation results
Embedding Leaderboard
A leaderboard for multimodal models
Explore LLM performance across hardware
Track, rank and evaluate open LLMs' CoT quality
Visualize model performance with interactive plots and tables
Track, rank and evaluate open LLMs and chatbots
Ranking for Open-sourced LLMs in different domains
More advanced and challenging multi-task evaluation
Blind vote on HF TTS models!
Explore and analyze code evaluation data
Search and submit code models for evaluation
Image Generation and Image Editing Arena & Leaderboard
Explore speech recognition model performance with filters
Request evaluation for a speech model
Display OCR model leaderboard and evaluation data
Text to Video and Image to Video Arena & Leaderboard
VLMEvalKit Evaluation Results Collection
Submit model evaluation and view leaderboard
Browse and submit LLM evaluations
Explore and filter language model benchmark results
Browse Q-Bench leaderboard for vision model performance
Display and filter LLM benchmark results
View and filter LLM leaderboard data
View and submit LLM evaluations
Explore and compare QA and long doc benchmarks
View and submit machine learning model evaluations
Browse and submit model evaluations in LLM benchmarks