Latest SOTA models supported on Qualcomm NPU.
AI & ML interests
On Device AI Deployment and Research
Organization Card
Welcome to the Nexa AI org on Hugging Face!
Nexa AI is an on-device AI deployment and research company. We craft optimized foundation models and an on-device inference framework that runs any model on any device, across any backend, within minutes. Our mission is to make on-device AI friction-free and production-ready.
On this page you'll find:
- Our own trained checkpoints
- Hand-picked community models in GGUF or MLX formats, ready to run on nexa-sdk
Resources
- Download nexaSDK: get models up and running locally in minutes
- Discord Community
- Slack Community
Spaces (5)
- Nexa Omni Demo: Generate text from audio input (Running, 64 likes)
- Omnivlm Dpo Demo: Ask questions about images and get detailed answers (Running, 79 likes)
- Open LLM Leaderboard for domains: Ranking for Open-sourced LLMs in different domains (Running on CPU Upgrade, 30 likes)
- Nexa AI GGUF Convertor: Submit a model for quantization and receive an email notification (Running on CPU Upgrade, 37 likes)
Models (102)
- NexaAI/smolVLA-npu (10 downloads)
- NexaAI/Qwen3-8B-NPU: Text Generation
- NexaAI/Qwen2.5-Omni-3B-GGUF: Any-to-Any, 3B (455 downloads, 2 likes)
- NexaAI/Gemma3-1B-ANE (22 downloads, 1 like)
- NexaAI/Qwen3-0.6B-ANE (25 downloads, 1 like)
- NexaAI/Granite-4-Micro-ANE: Text Generation (25 downloads, 1 like)
- NexaAI/jina-v2-rerank-npu-mobile (14 downloads, 1 like)
- NexaAI/paddleocr-npu-mobile: Image-to-Text (13 downloads, 1 like)
- NexaAI/parakeet-tdt-0.6b-v3-npu-mobile: Automatic Speech Recognition (17 downloads, 2 likes)
- NexaAI/embeddinggemma-300m-npu-mobile (18 downloads, 1 like)