Thank you to the InclusionAI team for this excellent open-source model; both the inference speed and the metrics are incredible. Can I run inference on an RTX 3090 with 24 GB of VRAM via vLLM?
Yes, you can.
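As a rough sketch, an offline-inference setup on a single 24 GB card might look like the following. The model id is a placeholder (substitute the actual InclusionAI repo), and the `dtype`, `max_model_len`, and `gpu_memory_utilization` values are assumptions you may need to tune for the specific checkpoint:

```python
# Minimal vLLM sketch for a single 24 GB GPU (RTX 3090).
from vllm import LLM, SamplingParams

MODEL_ID = "inclusionAI/<model-id>"  # placeholder: use the actual repo id

llm = LLM(
    model=MODEL_ID,
    dtype="bfloat16",             # half precision to fit in 24 GB (assumption)
    max_model_len=8192,           # cap context length to bound KV-cache memory (assumption)
    gpu_memory_utilization=0.90,  # leave a little headroom on the card
)

outputs = llm.generate(
    ["Hello, who are you?"],
    SamplingParams(temperature=0.7, max_tokens=256),
)
print(outputs[0].outputs[0].text)
```

If you want an OpenAI-compatible server instead, the equivalent would be roughly `vllm serve <model-id> --dtype bfloat16 --max-model-len 8192 --gpu-memory-utilization 0.9`, again with the same caveats about tuning the values for your checkpoint.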