# Multimodal-VLM-Thinking / requirements.txt
accelerate
albumentations==1.4.0
av
docling-core
gradio
pillow
numpy
huggingface_hub
loguru==0.7.3
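# opencv-python and opencv-python-headless both install the `cv2` module and
# are pinned to different versions here; normally only one should be kept.
opencv-python==4.11.0.86
opencv-python-headless==4.5.5.64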
pymupdf==1.25.5
qwen-vl-utils
requests==2.32.3
spaces
timm==0.5.4
torch==2.1.0
torchvision
transformers==4.47.0
transformers-stream-generator