jakep-allenai's picture
Update README.md
70beee1 verified
metadata
language:
  - en
license: apache-2.0
datasets:
  - allenai/olmOCR-mix-0225
base_model:
  - Qwen/Qwen2-VL-7B-Instruct
library_name: transformers
olmOCR Logo

olmOCR-7B-0225-preview-FP8

This is the official FP8 quantized version of olmOCR-7B-0225-preview for use with the olmOCR pipeline.

Be sure you have olmOCR v0.1.75 or newer and run:

# Download a sample PDF
curl -o olmocr-sample.pdf https://olmocr.allenai.org/papers/olmocr_3pg_sample.pdf

# Convert it to markdown
python -m olmocr.pipeline ./localworkspace --markdown --pdfs olmocr-sample.pdf --model allenai/olmOCR-7B-0225-preview-FP8

License and use

olmOCR is licensed under the Apache 2.0 license. olmOCR is intended for research and educational use. For more information, please see our Responsible Use Guidelines.