scb10x
/

typhoon-ocr-7b

Image-Text-to-Text

vision-language

document-understanding

text-generation-inference

Model card Files Files and versions Community

kunato commited on 2 days ago

Commit

9df43b5

·

verified ·

1 Parent(s): 3d45510

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -105,7 +105,7 @@ from openai import OpenAI
 from PIL import Image
 from typhoon_ocr.ocr_utils import render_pdf_to_base64png, get_anchor_text
-PROMPTS_SYS = {
     "default": lambda base_text: (f"Below is an image of a document page along with its dimensions. "
         f"Simply return the markdown representation of this document, presenting tables in markdown format as they naturally appear.\n"
         f"If the document contains images, use a placeholder like dummy.png for each image.\n"
@@ -128,7 +128,7 @@ def get_prompt(prompt_name: str) -> Callable[[str], str]:
     :param prompt_name: The identifier for the desired prompt.
     :return: The system prompt as a string.
     """
-    return PROMPTS_SYS.get(prompt_name, lambda x: "Invalid PROMPT_NAME provided.")
@@ -209,7 +209,7 @@ print(text_output[0])
 This model only works with the specific prompts defined below, where `{base_text}` refers to information extracted from the PDF metadata using the `get_anchor_text` function from the `typhoon-ocr` package. It will not function correctly with any other prompts.
 ```python
-PROMPTS_SYS = {
     "default": lambda base_text: (f"Below is an image of a document page along with its dimensions. "
         f"Simply return the markdown representation of this document, presenting tables in markdown format as they naturally appear.\n"
         f"If the document contains images, use a placeholder like dummy.png for each image.\n"

 from PIL import Image
 from typhoon_ocr.ocr_utils import render_pdf_to_base64png, get_anchor_text
+PROMPTS = {
     "default": lambda base_text: (f"Below is an image of a document page along with its dimensions. "
         f"Simply return the markdown representation of this document, presenting tables in markdown format as they naturally appear.\n"
         f"If the document contains images, use a placeholder like dummy.png for each image.\n"
     :param prompt_name: The identifier for the desired prompt.
     :return: The system prompt as a string.
     """
+    return PROMPTS.get(prompt_name, lambda x: "Invalid PROMPT_NAME provided.")
 This model only works with the specific prompts defined below, where `{base_text}` refers to information extracted from the PDF metadata using the `get_anchor_text` function from the `typhoon-ocr` package. It will not function correctly with any other prompts.
 ```python
+PROMPTS = {
     "default": lambda base_text: (f"Below is an image of a document page along with its dimensions. "
         f"Simply return the markdown representation of this document, presenting tables in markdown format as they naturally appear.\n"
         f"If the document contains images, use a placeholder like dummy.png for each image.\n"