Markit_v2 / src

Commit History

Update embedding model to Google Generative AI and enhance vector store functionality
4dfec96
Running

AnseMin commited on

Refactor LimitedEnsembleRetriever for improved compatibility and functionality
5da24ca

AnseMin commited on

Enhance vector store retrieval with limited results
9e9e9ff

AnseMin commited on

Refactor OCR configuration in DoclingParser to use EasyOCR exclusively
18e6067

AnseMin commited on

Enhance DoclingParser for CPU-only processing and improved error handling
5e0609f

AnseMin commited on

Implement ZeroGPU support in DoclingParser for enhanced document processing
d66e90c

AnseMin commited on

Enhance README and parser functionality for improved document processing
4a97b0c

AnseMin commited on

Integrate Gemini API for enhanced image processing in MarkItDown
033e4ba

AnseMin commited on

Refactor UI components for modular architecture and enhance functionality
6ea41ec

AnseMin commited on

Refactor document ingestion and chunking to support LaTeX content
63279a9

AnseMin commited on

Enhance UI with new Query Ranker feature and improve document search capabilities
623ad58

AnseMin commited on

Add advanced retrieval strategies and update dependencies for RAG implementation
21c909d

AnseMin commited on

Enhance Docling and Mistral OCR parsers with improved response handling and logging
c61b4e2

AnseMin commited on

Enhance multi-document processing capabilities in parsers
d437733

AnseMin commited on

Implement multi-document processing capabilities and enhance UI
111954a

AnseMin commited on

Add data clearing service and vector store management
f46dfbd

AnseMin commited on

Refactor document ingestion and output file handling
3f1b4af

AnseMin commited on

Import configuration from core module in gemini_flash_parser.py to enhance parser functionality.
63f3b68

AnseMin commited on

Update .gitignore and enhance README with data management instructions
a4f1c9e

AnseMin commited on

Enhance RAG (Retrieval-Augmented Generation) functionality and dependencies
575f1c7

AnseMin commited on

Add Docling support for advanced document processing
c0c51c2

AnseMin commited on

Implement environment-based UI launch configuration in main.py
57f6aa0

AnseMin commited on

Refactor and enhance application structure for Markit_v2
a773878

AnseMin commited on

Minor UI edit
55627c9

AnseMin commited on

New feature: Mistral OCR
98482ce

AnseMin commited on

Modifying the UI
49c5606

AnseMin commited on

Change in UI : the bottom margin was white which was ugly so it's being change
7022f7f

AnseMin commited on

Initial Implementation of Markitdown. Implemented:
dbdd7c8

AnseMin commited on

Approach #2 -- converting latex output from GOT OCR to markdown
5b7f920

AnseMin commited on

restore to version 1
23ad33e

AnseMin commited on

Tabular is not defined
34d180e

AnseMin commited on

Latex2Markdown display changes --attemtp1
33f1b65

AnseMin commited on

Error: Error processing document with GOT-OCR: cannot pickle '_thread.lock' object
4cac30a

AnseMin commited on

ERROR - Failed to load GOT-OCR model: CUDA must not be initialized in the main process on Spaces with Stateless GPU environment.
3415bc4

AnseMin commited on

runtime error fix
36e49b4

Ansemin101 commited on

Please work
4fab3b3

AnseMin commited on

handling zerogpu usage
610b772

AnseMin commited on

enable zerogpu
62f9c09

AnseMin commited on

CUDA dfloat 16 issue again
dcdb7ac

AnseMin commited on

fixing the missing path of run_ocr_2.0.py by editing got_ocr_parser.py
7d77a56

AnseMin commited on

changes on logging for better debugging
5bb2b30

AnseMin commited on

missing run_ocr_2.0.py file
c9c21c7

AnseMin commited on

change in strategy --implementing github got ocr instead of hugging face model
0f5865d

AnseMin commited on

script to convert markdown to latex, changing UI output to fit right with got ocr
ad248f7

AnseMin commited on

restore to check point
2184c47

AnseMin commited on

failed to load got ocr model
f89451e

AnseMin commited on

Error: Too many output
fa54d05

AnseMin commited on

Error: Error processing document with GOT-OCR: GOTQwenForCausalLM.chat() got an unexpected keyword argument 'format'
c4c3253

AnseMin commited on

adding format=true
1312a63

AnseMin commited on

complete reimplementation of got ocr
3332d94

AnseMin commited on