代码库
Python
Python
datasets resource
Open-source multimodal data annotation platform with AI auto-annotation support.
Python
[ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
Python
A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.
Python
ai4sciencediffusiondlmdocument-analysisextract-datalayout-analysislladaocrparserpdfpdf-converterpdf-extractor-llmpdf-extractor-pretrainpdf-extractor-ragpdf-parserpython
Agent-native knowledge engine with MCP tools for document indexing, wiki organization, fast retrieval and deep reading across PDF/DOCX/PPTX/Markdown
TypeScript
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
Python
ai4sciencedocument-analysisdocxextract-datalayout-analysisocrparserpdfpdf-converterpdf-extractor-llmpdf-extractor-pretrainpdf-extractor-ragpdf-parserpptxpythonxlsx