Integrates AWS Bedrock's multimodal capabilities (Claude 3) into the Docling framework for generating image descriptions within document processing pipelines.
Updated Apr 28, 2025 - Python
A serverless solution to streamline ESG compliance using AI-driven automation. Built with the AWS CDK (Python), Amazon Textract, Amazon Bedrock, and other AWS services to process and analyse compliance reports.
pRISM is a repository that combines Retrieval-Augmented Generation (RAG) with a multi-LLM voting approach to produce accurate and reliable AI-generated outputs. It integrates multiple language models, including Mistral, Claude 3.5, and OpenAI models, and uses consensus techniques across them to improve output quality.
Distributed GCS-to-GCS multilingual PDF processing service built for horizontal scaling and concurrency. It can be deployed with Docker Compose for high-volume processing.
AI-powered system for summarizing PDF content with Armenian, Russian, and English language support. Automatically extracts and summarizes text, applies OCR to images, and identifies visual elements in documents. Built for efficient multilingual PDF processing.
Customized LangChain Azure Document Intelligence loader for table extraction and summarization
A fast, flexible API for extracting text from PDFs and images using smart file detection and OCR—perfect for automating your document workflows.
AI-powered invoice processing system using Google Document AI - Automated AP workflows with CI/CD pipeline for enterprise finance operations